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Method and nucleic acids for the improved treatment of breast cell proliferative disorders 
Field of the Invention 



In American women, breast cancer is the most frequently diagnosed cancer and the second 
leading cause of cancer death. In women aged 40-55, breast cancer is the leading cause of 
death (Greenlee et al. 9 2000). In 2002, there were 204,000 new cases of breast cancer in the 
US (data from the American Society of Clinical Oncology) and a comparable number in 
Europe. 

Breast cancer is defined as the uncontrolled proliferation of cells within breasts tissues. 
Breasts are comprised of 15 to 20 lobes joined together by ducts. Cancer arises most 
commonly in the duct, but is also found in the lobes with the rarest type of cancer termed 
inflammatory breast cancer. It will be appreciated by those skilled in the art that there exists a 
continuing need to improve methods of early detection, classification and treatment of breast 
cancers. In contrast to the detection of some other common cancers such as cervical and 
dermal there are inherent difficulties in classifying and detecting breast cancers. 
Due to current screening programs and the accessibility of this cancer to self-examination, 
breast cancer is diagnosed comparatively early: in about 93% of all newly diagnosed cases, 
the cancer has not yet metastasized, and in 65% of cases, even the lymph nodes are not yet 
affected.. 

The first step of any treatment is the assessment of the patient's condition comparative to 
defined classifications of the disease. However the value of such a system is inherently 
dependant upon the quality of the classification. Breast cancers are staged according to their 
size, location and occurrence of metastasis. Methods of treatment include the use of surgery, 
radiation therapy, chemotherapy and endocrine therapy, which are also used as adjuvant 
therapies to surgery. 

Although the vast majority of early cancers are operable, i.e. the tumor can be completely 

' r 

removed by surgery, about one third of the patients with lymph-node negative diseases and 
about 50-60% of patients with node-positive disease will develop metastases during follow- 
up. 
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Based on this observation, systemic adjuvant treatment has been introduced for both node- 
positive and node-negative breast cancers. Systemic adjuvant therapy is administered after 
surgical removal of the tumor, and has been shown to reduce the risk of recurrence 
significantly (Early Breast Cancer Trialists' Collaborative Group, 1998). Several types of 
adjuvant treatment are available: endocrine treatment (for hormone receptor positive tumors), 
different chemotherapy regimens, and novel agents like Herceptin. 

The growth of the majority of breast cancers (appr. 70-80%) is dependent on the presence of 
estrogen. Therefore, one important target for adjuvant therapy is the removal of estrogen (e.g. 
by ovarian ablation) or the blocking of its actions on the tumor cells (e.g. Tamoxifen). 
Endocrine treatment is thought to be efficient only in tumors that express hormone receptors 
(the estrogen receptor, ER, and/or the progesterone receptor, PR). Currently, the vast majority 
of women with hormone receptor positive breast cancer receive some form of endocrine 
treatment, independent of their nodal status. The most frequently used drug is Tamoxifen. 
However, even in hormone receptor positive patients, not all patients benefit from endocrine 
treatment. Adjuvant endocrine therapy reduces mortality rates by 22% while response rates to 
endocrine treatment in the advanced setting are 50 to 60% (Jordan et al^ 2002, Jordan et aL, 
1999, Osborne et aL, 1998, European Breast Cancer Cooperative Group, 1998). 
Since Tamoxifen has relatively few side effects, treatment may be justified even for patients 
with low likelihood of benefit. However, these patients may require additional, more 
aggressive adjuvant treatment. This is supported by the fact that, even in earliest and least 
aggressive tumors, such as node-negative, hormone receptor positive tumours, about 21 % of 
patients relapse within 10 years after initial diagnosis if they receive Tamoxifen monotherapy 
as adjuvant treatment (Early Breast Cancer Trialists Collaborative Group. Lancet, 1998). 
Several cytotoxic regimens have shown to be effective in reducing the risk of relapse in breast 
cancer (Mansour et aL, 1998). According to current treatment guidelines, most node-positive 
patients receive adjuvant chemotherapy both in the US and Europe, since the risk of relapse is 
considerable. Nevertheless, not all patients do relapse, and there is a proportion of patients 
who would never have relapsed even without chemotherapy, but who nevertheless receive 
chemotherapy due to the currently used criteria. In hormone receptor positive patients, 
chemotherapy is usually given before endocrine treatment, whereas hormone receptor 
negative patients receive only chemotherapy. 

The situation for node-negative patients is particularly complex. In the US, cytotoxic 
chemotherapy is recommended for node-negative patients, if the tumor is larger than 1 cm. In 
Europe^chemotherapy is considered for the node-negative cases if one or more risk-factors — 
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such as tumor size larger than 2 cm, negative hormone receptor status, or tumor grading of 
three or age <35 is present. In general, there is a tendency to select premenopausal women for 
additional chemotherapy whereas for postmenopausal women, chemotherapy is often omitted. 
Compared to endocrine treatment, in particular Tamoxifen, chemotherapy is highly toxic, 
with short-term side effects such as nausea, vomiting, bone marrow depression, and long-term 
effects such as cardiotoxicity and an increased risk for secondary cancers. 

It is currently not clear which breast cancer patients should be selected for more aggressive 
therapy, although clinicians agree that there is a need for a subset of patients. The difficulty of 
selecting the right patients for chemotherapy, and the lack of suitable criteria is also reflected 
by a recent study which showed that chemotherapy is used much less frequently than 
recommended, based on data from the New Mexico Tumor registry (Du et ah, 2003). This 
study provides substantial evidence that there is a need for better selection of patients for 
chemotherapy or other, more aggressive forms of breast cancer therapy. 

The levels of observation that have been studied by the methodological developments of 
recent years in molecular biology, are the genes themselves, the translation of these genes into 
RNA, and the resulting proteins. The question of which gene is switched on at which point in 
the course of the development of an individual, and how the activation and inhibition of 
specific genes in specific cells and tissues are controlled is correlatable to the degree and 
character of the methylation of the genes or of the genome. In this respect, pathogenic 
conditions may manifest themselves in a changed methylation pattern of individual genes or 
of the genome. 

DNA methylation plays a role, for example, in the regulation of the transcription, in genetic 
imprinting, and in tumorigenesis. Therefore, the identification of 5-methylcytosine as a 
component of genetic information is of considerable interest. However, 5-methylcytosine 
positions cannot be identified by sequencing since 5-methylcytosine has the same base pairing 
behaviour as cytosine. Moreover, the epigenetic information carried by 5-methylcytosine is 
completely lost during PGR amplification. 

■ 

The currently most frequently used method for analysing DNA for 5-methylcytosine is based 
upon the specific reaction of bisulfite with cytosine which, upon subsequent alkaline 
hydrolysis, is converted to uracil which corresponds to thymine in its base pairing behaviour. 



3 



-4- 



However, 5-methylcytosine remains unmodified under these conditions. Consequently, the 
original DNA is converted in such a manner that methylcytosine, which originally could not 
be distinguished from cytosine by its hybridisation behaviour, can now be detected as the only 
remaining cytosine using "normal" molecular biological techniques, for example, by 
amplification and hybridisation or sequencing. All of these techniques are based on base 
pairing which can now be fully exploited. In terms of sensitivity, the prior art is defined by a 
method which encloses the DNA to be analysed in an agarose matrix, thus preventing the 
diffusion and renaturation of the DNA (bisulfite only reacts with single-stranded DNA), and 
which replaces all precipitation and purification steps with fast dialysis (Olek A, Oswald J, 
Walter J. A modified and improved method for bisulphite based cytosine methylation 
analysis. Nucleic Acids Res. 1996 Dec 15;24(24):5064-6). Using this method, it is possible to 
analyse individual cells, which illustrates the potential of the method. However, currently 
only individual regions of a length of up to approximately 3000 base pairs are analysed, a 
global analysis of cells for thousands of possible methylation events is not possible. However, 
this method cannot reliably analyse very small fragments from small sample quantities either. 
These are lost through the matrix in spite of the diffusion protection. 

An overview of the further known methods of detecting 5-methylcytosine may be gathered 
from the following review article: Rein, T., DePamphilis, M. L., Zorbas, H., Nucleic Acids 
Res. 1998, 26, 2255. 

To date, barring few exceptions (e.g., Zeschnigk M, Lich C, Buiting K, Doerfler W, 
Horsthemke B. A single-tube PCR test for the diagnosis of Angelman and Prader-Willi 
syndrome based on allelic methylation differences at the SNRPN locus. Eur J Hum Genet. 
1997 Mar- Apr ; 5 (2) : 94- 8) the bisulfite technique is only used in research. Always, however, 
short, specific fragments of a known gene are amplified subsequent to a bisulfite treatment 
and either completely sequenced (Olek A, Walter J. The pre- implantation ontogeny of the 
H19 methylation imprint. Nat Genet. 1997 Nov;17(3):275-6) or individual cytosine positions 
are detected by a primer extension reaction (Gonzalgo ML, Jones PA. Rapid quantitation of 
methylation differences at specific sites using methylation-sensitive single nucleotide primer 
extension (Ms-SNuPE). Nucleic Acids Res. 1997 Jun 15;25(12):2529-31, WO 95/00669) or 
by enzymatic digestion (Xiong Z, Laird PW. COBRA: a sensitive and quantitative DNA 
methylation assay. Nucleic Acids Res. 1997 Jun 15;25(12):2532-4). In addition, detection by 
- hybridisation has also been described (Olek et al:, WO 99/28498); 
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Further publications dealing with the use of the bisulfite technique for methylation detection 
in individual genes are: Grigg G, Clark S. Sequencing 5-methylcytosine residues in genomic 
DNA. Bioessays. 1994 Jun;16(6):431-6, 431; Zeschnigk M, Schmitz B, Dittrich B, Buiting K, 
Horsthemke B, Doerfler W. Imprinted segments in the human genome: different DNA 
methylation patterns in the Prader-Willi/Angelman syndrome region as determined by the 
genomic sequencing method. Hum Mol Genet. 1997 Mar;6(3):3 87-95; Feil R, Charlton J, 
Bird AP, Walter J, Reik W. Methylation analysis on individual chromosomes: improved 
protocol for bisulphite genomic sequencing. Nucleic Acids Res. 1994 Feb 25 ;22(4): 695-6; 
Martin V, Ribieras S, Song- Wang X, Rio MC, Dante R. Genomic sequencing indicates a 
correlation between DNA hypomethylation in the 5' region of the pS2 gene and its expression 
in human breast cancer cell lines. Gene. 1995 May 19;157(l-2):261-4; WO 97/46705, WO 
95/15373, and WO 97/45560. 

An overview of the Prior Art in oligomer array manufacturing can be gathered from a special 
edition of Nature Genetics (Nature Genetics Supplement, Volume 21, January 1999), 
published in January 1999, and from the literature cited therein. 

Fluorescently labelled probes are often used for the scanning of immobilised DNA arrays. 
The simple attachment of Cy3 and Cy5 dyes to the 5-OH of the specific probe are particularly 
suitable for fluorescence labels. The detection of the fluorescence of the hybridised probes 
may be carried out, for example via a confocal microscope. Cy3 and Cy5 dyes, besides many 
others, are commercially available. 

Matrix Assisted Laser Desorption Ionisation Mass Spectrometry (MALDI-TOF) is a very 
efficient development for the analysis of biomolecuies (Karas M, Hillenkamp F. Laser 
desorption ionisation of proteins with molecular masses exceeding 10,000 daltons. Anal 
Chem. 1988 Oct 15;60(20):2299-301). An analyte is embedded in a light-absorbing matrix. 
The matrix is evaporated by a short laser pulse thus transporting the analyte molecule into the 
vapour phase in an unfragmented manner. The analyte is ionised by collisions with matrix 
molecules. An applied voltage accelerates the ions into a field-free flight tube. Due to their 
different masses; the ions are accelerated at different rates. Smaller ions feach the detector 
sooner than bigger ones. • ■ • • f ■ ■ - v 
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MALDI-TOF spectrometry is excellently suited to the analysis of peptides and proteins. The 
analysis of nucleic acids is somewhat more difficult (Gut I G, Beck S. DNA and Matrix 
Assisted Laser Desorption Ionization Mass Spectrometry, Current Innovations and Future 
Trends. 1995, 1; 147-57). The sensitivity to nucleic acids is approximately 100 times worse 
than to peptides and decreases disproportionally with increasing fragment size. For nucleic 
acids having a multiply negatively charged backbone, the ionisation process via the matrix is 
considerably less efficient. In MALDI-TOF spectrometry, the selection of the matrix plays an 
eminently important role. For the desorption of peptides, several very efficient matrixes have 
been found which produce a very fine crystallisation. There are now several responsive 
matrixes for DNA, however, the difference in sensitivity has not been reduced. The difference 
in sensitivity can be reduced by chemically modifying the DNA in such a manner that it 
becomes more similar to a peptide. Phosphorothioate nucleic acids in which the usual 
phosphates of the backbone are substituted with thiophosphates can be converted into a 
charge-neutral DNA using simple alkylation chemistry (Gut IG, Beck S. A procedure for 
selective DNA alkylation and detection by mass spectrometry. Nucleic Acids Res. 1995 Apr 
25; 23(8): 1367-73). The coupling of a charge tag to this modified DNA results in an increase 
in sensitivity to the same level as that found for peptides. A further advantage of charge 
tagging is the increased stability of the analysis against impurities which make the detection 
of unmodified substrates considerably more difficult. 

Genomic DNA is obtained from DNA of cell, tissue or other test samples using standard 
methods. This standard methodology is found in references such as Sambrook, Fritsch and 
Maniatis, Molecular Cloning: A Laboratory Manual, CSH Press, 2nd edition, 1989: Isolation 
of genomic DNA from mammalian cells, Protocol I, p. 9.16 - 9.19. Also the manuals of 
several DNA extraction kits such as the QIAamp DNA mini kit give a good guidance on how 
to isolate genomic DNA. 
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Currently several predictive markers are under evaluation. As up to now most patients have 
received Tamoxifen as endocrine treatment most of the markers have been shown to be 
associated with response or resistance to tamoxifen. However, it is generally assumed that 
there is a large overlap between responders to one or the other endocrine treatment. In fact, 
ER and PR expression are used to select patients for any endocrine treatment. Among the 
markers which have been associated with TAM response is bcl-2. High bcl-2 levels showed 
promising correlation to TAM therapy response in patients with metastatic disease and 
prolonged survival and added valuable information to an ER negative patient subgroup (J Clin 
Oncology, 1997, 15 5: 1916-1922; Endocrine, 2000, 13(1):1-10). There is conflicting 
evidence regarding the independent predictive value of c-erbB2 (Her2/neu) overexpression in 
patients with advanced breast cancer that require further evaluation and verification (British J 
of Cancer, 1999, 79 (7/8): 1220- 1226; J Natl Cancer Inst, 1998, 90 (21): 1601-1608). 

Other predictive markers include SRC-1 ( steroid receptor coactivator-1), CGA gene over 
expression, cell kinetics and S phase fraction assays ( Breast Cancer Res and Treat, 1998, 
48:87-92; Oncogene, 2001, 20:6955-6959). Recently, uPA ( Urokinase-type plasminogen 
activator) and PAI-1 (Plasminogen activator inhibitor type 1) together showed to be useful to 
define a subgroup of patients who have worse prognosis and who would benefit from 
adjuvant systemic therapy (J Clinical Oncology, 2002, 20 n° 4). However, all of these markers 
need further evaluations in prospective trials as none of them is yet a validated marker of 
response. 

A number of cancer-associated genes have been shown to be inactivated by hypermethylation 
of CpG islands during breast tumorigenesis. Decreased expression of the calcium binding 
protein S100A2 (Accession number NM_005978) has been associated with the development 
of breast cancers. Hypermethylation of the promoter region of this gene has been observed in 
neoplastic cells thus providing evidence that S100A2 repression in tumour cells is mediated 
by site-specific methylation. 

The SYK gene (Accession number NM_003177) encodes a protein tyrosine kinase, Syk 
(spleen tyrosine kinase), that is highly expressed in hematopoietic cells. Syk is expressed in 
normal breast ductal epithelial cells but not in a subset of invasive breast carcinoma. Also, the 
loss of Syk expression seems to be associated with malignant phenotypes such as increased 
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motility and invasion. The loss of expression occurs at the transcriptional level, and, as 
indicated by Yuan Y, Mendez R, Sahin A and Dai JL (Hypermethylation leads to silencing of 
the SYK gene in human breast cancer. Cancer Res. 2001 Jul 15;61(14):5558-61.)» as a result 
of DNA hypermethylation. 

The TGF-6 type 2 receptor (encoded by the TGFBR2 gene, NMJD03242) plays a role in 
trans-membrane signalling pathways via a complex of serine/threonine kinases. Mutations in 
the gene have been detected in some primary tumours and in several types of tumour-derived 
cell lines, including breast (Lucke CD, Philpott A, Metcalfe JC, Thompson AM, Hughes- 
Davies L, Kemp PR, Hesketh R. 'Inhibiting mutations in the transforming growth factor beta 
type 2 receptor in recurrent human breast cancer.' Cancer Res. 2001 Jan 15;61(2):482-5.). 

The genes COX7A2L and GRIN2D were both identified as novel estrogen responsive 
elements by Watanabe et. al. (Isolation of estrogen-responsive genes with a CpG island 
library. Molec. Cell. Biol. 18: 442-449, 1998.) using the CpG-GBS (genomic binding site) 
method. The gene COX7A2L (Accession number NM_004718) encodes a polypeptide 2-like 
cytochrome C oxidase subunit VIIA. Northern blot analysis detected an upregulation of 
COX7A2L after estrogen treatment of a breast cancer cell line. The gene GRDM2D (Accession 
number NM__000836) encodes the N-methyl-D-aspartate, ionotropic, subunit 2D glutamate 
receptor, a subunit of the NMDA receptor channels associated with neuronal signalling. 
Furthermore expression of the cDNA has been observed in an osteosarcoma cell line* The 
gene VTN (also known as Vitronectin Accession number NM_000638) encodes a 75-kD 
glycoprotein (also called serum spreading factor or complement S -protein) that promotes 
attachment and spreading of animal cells in vitro, inhibits cytolysis by the complement C5b-9 
complex, and modulates antithrombin III- thrombin action in blood coagulation. Furthermore 
expression of this gene has been linked to progression and invasiveness of cancer cells. 

The gene SFN (also known as Stratifin) encodes a polypeptide of the 14-3-3 family, 14-3-3 
sigma. The 14-3-3 family of proteins mediates signal transduction by binding to 
phosphoserine-containing proteins. Expression of the SFN gene is lost in breast carcinomas, 
this is likely due to hypermethylation during the early stages of neoplastic transformation (see 
JJmbricht CB, Evron E, Gabrielson E, Ferguson A, Marks J, Sukumar S. Hypermethylation of 
14-3-3 sigma (stratifin) is an early event in breast cancer. Oncogene. 2001 Jun 7; 
-20(26)3348,53). - - — „ . _ . . 
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The gene PS ATI (Accession number NM_021154) is not to be confused with the gene 
popularly referred to as PxySA (Accession number NM_001648) which encodes prostate 
specific antigen and whose technically correct name is kallikreixi 3 . The gene PSAT1 
encodes the protein phosphoserine aminotransferase which is the second step-catalysing 
enzyme in the serine biosynthesis pathway. Changes in gene expression levels have been 
monitored by mRNA expression analysis and upregulation of the gene has been identified in 
colonic carcinoma in a study of 6 samples (Electrophoresis 2002 Jun;23(ll): 1667-76 mRNA 
differential display of gene expression in colonic carcinoma.Ojala P, Sundstrom J, Gronroos 
JM, Virtanen E, Talvinen K, Nevalainen TJ). 

The gene stathmin (NM_005563) codes for an oncoprotein 18, also known as stathmin, a 
conserved cytosolic phosphoprotein that regulates microtubule dynamics. The protein is 
highly expressed in a variety of human malignancies. In human breast cancers the stathmin 
gene has shown to be up-regulated in a subset of the tumours. 

The gene PRKCD encodes a member of the family of protein kinase c enzymes, and is 
involved in B cell signaling and in the regulation of growth, apoptosis, and differentiation of a 
variety of cell types. 

Some of these molecules interact in a cascade-like manner. PRKCD activity that targets 
STMN1 is modulated by SFN binding and SYK phosphorylation. Together this influences 
tubulin polymerization that is required for cell division. 

The gene MSMB (Accession number NMJ302443 ) has been mapped to lOql 1.2. It encodes 
the beta-microseminoprotein (MSP) which is one of the major proteins secreted by the 
prostate. Furthermore, it may be useful as a diagnostic marker for prostate cancer. Using 
mRNA analysis low levels of beta-MSP mRNA expression and protein have been linked to 
progression under endocrine therapy and it has been postulated that it may be indicative of 
potentially aggressive prostate cancer (see Sakai H, Tsurusaki T, Kanda S, Koji T, Xuan JW, 
Saito Y Prognostic significance of beta-microseminoprotein mRNA expression in prostate 
cancer/ Prostate. 1999 Mar l;38(4):278-84.). 
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The gene TP53 (Accession number NM_000546) encodes the protein p53, one of the most 
well characterised tumour suppressor proteins. The p53 protein acts as a transcription factor 
and serves as a key regulator of the cell cycle. Inactivation of this gene through mutation 
disrupts the cell cycle, which, in turn, assists in tumour formation. Methylation changes 
associated with this gene have been reported to be significant in breast cancer. Saraswati eL 
al (Nature 405, 974 - 978 (22 Jun 2000) 'Compromised HOXA5 function can limit p53 
expression in human breast tumours' reported that low levels of p53 mRNA in breast tumours 
was correlated to methylation of the HOXA5 gene. The product of the HOX5A gene binds to 
the promoter region of the p53 and mediates expression of the gene. Methylation of the 
promoter region of the p53 gene itself has been reported (Kang JH, Kim S J, Noh D Y, Park 
IA, Choe KJ, Yoo OJ, Kang HS. 'Methylation in the p53 promoter is a supplementary route to 
breast carcinogenesis: correlation between CpG methylation in the p53 promoter and the 
mutation of the p53 gene in the progression from ductal carcinoma in situ to invasive ductal 
carcinoma.' Lab Invest. 2001 Apr;81(4):573-9.). It was therein demonstrated that CpG 
methylation in the p53 promoter region is found in breast cancer and it was hypothesised that 
methylation in the p53 promoter region could be an alternative pathway to neoplastic 
progression in breast tumours. It has been observed that treatment with Tamoxifen decreases 
the level of expression of the p53 gene (Farczadi E, Kaszas I, Baki M, Szende B . 'Changes in 
apoptosis, mitosis, Her-2, p53 and Bcl2 expression in breast carcinomas after short-term 
tamoxifen treatment.' Neoplasma. 2002;49(2): 101-3.) 

The gene CYP2D6 (Accession number: NM_000106) is a member of the human cytochrome 
P450 (CYP) superfamily. Many members of this family are involved in drug metabolism (see 
for example Curr Drug Metab. 2002 Jun;3(3):289-309. Rodrigues AD, Rushmore TH.), of 
these Cytochrome P450 CYP2D6 is one of the most extensively characterised. It is highly 
polymorphic (more than 70 variations of the gene have been described), and allelic variation 
can result in both increased and decreased enzymatic activity. The CYP2D6 enzyme catalyses 
the metabolism of a large number of clinically important drugs including antidepressants, 
neuroleptics, some antiarrhythmics (Nature 1990 Oct 25;347(6295):773-6 Identification of the 
primary gene defect at the cytochrome P450 CYP2D locus.Gough AC, Miles JS, Spurr NK, 
Moss JE, Gaedigk A, Eichelbaum M, Wolf CR.). 

The gene PTGS2 (Accession number NM_000963) encodes an inducible isozyme of 
prostaglandin-endoperoxide synthase (prostaglandm-endoperoxi 
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known as COX2 (cyclooxygenase 2). Aberrant methylation of this gene has been identified in 
lung carcinomas (Cancer Epidemiol Biomarkers Prev 2002 Mar;l l(3):291-7 Hierarchical 
clustering of lung cancer cell lines using DNA methylation markers . V irmani AK, Tsou J A, 
Siegmund KD, Shen LY, Long TI, Laird PW, Gazdar AF, Laird-Offringa IA.). 

The gene CGA (Accession number NM_000735) encodes the alpha polypetptide of 
glycoprotein hormones. Further, it has been identified as an estrogen receptor alpha (ER 
alpha)-responsive gene and overexpression of the gene has been linked to ER positivity in 
breast tumours. Bieche et al examined mRNA levels of said gene in 125 ER alpha-positive 
postmenopausal breast cancer patients treated with primary surgery followed by adjuvant 
tamoxifen therapy. Initial results indicated significant links between CGA gene 
overexpression and Scarff-Bloom-Richardson histopathological grade I+II and progesterone 
and estrogen receptor positivity, which suggested that CGA is a marker of low tumour 
aggressiveness (Identification of CGA as a Novel Estrogen Receptor-responsive Gene in 
Breast Cancer: An Outstanding Candidate Marker to predict the Response to Endocrine 
TherapyCancer Research* 61, 1652-1658, February 15, 2001, Ivan Bieche, Beatrice Parfait, 
Vivianne Le Doussal, Martine Olivi, Marie-Christine Rio, Rosette Lidereau and Michel 
Vidaud). Further mRNA expression analysis linked CGA expression levels to Tamoxifen 
response, it was postulated that when combined with analysis of the marker ERBB2 (a marker 
of poor response) the gene may be useful as a predictive marker of tamoxifen responsiveness 
in breast cancer (Oncogene 2001 Oct 18;20(47):6955-9 The CGA gene as new predictor of 
the response to endocrine therapy in ER alpha-positive postmenopausal breast cancer patients. 
Bieche I, Parfait B, Nogues C, Andrieu C, Vidaud D, Spyratos F, Lidereau R, Vidaud M.). 
The authors provided significant data associating the expression of the gene CGA with 
Tamoxifen treatment response. However, said analyses have all focused upon the analysis of 
relative levels of mRNA expression. This is not a methodology that is suitable for a medium 
or high throughput, nor is it a suitable basis for the development of a clinical assay. 

The gene PITX2 (NM_000325) encodes the paired-like homeodomain transcription factor 2 
which is known to be expressed during development of anterior structures such as the eye, 
teeth, and anterior pituitary. Although the expression of this gene is associated with cell 
differentiation and proliferation it has no heretofore recognised role in carcinogenesis or 
responsiveness to endocrine treatment. Toyota et ah, (2001. Blood. 97:2823-9.) found 
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hypermethylation of the PITX2 gene in a large proportion of acute myeloid leukemias. 
Furthermore, this hypermethylation is positively correlated to methylation of the ER gene. 

RASSF1A (Ras association domain family 1A ) gene is a candidate tumour suppressor gene at 
3p21.3. The Ras GTPases are a superfamily of molecular switches that regulate cellular 
proliferation and apoptosis in response to extra-cellular signals. It is purported that RASSF1A 
is a tumour suppressor gene, and epigenetic alterations of this gene have been observed in a 
variety of cancers. Methylation of RASSF1A has been associated with poor prognosis in 
primary non-small cell lung cancer (Kim DH, Kim JS, Ji YI, Shim YM, Kim H, Han J, Park 
J., hypermethylation of RASSF1A promoter is associated with the age at starting smoking 
and a poor prognosis in primary non-small cell lung cancer. 5 Cancer Res. 2003 Jul 
l;63(13):3743-6.). It has also been assocaited with the development of pancreatic cancer 
(Kuzmin I, Liu L, Dammann R, Geil L, Stanbridge EJ, Wilczynski SP, Lerman MI, Pfeifer 
GP. 'Inactivation of RAS association domain family 1 A gene in cervical carcinomas and the 
role of human papillomavirus infection.' Cancer Res. 2003 Apr 1 5 ;63(8): 1888-93.), as well as 
testicular tumours and prostate carcinoma amongst others. The application of the methylation 
of this gene as a cancer diagnostic marker has been described in U.S. patent 6,596,488, it does 
not however describe its application in the selection of appropriate treatments regimens for 
patients. 

Also located within 3p21 is the Dystroglycan precursor gene (Dystrophin-associated 
glycoprotein 1) (NM_ 004393). Dystroglycan (DG, also known as DAG1) is an adhesion 
molecule comprising two subunits namely alpha-DG and beta-DG. The molecule is 
responsible for crucial interactions between extracellular matrix and cytoplasmatic 
compartment and it has been hypothesised that as such it may contribute to progression to 
metastatic disease. Decreased expression of this gene has been associated with correlated with 
higher tumour grade and stage in colon, prostate and breast tumours. 

The onecut-2 transcription factor gene (NMJ304852) is located at 18q21.31 is a homeo- 
domain transcription factor regulator of liver gene expression in adults and during 
development. 

The trefoil factor 1 (TFF1) gene (NM_ 003225) encodes a member of the trefoil family of 
-proteins. Thcgene is-also known as pS2 i They-are-normally expressed atfiighest levels in the - 
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mucosa of the gastrointestinal tract, however they are often expressed ectopically in primary 
tumours of other tissues, including breast. The expression of TFF1 is regulated by estrogen in 
estrogen-responsive breast cancer cells in culture, its expression is associated with that of the 
estrogen receptor and TFF1 is a marker of hormone responsiveness in tumours (Schwartz et 
al. 9 1991. pS2 expression and response to hormonal therapy in patients with advanced breast 
cancer. Cancer Res. 51:624-8). TFF1 promoter methylation has been observed in 
nonexpressing gastric carcinoma-derived cell lines and tissues. 

TMEFF2 (NM_016192) encodes a transmembrane protein containing an epidermal growth 
factor (EGF)-like motif and two follistatin domains. It has been shown to be overexpressed in 
prostate and brain tissues and it has been suggested that this is an androgen-regulated gene 
exhibiting antiproliferative effects in prostate cancer cells. 

Methylation of the gene ESR1 (NM_000125), encoding the estrogen receptor has been linked 
to several cancer types including lung, oesophageal, brain and colorectal. The estrogen 
receptor (ESR) is a ligand-activated transcription factor composed of several domains 
important for hormone binding, DNA binding, and activation of transcription. Furthermore, it 
is the direct target of the anti-estrogenic compound Tamoxifen. Only tumours expressing 
estrogen receptor (ER+) can respond on Tamoxifen treatment. 

The PCAF (NM_003884) gene encodes the p300/CBP- Associated Factor (PCAF). CBP and 
p300 are large nuclear proteins that bind to many sequence-specific factors involved in cell 
growth and/or differentiation. The p300/CBP associated factor displays in vivo binding 
activity with CBP and p300. The protein has histone acetyl transferase activity with core 
histones and nucleosome core particles, indicating that it plays a direct role in transcriptional 
regulation. p300/CBP associated factor also associates with NF-kappa-B p65.This protein has 
been shown to regulate expression of the gene p53 by acetylation of Lys320 in the C-terminal 
portion of p53 . 

The WBP1 1 (NM__016312) gene encodes a nuclear protein, which co-localises with mRNA 
splicing factors and intermediate filament-containing perinuclear networks. It contains two 
proline-rich regions that bind to the WW domain of Npw38, a nuclear protein, and thus this 
protein is also called Npw38-binding protein NpwBP. 
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The TBC1 domain family, member 3 gene (TBC1D3, NM_032258) was discovered originally 
as an oncogene, also known as PRC17. The gene product contains a GTPase-activating 
protein (GAP) catalytic core motif and interacts directly with Rab5, stimulating its GTP 
hydrolysis. TBC1D3 is amplified in 15% of prostate cancers and highly overexpressed in 
approximately one-half of metastatic prostate tumors (Pei et aL 9 2002; Cancer Res. 62:5420- 
4). 

The CDK6 gene encodes a cyclin-dependent protein kinase regulating major cell cycle 
transitions in eukaryotic cells. The cdk6 kinase is associated with cyclins Dl, D2, and D3 and 
can phosphorylate pRB, the product of the retinoblastoma tumor suppressor gene. The 
activation of cdk6 kinase occurs during mid-Gl (Meyerson and Harlow, 1994; Mol Cell Biol. 
14:2077-86). 

Description 

In the following certain genetic regions are described for whom no genetic nomenclature is 
presently available. In each case the chromosomal location of the genetic sequence is denoted 
within parentetheses ( ) and the genetic sequence is further described by its sequence 
according to Table 1 . 

The present invention provides methods and nucleic acids for the improved treatment 
planning of patients with cell proliferative disorders of the breast tissues. The aim of the 
invention is achieved by assessment of one or both of two factors of particular relevance to 
patient treatment planning. The first factor is the characterisation of the cell proliferative 
disorder of the breast tissues and/or a metastases thereof in terms of aggresivity, the second 
factor being the prediction of disease free survival and/or response of a subject with said 
disorder to a therapy comprising one or more treatments which target the estrogen receptor 
pathway or are involved in estrogen metabolism, production or secretion. Said treatments 
include, but are not limited to estrogen receptor modulators, estrogen receptor down- 
regulators, aromatase inhibitors, ovarian ablation, LHRH analogues and other centrally acting 
drugs influencing estrogen production. 

The prediction of response to a therapeutic regimen comprising one or more treatments which 
target the estrogen receptor pathway or are involved in estrogen metabolism, production or 
secretion (a current treatment of choice as side effects are limited) further enables the 
physician to determine if additional treatments will be requked m 
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treatment. Treatments which may be used in addition to or instead of said treatment include, 
but are not limited to chemotherapy, radiotherapy, surgery, biological therapy, 
immunotherapy, antibodies and molecularly targeted drugs. 

Characterisation of a breast cancer in terms of its predicted aggressiveness enables the 
physician to make an informed decision as to a therapeutic regimen with appropriate risk and 
benefit trade offs to the patient. Aggressiveness is taken to mean one or more of decreased 
patient survival or disease- or relapse-free survival, increased tumor-related complications and 
faster progression of tumor or metastases. According to the aggressiveness of the disease an 
appropriate treatment or treatments may be selected from the group consisting of 
chemotherapy, radiotherapy, surgery, biological therapy, immunotherapy, antibody 
treatments, treatments involving molecularly targeted drugs, estrogen receptor modulator 
treatments, estrogen receptor down-regulator treatments, aromatase inhibitors treatments, 
ovarian ablation, treatments providing LHRH analogues or other centrally acting drugs 
influencing estrogen production. Wherein a cancer is characterised as 'aggressive' it is 
particularly preferred that a treatment such as, but not limited to, chemotherapy is provided in 
addition to or instead of an endocrine targeting therapy. 

Using the methods and nucleic acids described herein, statistically significant models of 
patient disease free survival and/or responsiveness to treatment and/or disease progression can 
be developed and utilised to assist patients and clinicians in determining suitable treatment 
options to be included in the therapeutic regimen. In one aspect the described method is to be 
used to assess the utility of therapeutic regimens comprising one or more treatments which 
target the estrogen receptor pathway or are involved in estrogen metabolism, production or 
secretion as a therapy for patients suffering from a cell proliferative disorder of the breast 
tissues. In particular this aspect of the method enables the physician to determine which 
treatments may be used in addition to or instead of said treatment. In a further aspect the 
described method enables the characterisation of the cell proliferative disorder in terms of 
agressiveness, thereby enabling the physician to recommend suitable treatments. Thus, the 
present invention will be seen to reduce the problems associated with present breast cell 
proliferative disorder treatment response prediction methods. 

Using the methods and nucleic acids as described herein, patient responsiveness can be 

evaluated before or during treatment for a cell proliferative disorder of the breast tissues, in 

- 1 .■»**■■ \ 1 ' " 

order to provide critical information to the patient and clinician as to the likely progression of 
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the disease. It will be appreciated, therefore, that the methods and nucleic acids exemplified 
herein can serve to improve a patient's quality of life and odds of treatment success by 
allowing both patient and clinician a more accurate assessment of the patient's treatment 
options. 

The method according to the definition may be used for the improved treatment of all breast 
cell proliferative disorder patients, both pre and post menopausal and independant of their 
node or estrogen receptor status. However, it is particularly preferred that said patients are 
node-negative and estrogen receptor positive. 

The aim of the invention is most preferably achieved by means of the analysis of the 
methylation patterns of one or a combination of genes taken from the group taken from the 
group EGR4, APC, CDKN2A, CSPG2, ERBB2, STMN1, STK11, CA9, PAX6, SFN, 
S100A2, TFF1, TGFBR2, TP53, TP73, PLAU, TMEFF2, ESR1, SYK, HSPB1, RASSF1, 
TES, PITX2, GRIN2D, PSAT1, CGA, CYP2D6, COX7A2L, ESR2, PLAU, VTN, 
SULT1A1, PCAF, PRKCD, ONECUT2, BCL6, WBP11, (MX1), N.N., APP, ORC4L, 
NETOl, TBC1D3, GRB7, CYP2D6, CDK6, (Chr. Ipl3.2), (Chr. 17q25.1), ABCA8, (Chr. 
12ql4.3), (Chr. 8ql2.1), MARK2, ELK1, Q8WUT3, CGB, BSG, BCKDK, SOX8, DAG1, 
SEMA4B and ESR1 (exon8) (see Table 1) and/or their regulatory regions. 

The invention is characterised in that the nucleic acid of one or a combination of genes taken 
from the group EGR4, APC, CDKN2A, CSPG2, ERBB2, STMN1, STK11, CA9, PAX6, 
SFN, S100A2, TFF1, TGFBR2, TP53, TP73, PLAU, TMEFF2, ESR1, SYK, HSPB1, 
RASSF1, TES, PITX2, GRIN2D, PSAT1, CGA, CYP2D6, COX7A2L, ESR2, PLAU, VTN, 
SULT1A1, PCAF, PRKCD, ONECUT2, BCL6, WBP11, (MX1), N.N., APP, ORC4L, 
NETOl, TBC1D3, GRB7, CYP2D6, CDK6, (Chr. Ipl3.2), (Chr. 17q25.1), ABCA8, (Chr. 
12ql4.3), (Chr. 8ql2.1), MARK2, ELK1, Q8WUT3, CGB, BSG, BCKDK, SOX8, DAG1, 
SEMA4B and ESR1 (exon8) are contacted with a reagent or series of reagents capable of 
distinguishing between methylated and non methylated CpG dinucleotides within the genomic 
sequence of interest. 

The present invention makes available a method for the improved treatment and monitoring 
of breast cell proliferative disorders, by enabling the accurate prediction of a patient's disease 
free survival and/or response to treatment with a therapy comprising one or more treatments 
which target the estrogen receptor pathway or are involved in estrogen metabolism, 
-production, or- secretion^ - 
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Iji a particularly preferred embodiment, the method according to the invention enables the 
differentiation between patients who have a high probability of response to said therapy and 
those who have a low probability of response to said therapy or a methylation characterisitc 
predicted disease free survival time, in addition to the characterisation of tumors in terms of 
aggresiveness. 

The method according to the invention may be used for the analysis of a wide variety of cell 
proliferative disorders of the breast tissues including, but not limited to, ductal carcinoma in 
situ., invasive ductal carcinoma, invasive lobular carcinoma, lobular carcinoma in situ, 
comedocarcinoma, inflammatory carcinoma, mucinous carcinoma, scirrhous carcinoma, 
colloid carcinoma, tubular carcinoma, medullary carcinoma, metaplastic carcinoma, and 
papillary carcinoma and papillary carcinoma in situ, undifferentiated or anaplastic carcinoma 
and Paget' s disease of the breast. 

The method according to the invention is particularly suited to the prediction of response to 
the aforementioned therapy in two treatment settings. In one embodiment, the method is 
applied to patients who receive endocrine pathway targeting treatment as secondary treatment 
to an initial non chemotherapeutical therapy, e.g. surgery (hereinafter referred to as the 
adjuvant setting) as illustrated in Figure 1. Such a treatment is often prescribed to patients 
suffering from Stage 1 to 3 breast carcinomas. In this embodiment patients disease free 
survival times are predicted according to their by detecting patients with worse disease free 
survival times the physician may choose to recommend the patient for further treatment, 
instead of or in addition to the endocrine targetting therapy(s), in particular but not limited to, 
chemotherapy. 

In a further preferred embodiment said method is applied to patients suffering from a relapse 
of breast cancer following treatment by a primary means (preferably surgery) followed by a 
disease free period, and wherein the endocrine pathway targeting treatment has been 
prescribed in response to a detection of a relapse of the carcinoma. Such a treatment is often 
prescribed to patients suffering from later stage carcinomas, particularly wherein metastasis 
has occurred. Therefore this clinical setting shall also hereinafter be referred to as the 
'metastatic setting'. In this embodiment responders are those who enter partial or complete 
remission i.e. subjects whose cancer recedes to undetectable levels as opposed to those whose 
diseases further metastasise or remain above detectable levels. By detecting patients whose 
cancers are likely to metastasis the physician may choose to recommend the patient for further 
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treatment, instead of or in addition to the endocrine targetting therapy(s), in particular but not 
limited to, chemotherapy. 

This methodology presents further improvements over the state of the art in that the method 
may be applied to any subject, independent of the estrogen and/or progesterone receptor 
status. Therefore in a preferred embodiment, the subject is not required to have been tested for 
estrogen or progesterone receptor status. 

The object of the invention is achieved by means of the analysis of the methylation patterns of 
one or more of the genes EGR4, APC, CDKN2A, CSPG2, ERBB2, STMN1, STK1 1, CA9, 
PAX6, SFN, S100A2, TFF1, TGFBR2, TP53, TP73, PLAU, TMEFF2, ESR1, SYK, HSPB1, 
RASSF1, TES, PITX2, GRIN2D, PSAT1, CGA, CYP2D6, COX7A2L, ESR2, PLAU, VTN, 
SULT1A1, PCAF, PRKCD, ONECUT2, BCL6, WBP11, (MX1), N.N., APP, ORC4L, 
NETOl, TBC1D3, GRB7, CYP2D6, CDK6, (Chr. Ipl3.2), (Chr. 17q25.1), ABCA8, (Chr. 
12ql43), (Chr. 8ql2.1), MARK2, ELK1, Q8WUT3, CGB, BSG, BCKDK, SOX8, DAG1, 
SEMA4B, ESR1 (exon8) and/or their regulatory regions. In a particularly preferred 
embodiment the sequences of said genes comprise SEQ ID NOs: 1-61 and sequences 
complementary thereto. 

The object of the invention may also be achieved by analysing the methylation patterns of one 
or more genes taken from the following subsets of said aforementioned group of genes. In one 
embodiment the object of the invention is the prediction of disease free survival and/or 
probability of response to a treatment which targets the estrogen receptor pathway or are 
involved in estrogen metabolism, production or secretion. This is achieved by analysis of the 
methylation patterns of one or more genes taken from the group consisting ERBB2, STMN1, 
TFF1, TMEFF2, ESR1, HSPB1, PITX2, COX7A2L, PLAU, VTN, PCAF, ONECUT2, 
BCL6, WBP11, TBC1D3, GRB7, CDK6, (Chr. Ipl3.2), ABCA8 and (Chr. 8ql2.1) and 
wherein it is further preferred that the sequence of said genes comprise SEQ ID NOs: 5, 6, 12, 
17, 18, 20, 23, 28, 16, 31, 33, 35, 36, 37, 43, 44, 46, 47, 49 and 51 

respectively according to Table 1. In a further embodiment the object of the invention is the 
characterisation of the tumor in terms of aggresiveness. This is achieved by analysis of the 
methylation patterns of one or more genes taken from the group consisting APC, CSPG2, 
ERBB2, STK11, S100A2, TFF1, TGFBR2, TP53, TMEFF2, SYK, HSPB1, RASSF1, 
PSATl, CGA, ESR2, ONECUT2, WBP11, CYP2D6, CDK6, ELK1, CGB and DAG1 , and 
-wherein it is further preferred that the sequence- of said-genes comprise SEQ ED NOs:-2,-4, 5^— 
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7, 11, 12, 13, 14, 17, 19, 20, 21, 25, 26, 29, 35, 37, 45, 46, 53, 55 and 59 respectively 
according to Table 1. 

In a preferred embodiment said method is achieved by contacting said nucleic acid sequences 
in a biological sample obtained from a subject with at least one reagent or a series of reagents, 
wherein said reagent or series of reagents, distinguishes between methylated and non 
methylated CpG dinucleotides within the target nucleic acid. 

In a preferred embodiment, the method comprises the following steps: 

Preferably, said method comprises the following steps: In the first step, a sample of the tissue 
to be analysed is obtained. The source may be any suitable source, such as cell lines, 
histological slides, biopsies, tissue embedded in paraffin, bodily fluids, ejaculate, urine, blood 
and all possible combinations thereof. In a particularly preferred embodiment of the method 
said source is bodily fluids including post prostatic massage urine, ejaculate, urine, or blood. 
The DNA is then isolated from the sample. Extraction may be by means that are standard to 
one skilled in the art, including the use of commercially available kits, detergent lysates, 
sonification and vortexing with glass beads. Briefly, wherein the DNA of interest is 
encapsulated by a cellular membrane the biological sample must be disrupted and lysed by 
enzymatic, chemical or mechanical means. The DNA solution may then be cleared of proteins 
and other contaminants e.g. by digestion with proteinase K. The genomic DNA is then 
recovered from the solution. This may be carried out by means of a variety of methods 
including salting out, organic extraction or binding of the DNA to a solid phase support. The 
choice of method will be affected by several factors including time, expense and required 
quantity of DNA. 

Once the nucleic acids have been extracted, the genomic double stranded DNA is used in the 
analysis. 

In the second step of the method, the genomic DNA sample is treated in such a manner that 
cytosine bases which are unmethylated at the 5 '-position are converted to uracil, thymine, or 
another base which is dissimilar to cytosine in terms of hybridization behavior. This will be 
understood as 'pretreatment* herein. 

The above-described treatment of genomic DNA is preferably carried out with bisulfite 
(hydrogen sulfite, disulfite) and subsequent alkaline hydrolysis that results in a conversion of 
non-methylated cytosine nucleobases to uracil or to another base that is dissimilar to cytosine 
in terms of base pairing behavior. 
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In the third step of the method, fragments of the pretreated DNA are amplified, using sets of 
primer oligonucleotides according to the present invention, and an amplification enzyme. The 
amplification of several DNA segments can be carried out simultaneously in one and the same 
reaction vessel. Typically, the amplification is carried out using a polymerase chain reaction 
(PGR). The set of primer oligonucleotides includes at least two oligonucleotides whose 
sequences are each reverse complementary, identical, or hybridize under stringent or highly 
stringent conditions to an at least 16-base-pair long segment of the base sequences of one or 
more of SEQ ID NO 206 to 449 and sequences complementary thereto. 
In an alternate embodiment of the method, the methylation status of preselected CpG 
positions within the nucleic acid sequences comprising one or more of SEQ ID NO 1 to 61 
may be detected by use of methylation-specific primer oligonucleotides. This technique 
(MSP) has been described in United States Patent No. 6,265,171 to Herman. The use of 
methylation status specific primers for the amplification of bisulfite treated DNA allows the 
differentiation between methylated and unmethylated nucleic acids. MSP primers pairs 
contain at least one primer that hybridizes to a bisulfite treated CpG dinucleotide. Therefore, 
the sequence of said primers comprises at least one CpG dinucleotide. MSP primers specific 
for non-methylated DNA contain a "T' at the 3' position of the C position in the CpG. 
Preferably, therefore, the base sequence of said primers is required to comprise a sequence 
having a length of at least 9 nucleotides which hybridizes to a pretreated nucleic acid 
sequence according to one of SEQ ID NO 206-449 and sequences complementary thereto, 
wherein the base sequence of said oligomers comprises at least one CpG dinucleotide. 
Wherein the method is for the prediction of probability of disease free survival and/or 
response to a treatment which targets the estrogen receptor pathway or are involved in 
estrogen metabolism, production or secretion it is particularly preferred that said nucleotide 
sequence(s) hybridizes to a pretreated nucleic acid sequence according to one of SEQ ID NO 
70, 71, 192, 193, 72, 73, 194, 195, 84, 85, 206, 207, 94, 95, 216, 217, 96, 97, 218, 219, 100, 
101, 222, 223, 106, 107, 228, 229, 116, 117, 238, 239, 92, 93, 214, 215, 122, 123, 244, 245, 
126, 127, 248, 249, 130, 131, 252, 253, 132, 133, 254, 255, 134, 135, 256, 257, 146, 147, 268, 
269, 148, 149, 270, 271, 152, 153, 274, 275, 154, 155, 276, 277, 158, 159, 280, 281, 162, 163, 

284 and 285 said contiguous nucleotides comprising at least one CpG, TpG or CpA 

« 

dinucleotide sequence. 

Wherein the method is for the the characterisation of the breast cell proliferative disorder in 
terms of aggresiveness it is particularly preferred that said nucleotide sequence(s) hybridizes 
to^-pretreated nucleic-acid sequence according to one of SEQ ID NO 64, 65; 1 86, 1 87, 68, 
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69, 190, 191, 70, 71, 192, 193, 74, 75, 196, 197, 82, 83, 204, 205, 84, 85, 206, 207, 86, 87, 
208, 209, 88, 89, 210, 211, 94, 95, 216, 217, 98, 99, 220, 221, 100, 101, 222, 223, 102, 103, 
224, 225, 110, 111, 232, 233, 112, 113, 234, 235, 118, 119, 240, 241, 130, 131, 252, 253, 134, 
135, 256, 257, 150, 151, 272, 273, 152, 153, 274, 275, 166, 167, 288, 289, 170, 171, 292, 293, 
178, 179, 300, 301, 148, 149, 270, 271, 150, 151, 272, 273, 152, 153, 274, 275, 154, 155, 276, 
277, 156, 157, 278, 279, 158, 159, 280, 281, 160, 161, 282, 283, 162, 163, 284, 285, 164, 165, 
286, 287, 166, 167, 288, 289, 168, 169, 290, 291, 170, 171, 292, 293, 172, 173, 294, 295, 174, 
175, 296, 297, 176, 177, 298, 299, 178, 179, 300, 301, 180, 181, 302, 303, 182, 183, 304 and 
305, said contiguous nucleotides comprising at least one CpG, TpG or CpA dinucleotide 
sequence. 

A further preferred embodiment of the method comprises the use of blocker oligonucleotides. 
The use of such blocker oligonucleotides has been described by Yu et al„ BioTechniques 
23:714-720, 1997. Blocking probe oligonucleotides are hybridized to the bisulfite treated 
nucleic acid concurrently with the PGR primers. PGR amplification of the nucleic acid is 
terminated at the 5 f position of the blocking probe, such that amplification of a nucleic acid is 
suppressed where the complementary sequence to the blocking probe is present. The probes 
may be designed to hybridize to the bisulfite treated nucleic acid in a methylation status 
specific manner. For example, for detection of methylated nucleic acids within a population 
of unmethylated nucleic acids, suppression of the amplification of nucleic acids which are 
unmethylated at the position in question would be carried out by the use of blocking probes 
comprising a 'CpA' or 'TpA' at the position in question, as opposed to a 'CpG' if the 
suppression of amplification of methylated nucleic acids is desired. 

For PGR methods using blocker oligonucleotides, efficient disruption of polymerase-mediated 
amplification requires that blocker oligonucleotides not be elongated by the polymerase. 
Preferably, this is achieved through the use of blockers that are 3'-deoxyoligonucleotides, or 
oligonucleotides derivitized at the 3' position with other than a "free" hydroxyl group. For 
example, 3'-0-acetyl oligonucleotides are representative of a preferred class of blocker 
molecule. 

Additionally, polymerase-mediated decomposition of the blocker oligonucleotides should be 
precluded. Preferably, such preclusion comprises either use of a polymerase lacking 5 '-3* 
exonuclease activity, or use of modified blocker oligonucleotides having, for example, thioate 
bridges at the 5Merminii thereof that render the blocker molecule nuclease-resistant. 
Particular applications may not require such 5* modifications of the blocker. For example, if 
the blocker- and primer-binding sites overlap, thereby precluding binding of the primer (e.g. 9 
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with excess blocker), degradation of the blocker oligonucleotide will be substantially 
precluded. This is because the polymerase will not extend the primer toward, and through (in 
the 5' -3' direction) the blocker — a process that normally results in degradation of the 
hybridized blocker oligonucleotide. 

A particularly preferred blocker/PCR embodiment, for purposes of the present invention and 
as implemented herein, comprises the use of peptide nucleic acid (PNA) oligomers as 
blocking oligonucleotides. Such PNA blocker oligomers are ideally suited, because they are 
neither decomposed nor extended by the polymerase. 

Preferably, therefore, the base sequence of said blocking oligonucleotides is required to 
comprise a sequence having a length of at least 9 nucleotides which hybridizes to a pretreated 
nucleic acid sequence according to one of SEQ ID NO 206-449 , and sequences 
complementary thereto, wherein the base sequence of said oligonucleotides comprises at least 
one CpG, TpG or CpA dinucleotide. 

Wherein the method is for the prediction of probability of disease free survival and/or 
response to a treatment which targets the estrogen receptor pathway or are involved in 
estrogen metabolism, production or secretion it is particularly preferred that said nucleotide 
sequence(s) hybridizes to a pretreated nucleic acid sequence according to one of SEQ ID NO 
70, 71, 192, 193, 72, 73, 194, 195, 84, 85, 206, 207, 94, 95, 216, 217, 96, 97, 218, 219, 100, 
101, 222, 223, 106, 107, 228, 229, 116, 117, 238, 239, 92, 93, 214, 215, 122, 123, 244, 245, 
126, 127, 248, 249, 130, 131, 252, 253, 132, 133, 254, 255, 134, 135, 256, 257, 146, 147, 268, 
269, 148, 149, 270, 271, 152, 153, 274, 275, 154, 155, 276, 277, 158, 159, 280, 281, 162, 163, 
284 and 285, said contiguous nucleotides comprising at least one CpG, TpG or CpA 
dinucleotide sequence. 

Wherein the method is for the the characterisation of the breast cell proliferative disorder in 
terms of aggresiveness it is particularly preferred that said nucleotide sequence(s) hybridizes 
to a pretreated nucleic acid sequence according to one of SEQ ID NO 64, 65, 186, 187, 68, 
69, 190, 191, 70, 71, 192, 193, 74, 75, 196, 197, 82, 83, 204, 205, 84, 85, 206, 207, 86, 87, 
208, 209, 88, 89, 210, 211, 94, 95, 216, 217, 98, 99, 220, 221, 100, 101, 222, 223, 102, 103, 
224, 225, 110, 111, 232, 233, 112, 113, 234, 235, 118, 119, 240, 241, 130, 131, 252, 253, 134, 
135, 256, 257, 150, 151, 272, 273, 152, 153, 274, 275, 166, 167, 288, 289, 170, 171, 292, 293, 
178, 179, 300, 301, 148, 149, 270, 271, 150, 151, 272, 273, 152, 153, 274, 275, 154, 155, 276, 
277, 156, 157, 278, 279, 158, 159, 280, 281, 160, 161, 282, 283, 162, 163, 284, 285, 164, 165, 
286, 287, 166, 167, 288, 289, 168, 169, 290, 291, 170, 171, 292, 293, 172, 173, 294, 295, 174, 
.175, 296, 297__176, 177, 298, 299, 178,479, 300,-301,-1^0, 181,~302 7 303, 182, 183, 304^nd 
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305, said contiguous nucleotides comprising at least one CpG, TpG or CpA dinucleotide 
sequence. 

The fragments obtained by means of the amplification can carry a directly or indirectly 
detectable label. Preferred are labels in the form of fluorescence labels, radionuclides, or 
detachable molecule fragments having a typical mass that can be detected in a mass 
spectrometer. Where said labels are mass labels, it is preferred that the labeled amplificates 
have a single positive or negative net charge, allowing for better detectability in the mass 
spectrometer. The detection may be carried out and visualized by means of, e.g., matrix 
assisted laser desorption/ionization mass spectrometry (MALDI) or using electron spray mass 
spectrometry (ESI). 

Matrix Assisted Laser Desorption/ionization Mass Spectrometry (MALDI-TOF) is a very 
efficient development for the analysis of biomolecules (Karas & Hillenkamp, Anal Chem., 
60:2299-301, 1988). An analyte is embedded in a light-absorbing matrix. The matrix is 
evaporated by a short laser pulse thus transporting the analyte molecule into the vapour phase 
in an unfragmented manner. The analyte is ionized by collisions with matrix molecules. An 
applied voltage accelerates the ions into a field-free flight tube. Due to their different masses, 
the ions are accelerated at different rates. Smaller ions reach the detector sooner than bigger 
ones. MALDI-TOF spectrometry is well suited to the analysis of peptides and proteins. The 
analysis of nucleic acids is somewhat more difficult (Gut & Beck, Current Innovations and 
Future Trends, 1:147-57, 1995). The sensitivity with respect to nucleic acid analysis is 
approximately 100-times less than for peptides, and decreases disproportionally with 
increasing fragment size. Moreover, for nucleic acids having a multiply negatively charged 
backbone, the ionization process via the matrix is considerably less efficient. In MALDI- 
TOF spectrometry, the selection of the matrix plays an eminently important role. For 
desorption of peptides, several very efficient matrixes have been found which produce a very 
fine crystallisation. There are now several responsive matrixes for DNA, however, the 
difference in sensitivity between peptides and nucleic acids has not been reduced. This 
difference in sensitivity can be reduced, however, by chemically modifying the DNA in such 
a mariner that it becomes more similar to a peptide. For example, phosphorothioate nucleic 
acids, in which the usual phosphates of the backbone are substituted with thiophosphates, can 
be converted into a charge-neutral DNA using simple alkylation chemistry (Gut & Beck, 
Nucleic Acids Res. 23: 1367-73, 1995). The coupling of a charge tag to this modified DNA 

results in an increase in MALDI-TOF sensitivity to the same level as that found for peptides. 

> • 
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A further advantage of charge tagging is the increased stability of the analysis against 
impurities, which makes the detection of unmodified substrates considerably more difficult. 
In the fourth step of the method, the amplificates obtained during the third step of the method 
are analysed in order to ascertain the methylation status of the CpG dinucleotides prior to the 
treatment. 

In embodiments where the amplificates were obtained by means of MSP amplification, the 
presence or absence of an amplificate is in itself indicative of the methylation state of the CpG 
positions covered by the primer, according to the base sequences of said primer. 
Amplificates obtained by means of both standard and methylation specific PGR may be 
further analyzed by means of hybridization-based methods such as, but not limited to, array 
technology and probe based technologies as well as by means of techniques such as 
sequencing and template directed extension. 

In one embodiment of the method, the amplificates synthesised in step three are subsequently 
hybridized to an array or a set of oligonucleotides and/or PNA probes. In this context, the 
hybridization takes place in the following manner: the set of probes used during the 
hybridization is preferably composed of at least 2 oligonucleotides or PNA-oligomers; in the 
process, the amplificates serve as probes which hybridize to oligonucleotides previously 
bonded to a solid phase; the non-hybridized fragments are subsequently removed; said 
oligonucleotides contain at least one base sequence having a length of at least 9 nucleotides 
which is reverse complementary or identical to a segment of the base sequences specified in 
the present Sequence Listing; and the segment comprises at least one CpG , TpG or CpA 
dinucleotide. 

In a preferred embodiment, said dinucleotide is present in the central third of the oligomer. 
For example, wherein the oligomer comprises one CpG dinucleotide, said dinucleotide is 
preferably the fifth to ninth nucleotide from the 5' -end of a 13-mer. One oligonucleotide 
exists for the analysis of each CpG dinucleotide within the sequence according to SEQ ID NO 
1 to 61, and the equivalent positions within SEQ ID NO 206-449 (according to Table 1). Said 
oligonucleotides may also be present in the form of peptide nucleic acids. The non- 
hybridized amplificates are then removed.The hybridized amplificates are then detected. In 
this context, it is preferred that labels attached to the amplificates are identifiable at each 
position of the solid phase at which an oligonucleotide sequence is located. 
In yet a further embodiment of the method, the genomic methylation status of the CpG 
positions may be ascertained by means of oligonucleotide probes that are hybridised to the 
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bisulfite treated DNA concurrently with the PGR amplification primers (wherein said primers 
may either be methylation specific or standard). 

A particularly preferred embodiment of this method is the use of fluorescence-based Real 
Time Quantitative PGR (Heid et al„ Genome Res. 6:986-994, 1996; also see United States 
Patent No. 6,331,393) employing a dual-labeled fluorescent oligonucleotide probe 
(TaqMan™ PGR, using an ABI Prism 7700 Sequence Detection System, Perkin Elmer 
Applied Biosystems, Foster City, California). The TaqMan™ PGR reaction employs the use 
of a nonextendible interrogating oligonucleotide, called a TaqMan™ probe, which, in 
preferred imbodiments, is designed to hybridize to a GpC-rich sequence located between the 
forward and reverse amplification primers. The TaqMan™ probe further comprises a 
fluorescent "reporter moiety" and a "quencher moiety" covalently bound to linker moieties 
{e.g., phosphoramidites) attached to the nucleotides of the TaqMan™ oligonucleotide. For 
analysis of methylation within nucleic acids subsequent to bisulfite treatment, it is required 
that the probe be methylation specific, as described in United States Patent No. 6,331,393, 
(hereby incorporated by reference in its entirety) also known as the MethylLight™ assay. 
Variations on the TaqMan™ detection methodology that are also suitable for use with the 
described invention include the use of dual-probe technology (Lightcycler™) or fluorescent 
amplification primers (Sunrise™ technology). Both these techniques may be adapted in a 
maimer suitable for use with bisulfite treated DNA, and moreover for methylation analysis 
within CpG dinucleotides. 

A further suitable method for the use of probe oligonucleotides for the assessment of 
methylation by analysis of bisulfite treated nucleic acids In a further preferred embodiment of 
the method, the fifth step of the method comprises the use of template-directed 
oligonucleotide extension, such as MS-SNuPE as described by Gonzalgo & Jones, Nucleic 
Acids Res. 25:2529-2531, 1997. 

In yet a further embodiment of the method, the fifth step of the method comprises sequencing 
and subsequent sequence analysis of the amplificate generated in the third step of the method 
(Sanger F., et al., Proc Natl Acad Sci USA 74:5463-5467, 1977). 

In one preferred embodiment of the method the nucleic acids according to SEQ ID NO 1 to 
61, are isolated and treated according to the first three steps of the method outlined above, 
namely: 
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a - obtaining, from a subject, a biological sample 

having subject genomic DNA; 

b. extracting or otherwise isolating the genomic 

DNA; 

c - treating the genomic DNA of b), or a fragment 

thereof, with one or more reagents to convert cytosine bases that are 
unmethylated in the 5-position thereof to uracil or to another base that is 
detectably dissimilar to cytosine in terms of hybridization properties; 
and wherein the subsequent amplification of d) is carried out in a methylation specific 
manner, namely by use of methylation specific primers or blocking oligonucleotides, and 
further wherein the detection of the amplificates is carried out by means of a real-time 
detection probes, as described above. 

Wherein the subsequent amplification of d) is carried out by means of methylation specific 
primers, as described above, said methylation specific primers comprise a sequence having a 
length of at least 9 nucleotides which hybridizes to a pretreated nucleic acid sequence 
according to one of SEQ ID NO 206-449 , and sequences complementary thereto, wherein the 
base sequence of said oligomers comprises at least one CpG dinucleotide. 
Wherein the method is for the prediction of disease free survival and/or probability of 
response to a treatment which targets the estrogen receptor pathway or are involved in 
estrogen metabolism, production or secretion it is particularly preferred that said blocking 
oligonucleotide nucleotide sequence(s) hybridizes to a pretreated nucleic acid sequence 
according to one of one of SEQ ID NO 70, 71, 192, 193, 72, 73, 194, 195, 84, 85, 206, 207, 
94, 95, 216, 217, 96, 97, 218, 219, 100, 101, 222, 223, 106, 107, 228, 229, 116, 117, 238, 239, 
92, 93, 214, 215, 122, 123, 244, 245, 126, 127, 248, 249, 130, 131, 252, 253, 132, 133, 254, 
255, 134, 135, 256, 257, 146, 147, 268, 269, 148, 149, 270, 271, 152, 153, 274, 275, 154, 155, 
276, 277, 158, 159, 280, 281, 162, 163, 284 and 285, said contiguous nucleotides comprising 
at least one CpG, TpG or CpA dinucleotide sequence. 

Wherein the method is for the characterisation of the breast cell proliferative disorder in terms 
of aggresiveness it is particularly preferred that said blocking oligonucleotide nucleotide 
sequence(s) hybridizes to a pretreated nucleic acid sequence according to one of SEQ ID NO 
64, 65, 186, 187, 68, 69, 190, 191, 70, 71, 192, 193, 74, 75, 196, 197, 82, 83, 204, 205, 84, 85, 
206, 207, 86, 87, 208, 209, 88, 89, 210, 211, 94, 95, 216, 217, 98, 99, 220, 221, 100, 101, 222, 
223, 102, 103, 224, 225, 110, 111,' 232, 233, 112, 113, 234, 235, 118, 119, 240, 241, 130, 131, 
25^ 2537134, 135, 256; 257,T50 f 'l ~5i72^ 
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171, 292, 293, 178, 179, 300, 301, 148, 149, 270, 271, 150, 151, 272, 273, 152, 153, 274, 275, 
154, 155, 276, 277, 156, 157, 278, 279, 158, 159, 280, 281, 160, 161, 282, 283, 162, 163, 284, 
285, 164, 165, 286, 287, 166, 167, 288, 289, 168, 169, 290, 291, 170, 171, 292, 293, 172, 173, 
294, 295, 174, 175, 296, 297, 176, 177, 298, 299, 178, 179, 300, 301, 180, 181, 302, 303, 182, 
183, 304 and 305, said contiguous nucleotides comprising at least one CpG, TpG or CpA 
dinucleotide sequence. 

Step e) of the method, namely the detection of the specific amplificates indicative of the 
methylation status of one or more CpG positions according to SEQ ID NO 1 to 61 is carried 
out by means of real-time detection methods as described above. 

In an alternative most preferred embodiment of the method the subsequent amplification of d) 
is carried out in the presence of blocking oligonucleotides , as described above. Said blocking 
oligonucleotides comprising a sequence having a length of at least 9 nucleotides which 
hybridizes to a pretreated nucleic acid sequence according to one of SEQ ID NO 206-449 and 
sequences complementary thereto, wherein the base sequence of said oligomers comprises at 
least one CpG, TpG or CpA dinucleotide. Step e) of the method, namely the detection of the 
specific amplificates indicative of the methylation status of one or more CpG positions 
according to SEQ ID NO 206-449 is carried out by means of real-time detection methods as 
described above. 

In a further preferred embodiment of the method the nucleic acids according to SEQ ID NO 1 
to 61 are isolated and treated according to the first three steps of the method outlined above, 
namely: 

a) obtaining, from a subject, a biological sample having subject genomic 
DNA; 

b) extracting or otherwise isolating the genomic DNA; 

c) treating the genomic DNA of b), or a fragment thereof, with 
one or more reagents to convert cytosine bases that are unmethylated in the 
5-position thereof to uracil or to another base that is detectably dissimilar to 
cytosine in terms of hybridization properties; and wherein 

d) amplifying subsequent to treatment in c) is carried out in a methylation 
specific manner, namely by use of methylation specific primers or blocking 
oligonucleotides ; and further wherein 

e) detecting of the amplificates is carried out by means of a real-time 

detection probes, as described above. 

1 , ■ ... - • • 
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Wherein the subsequent amplification of c) is carried out by means of methylation specific 
primers, as described above, said methylation specific primers comprise a sequence having a 
length of at least 9 nucleotides which hybridizes to a pretreated nucleic acid sequence 
according to one of SEQ ID NO 206-449 and sequences complementary thereto, wherein the 
base sequence of said oligomers comprises at least one CpG dinucleotide. Wherein the 
method is for the prediction of disease free survival and/or probability of response to a 
treatment which targets the estrogen receptor pathway or are involved in estrogen 
metabolism, production or secretion it is particularly preferred that said methylation specific 
primers hybridize to a pretreated nucleic acid sequence according to one of one of SEQ ID 
NO 70, 71, 192, 193, 72, 73, 194, 195, 84, 85, 206, 207, 94, 95, 216, 217, 96, 97, 218, 219, 
100, 101, 222, 223, 106, 107, 228, 229, 116, 117, 238, 239, 92, 93, 214, 215, 122, 123, 244, 
245, 126, 127, 248, 249, 130, 131, 252, 253, 132, 133, 254, 255, 134, 135, 256, 257, 146, 147, 
268, 269, 148, 149, 270, 271, 152, 153, 274, 275, 154, 155, 276, 277, 158, 159, 280, 281, 162, 
163, 284 and 285, said contiguous nucleotides comprising at least one CpG, TpG or CpA 
dinucleotide sequence. 

Wherein the method is for the characterisation of the breast cell proliferative disorder in terms 
of aggresiveness it is particularly preferred that said methylation specific primers hybridize to 
a pretreated nucleic acid sequence according to one of SEQ ID NO 64, 65, 186, 187, 68, 69, 
190, 191, 70, 71, 192, 193, 74, 75, 196, 197, 82, 83, 204, 205, 84, 85, 206, 207, 86, 87, 208, 
209, 88, 89, 210, 211, 94, 95, 216, 217, 98, 99, 220, 221, 100, 101, 222, 223, 102, 103, 224, 
225, 110, 111, 232, 233, 112, 113, 234, 235, 118, 119, 240, 241, 130, 131, 252, 253, 134, 135, 
256, 257, 150, 151, 272, 273, 152, 153, 274, 275, 166, 167, 288, 289, 170, 171, 292, 293, 178, 
179, 300, 301, 148, 149, 270, 271, 150, 151, 272, 273, 152, 153, 274, 275, 154, 155, 276, 277, 
156, 157, 278, 279, 158, 159, 280, 281, 160, 161, 282, 283, 162, 163, 284, 285, 164, 165, 286, 
287, 166, 167, 288, 289, 168, 169, 290, 291, 170, 171, 292, 293, 172, 173, 294, 295, 174, 175, 
296, 297, 176, 177, 298, 299, 178, 179, 300, 301, 180, 181, 302, 303, 182, 183, 304 and 305, 
said contiguous nucleotides comprising at least one CpG, TpG or CpA dinucleotide sequence. 
Additional embodiments of the invention provide a method for the analysis of the methylation 
status of genomic DNA according to the invention (SEQ ID NO 1 to 61) , and complements 
thererof) without the need for pretreatment. 

Wherein the method is for the prediction of disease free survival and/or probability of response to 
a treatment which targets the estrogen receptor pathway or are involved in estrogen 
metabolism, production or secretion it is particularly preferred that said genomic sequences 
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are selected from SEQ ID NO 5, 6, 12, 17, 18, 20, 23, 28, 16, 31, 33, 35, 36, 37, 43, 44, 46, 
47,49 and 51. 

Wherein the method is for the characterisation of the breast cell proliferative disorder in 
terms of aggresiveness it is particularly preferred that said genomic sequences are selected 
from SEQ ID NO 2, 4, 5, 7, 11, 12, 13, 14, 17, 19, 20, 21, 25, 26, 29, 35, 37, 45, 46, 53, 55 
and 59. 

In they?™* step of such additional embodiments, the genomic DNA sample is isolated from 
tissue or cellular sources. Preferably, such sources include cell lines, histological slides, body 
fluids, or tissue embedded in paraffin. In the second step, the genomic DNA is extracted. 
Extraction may be by means that are standard to one skilled in the art, including but not 
limited to the use of detergent lysates, sonification and vortexing with glass beads. Once the 
nucleic acids have been extracted, the genomic double-stranded DNA is used in the analysis. 
In a preferred embodiment, the DNA may be cleaved prior to the treatment, and this may be 
by any means standard in the state of the art, in particular with methyl at ion- sens i ti ve 
restriction endonucleases. 

In the third step, the DNA is then digested with one or more methylation sensitive restriction 
enzymes. The digestion is carried out such that hydrolysis of the DNA at the restriction site is 
informative of the methylation status of a specific CpG dinucleotide. 

In the fourth step, which is optional but a preferred embodiment, the restriction fragments are 
amplified. This is preferably carried out using a polymerase chain reaction, and said 
amplificates may carry suitable detectable labels as discussed above, namely fluorophore 
labels, radionucleotides and mass labels. 

In the fifth step the amplificates are detected. The detection may be by any means standard in 
the art, for example, but not limited to, gel electrophoresis analysis, hybridization analysis, 
incorporation of detectable tags within the PGR products, DNA array analysis, MALDI or 
ESI analysis. 

When the methylation status of the selected CpG positions have been ascertained patient 
treatment relevant parameters can be ascertained wherein hypermethylation of the genes is 
associated with poor prognosis of said subject, aggressive characteristics of said cell 
proliferative disorder, poor disease free survival and/or lower probability of response of said 
subject to said treatment as relative to individuals with hypomethylation. 

The term "hypermethylation" refers to the average methylation state corresponding to 
air increased (above average or median) presence of 5-mCyt at one or a plurality of CpG 
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dinucleotides within a DNA sequence of a test DNA sample, relative to the amount of 5-mCyt 
found at corresponding CpG dinucleotides within a control DNA sample. 

The term "hypomethylation" refers to the average methylation state corresponding to a 
decreased (below average or median) presence of 5-mCyt at one or a plurality of CpG 
dinucleotides within a DNA sequence of a test DNA sample, relative to the amount of 5-mCyt 
found at corresponding CpG dinucleotides within a control DNA sample. 

Kits 

Moreover, an additional aspect of the present invention is a kit comprising, for example: a 
bisulfite-containing reagent; a set of primer oligonucleotides containing at least two 
oligonucleotides whose sequences in each case correspond, are complementary, or hybridize 
under stringent or highly stringent conditions to a 16-base long segment of the sequences SEQ 
ID NO: 1 to 61 and 206-449; oligonucleotides and/or PNA-oligomers; as well as instructions 
for carrying out and evaluating the described method. In a further preferred embodiment, said 
kit may further comprise standard reagents for performing a CpG position-specific 
methylation analysis, wherein said analysis comprises one or more of the following 
techniques: MS-SNuPE, MSP, MethyLight ™, HeavyMethyl™ , COBRA, and nucleic acid 
sequencing. However, a kit along the lines of the present invention can also contain only part 
of the aforementioned components. 

Typical reagents {e.g., as might be found in a typical MethyLight®-based kit) for 
MethyLight® analysis may include, but are not limited to: PGR primers for specific gene (or 
methylation-altered DNA sequence or CpG island); TaqMan® probes; optimised PGR buffers 
and deoxynucleotides; and Taq polymerase. 

Typical reagents {e.g., as might be found in a typical Ms-SNuPE-based kit) for Ms-SNuPE 
analysis may include, but are not limited to: PGR primers for specific gene (or methylation- 
altered DNA sequence or CpG island); optimised PCR buffers and deoxynucleotides; gel 
extraction kit; positive control primers; Ms-SNuPE primers for specific gene; reaction buffer 
(for the Ms-SNuPE reaction); and radioactive nucleotides. Additionally, bisulfite conversion 
reagents may include: DNA denaturation buffer; sulfonation buffer; DNA recovery regents or 
kit {e.g., precipitation, ultrafiltration, affinity column); desulfonation buffer; and DNA 
recovery components. 
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Typical reagents (e.g., as might be found in a typical MSP-based kit) for MSP analysis may 
include, but are not limited to: methylated and unmethylated PCR primers for specific gene 
(or methylation-altered DNA sequence or CpG island), optimized PCR buffers and 
deoxynucleotides, and specific probes. 

In order to enable the disclosed method, the invention further provides the modified DNA of 
one or a combination of genes taken from the group EGR4, APC, CDKN2A, CSPG2, 
ERBB2, STMN1, STK11, CA9, PAX6, SFN, S100A2, TFF1, TGFBR2, TP53, TP73, PLAU, 
TMEFF2, ESR1, SYK, HSPB1, RASSF1, TES, PITX2, GRIN2D, PSAT1, CGA, CYP2D6, 
COX7A2L, ESR2, PLAU, VTN, SULT1A1, PCAF, PRKCD, ONECUT2, BCL6, WBP11, 
(MX1), N.N., APP, ORC4L, NETOl, TBC1D3, GRB7, CYP2D6, CDK6, (Chr. Ipl3.2), 
(Chr. 17q25.1), ABCA8, (Chr. 12ql4.3), (Chr. 8ql2.1), MARK2, ELK1, Q8WUT3, CGB, 
BSG, BCKDK, SOX8, DAG1, SEMA4B and ESR1 (exon8) as well as oligonucleotides 
and/or PNA-oligomers for detecting cytosine methylations within said genes. The present 
invention is based on the discovery that genetic and epigenetic parameters and, in particular, 
the cytosine methyl ation patterns of said genomic DNAs are particularly suitable for 
improved treatment and monitoring of breast cell proliferative disorders. 

The nucleic acids according to the present invention can be used for the analysis of genetic 
and/or epigenetic parameters of genomic DNA. 

This objective according to the present invention is achieved using a nucleic acid containing a 
sequence of at least 16 bases in length of the pretreated genomic DNA according to one of 
SEQ ID NO: 206 to SEQ ID NO: 449 and sequences complementary thereto. 

The modified nucleic acids could heretofore not be connected with the improved treatment of 
breast cell proliferative disorders by prediction of disease free survival and/or probability of 
response to treatment and/or characterisaton of the disease in terms of aggressiveness. 

i 

b 

: . ' . ! 

The object of the present invention is further achieved by an oligonucleotide or oligomer for 
the analysis of pretreated DNA, for detecting the genomic cytosine methylation state v said 
oligonucleotide containing at least one base sequence having a length of at least 10 
nucleotides which hybridises to a pretreated genomic DNA according to SEQ ID NO: 206 to 
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SEQ ED NO: 449 . The oligomer probes according to the present invention constitute 
important and effective tools which, for the first time, make it possible to ascertain specific 
genetic and epigenetic parameters during the analysis of biological samples for features 
associated with a patient's disease free survival and/or response to endocrine treatment. Said 
oligonucleotides allow the improved treatment and monitoring of breast cell proliferative 
disorders. The base sequence of the oligomers preferably contains at least one CpG or TpG 
dinucleotide. The probes may also exist in the form of a PNA (peptide nucleic acid) which 
has particularly preferred pairing properties. Particularly preferred are oligonucleotides 
according to the present invention in which the cytosine of the CpG dinucleotide is within the 

middle third of said oligonucleotide e.g. the 5 th - 9 th nucleotide from the 5' -end of a 13-mer 
oligonucleotide; or in the case of PNA-oligomers, it is preferred for the cytosine of the CpG 

dinucleotide to be the 4 th - 6 th nucleotide from the 5 '-end of the 9-mer. 

The oligomers according to the present invention are normally used in so called "sets" which 
contain upto two oligomers and up to one oligomer for each of the CpG dinucleotides within 
SEQ ID NO: 206 to SEQ ID NO: 449 . 

In the case of the sets of oligonucleotides according to the present invention, it is preferred 
that at least one oligonucleotide is bound to a solid phase. It is further preferred that all the 
oligonucleotides of one set are bound to a solid phase. 

The present invention further relates to a set of at least 2 n (oligonucleotides and/or PNA- 
oligomers) used for detecting the cytosine methylation state of genomic DNA, by analysis of 
said sequence or treated versions of said sequence (of the genes EGR4, APC, CDKN2A, 
CSPG2, ERBB2, STMN1, STK11, CA9, PAX6, SFN, S100A2, TFF1, TGFBR2, TP53, 
TP73, PLAU, TMEFF2, ESR1, SYK, HSPB1, RASSF1, TES, PITX2, GRIN2D, PSAT1, 
CGA, CYP2D6, COX7A2L, ESR2, PLAU, VTN, SULT1A1, PCAF, PRKCD, ONECUT2, 
BCL6, WBP11, (MX1), N.N., APP, ORC4L, NETOl, TBC1D3, GRB7, CYP2D6, CDK6, 
(Chr. Ipl3.2), (Chr. 17q25.1), ABCA8, (Chr. 12ql4.3), (Chr. 8ql2.1), MARK2, ELK1, 
Q8WUT3, CGB, BSG, BCKDK, SOX8, DAG1, SEMA4B, ESR1 (exon8) as detailed in the 
sequence listing and Table 1) and sequences complementary thereto). These probes enable 
improved treatment and monitoring of breast cell proliferative disorders. 
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It will be obvious to one skilled in the art that the method according to the invention will be 
improved and supplemented by the incorporation of markers and clinical indicators known in 
the state of the art and currently used as predictive of the outcome of therapies which target 
endocrine or endocrine associated pathways. More preferably said markers include node 
status, age, menopausal status, grade, estrogen and progesterone receptors. 

The genes that form the basis of the present invention may be used to form a "gene panel", i.e. 
a collection comprising the particular genetic sequences of the present invention and/or their 
respective informative methylation sites. The formation of gene panels allows for a quick and 
specific analysis of specific aspects of breast cancer treatment. The gene panel(s) as described 
and employed in this invention can be used with surprisingly high efficiency for the treatment 
of breast cell proliferative disorders by prediction of the outcome of treatment with a therapy 
comprising one or more drugs which target the estrogen receptor pathway or are involved in 
estrogen metabolism, production, or secretion. The analysis of each gene of the panel 
contributes to the evaluation of patient responsiveness, however, in a less preferred 
embodiment the patient evaluation may be achieved by analysis of only a single gene. The 
analysis of a single member of the 'gene panel' would enable a cheap but less accurate means 
of evaluating patient responsiveness, the analysis of multiple members of the panel would 
provide a rather more expensive means of carrying out the method, but with a higher accuracy 

i 

(the technically preferred solution). 

The efficiency of the method according to the invention is improved when applied to patients 
who have not been treated with chemotherapy. Accordingly, it is a particularly preferred 
embodiment of the method wherein the method is used for the assessment of subjects who 
have not undergone chemotherapy. 

According to the present invention, it is preferred that an arrangement of different 
oligonucleotides and/or PNA-oligomers (a so-called "array") made available by the present 
invention is present in a manner that it is likewise bound to a solid phase. This array of 
different oligonucleotide- and/or PNA-oligomer sequences can be characterised in that it is 
arranged on the solid phase in the form of a rectangular or hexagonal lattice. The solid phase 

surface is preferably composed of silicon, glass, polystyrene, aluminium, steel, iron, copper, 

• i . , 

nickel, silver, or gold. However, nitrocellulose as well as plastics such as nylon which can 
exist in the form of pellets or also as resin matrices are suitable alternatives. 
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Therefore, a further subject matter of the present invention is a method for manufacturing an 
array fixed to a carrier material for the improved treatment and monitoring of breast cell 
proliferative disorders. In said method at least one oligomer according to the present invention 
is coupled to a solid phase. Methods for manufacturing such arrays are known, for example, 
from US Patent 5,744305 by means of solid-phase chemistry and photolabile protecting 
groups. 

A further subject matter of the present invention relates to a DNA chip for the improved 
treatment and monitoring of breast cell proliferative disorders. The DNA chip contains at least 
one nucleic acid according to the present invention. DNA chips are known, for example, in 
US Patent 5,837,832. 

The oligomers according to the present invention or arrays thereof as well as a kit according 
to the present invention are intended to be used for the improved treatment and monitoring of 
breast cell proliferative disorders. According to the present invention, the method is 
preferably used for the analysis of important genetic and/or epigenetic parameters within 
genomic DNA, in particular for use in improved treatment and monitoring of breast cell 
proliferative disorders. 

The methods according to the present invention are used, for improved treatment and 
monitoring of breast cell proliferative disorder by enabling more informed therapeutic 
regimens. 

The present invention moreover relates to the diagnosis and/or prognosis of events which are 
disadvantageous or relevant to patients or individuals in which important genetic and/or 
epigenetic parameters within genomic DNA, said parameters obtained by means of the 
present invention may be compared to another set of genetic and/or epigenetic parameters, the 
differences serving as the basis for the diagnosis and/or prognosis of events which are 
disadvantageous or relevant to patients or individuals. 
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In the context of the present invention the term "hybridisation" is to be understood as a bond 
of an oligonucleotide to a completely complementary sequence along the lines of the Watson- 
Crick base pairings in the sample DNA, forming a duplex structure. 

In the context of the present invention, "genetic parameters" are mutations and 
polymorphisms of genomic DNA and sequences further required for their regulation. To be 
designated as mutations are, in particular, insertions, deletions, point mutations, inversions 
and polymorphisms and, particularly preferred, SNPs (single nucleotide polymorphisms). 

In the context of the present invention the term "methylation state" is taken to mean the 
degree of methylation present in a nucleic acid of interest, this may be expressed in absolute 
or relative terms i.e. as a percentage or other numerical value or by comparison to another 
tissue and therein described as hypermethylated, hypomethylated or as having significantly 
similar or identical methylation status. 

In the context of the present invention the term "regulatory region" of a gene is taken to mean 
nucleotide sequences which affect the expression of a gene. Said regulatory regions may be 
located within, proximal or distal to said gene. Said regulatory regions include but are not 
limited to constitutive promoters, tissue-specific promoters, developmental-specific 
promoters, inducible promoters and the like. Promoter regulatory elements may also include 
certain enhancer sequence elements that control transcriptional or translational efficiency of 
the gene. 

In the context of the present invention the term "chemotherapy" is taken to mean the use of 
drugs or chemical substances to treat cancer. This definition excludes radiation therapy 
(treatment with high energy rays or particles), hormone therapy (treatment with hormones or 
hormone analogues (synthetic substitutes) and surgical treatment. 

In the context of the present invention, "epigenetic parameters" are, in particular, cytosine 
methylations and further modifications of DNA bases of genomic DNA and sequences further 
required for their regulation. Further epigenetic parameters include, for example, the 
acetylation of histones which, cannot be directly analysed using the described method but 
which, in turn, correlates with the DNA methylation. 

T 
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In the context of the present invention the term "adjuvant treatment" is taken to mean a 
therapy of a cancer patient immediately following an initial non chemotherapeutical therapy, 
e.g. surgery. In general, the purpose of an adjuvant therapy is to provide a significantly 
smaller risk of recurrences compared without the adjuvant therapy. 

In the context of the present invention the term "estrogen and/or progesterone receptor 
positive" is taken to mean cells that express on their surface receptors that are susceptible to 
the binding of estrogens and/or progesterones. 

While the present invention has been described with specificity in accordance with certain of 
its preferred embodiments, the following examples and figures serve only to illustrate the 
invention and is not intended to limit the invention within the principles and scope of the 
broadest interpretations and equivalent configurations thereof. 

Figure 1 shows a preferred application of the method according to the invention. The X axis 
shows the tumour(s) mass, wherein the line f 3' shows the limit of detectability. The Y-axis 
shows time. Accordingly said figure illustrates a simplified model of endocrine treatment of 
an Stage 1-3 breast tumour wherein primary treatment was surgery (at point 1), followed by 
adjuvant therapy with Tamoxifen. In a first scenario a responder to treatment (4) is shown as 
remaining below the limit of detectability for the duration of the observation. A non responder 
to the treatment (5) has a period of disease free survival (2) followed by relapse when the 
carcinoma mass reaches the level of detectability. 

Figure 2 shows another preferred application of the method according to the invention. The X 
axis shows the turnouts) mass, wherein the line '3' shows the limit of detectability. The Y- 
axis shows time. Accordingly said figure illustrates a simplified model of Endocrine treatment 
of an late stage breast tumour wherein primary treatment was surgery (at point 1), followed by 
relapse which is treated by Tamoxifen (2). In a first scenario a responder to treatment (4) is 
shown as remaining below the limit of detectability for the duration of the observation. A non 
responder to the treatment (5) does not recover from the relapse. J 
Figures 3 to 45 show the Kaplan-Meier estimated disease-free survival curves for single genes 
or oligonucleotide positions. The black plot shows the proportion of disease free patients in 
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the population with above median methylation levels, the grey plot shows shows the 
proportion of disease free patients in the population with below median methylation levels 

Figure 46 shows the methylation analysis of CpG islands according to Example 1. CpG 
islands per gene were grouped and their correlation with objective response determined by 
Hotelling's T 2 statistics. Black dots indicate the P- value of the indicated gene. The 20 most 
informative genes, ranked from left to right with increasing P-value, are shown. The top 
dotted line marks the uncorrected significance value (P < 0.05). The lower dotted line marks 
significance after false discovery rate correction of 25%. All genes with a P- value smaller or 
equal to the gene with the largest P- value that is below the lower line (in this case COX7 A2L) 
are considered significant. The FDR correction chosen guarantees that the identified genes are 
with 75% chance true discoveries. 

Figures 48 shows a ranked matrix of the best 1 1 amplificates of data obtained according to 
Example 1 (Metastatic setting, limited sample set). P- values were calculated from Likelihood 
ratio (LR) tests from multivariate logistic regression models. The figure is shown in 
greyscale, wherein the most significant CpG positions are at the bottom of the matrix with 
significance decreasing towards the top. Black indicates total methylation at a given CpG 
position, white represents no methylation at the particular position, with degrees of 
methylation represented in grey, from light (low proportion of methylation) to dark (high 
proportion of methylation). Each row represents one specific CpG position within a gene and 
each column shows the methylation profile for the different CpGs for one sample. The p- 
values for the individual CpG positions are shown on the right side. The p-values are the 
probabilities that the observed distribution occurred by chance in the data set. 
Figure 49 shows a ranked matrix of some of the best markers obtained according to Example 
1 (Metastatic setting, limited sample set). P-values were calculated from Likelihood ratio 
(LR) tests from univariate logistic regression models. The figure is shown in greyscale, 
wherein the most significant CpG positions are at the bottom of the matrix with significance 
decreasing towards the top. Black indicates total methylation at a given CpG position, white 
represents no methylation at the particular position, with degrees of methylation represented 
in grey, from light (low proportion of methylation) to dark (high proportion of methylation). 
Each row represents one specific CpG position within a gene and each column shows the 
methylation profile for the different CpGs for one sample. The p-values for the individual 
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CpG positions are shown on the right side. The p-values are the probabilities that the 
observed distribution occurred by chance in the data set. 



Figures 47 and 50 show the uncorrected p-values on a log-scale. P-values were calculated 
from Likelihood ratio (LR) tests from multivariate logistic regression models according to 
Example 1 (metastatic setting) . Each individual genomic region of interest is represented as a 
point, the upper dotted line represents the cut off point for the 25% false discovery rate, the 
lower dotted line shows the Bonferroni corrected 5% limit. 

Figure 51 shows a ranked matrix of the best 11 amplificates of data obtained according to 
Example 1 (Metastatic setting, all samplews). P-values were calculated from Likelihood ratio 
(LR) tests from multivariate logistic regression models. The figure is shown in greyscale, 
wherein the most significant CpG positions are at the bottom of the matrix with significance 
decreasing towards the top. Black indicates total methylation at a given CpG position, white 
represents no methylation at the particular position, with degrees of methylation represented 
in grey, from light (low proportion of methylation) to dark (high proportion of methylation). 
Each row represents one specific CpG position within a gene and each column shows the 
methylation profile for the different CpGs for one sample. The p-values for the individual 
CpG positions are shown on the right side. The p-values are the probabilities that the 
observed distribution occurred by chance in the data set. 

Figure 52 shows the disease-free survival curves for a combination of two oligonucleotides 
each from the genes TBC1D3 and CDK6, and one oligonucleotide from the gene PITX2. The 
black plot shows the proportion of disease free patients in the population with above median 
methylation levels, the grey plot shows shows the proportion of disease free patients in the 
population with below median methylation levels 

Figure 53 shows the plot according to Figure 52 and the classification of the sample set by 
means of the St. Gallen method. The unbroken lines represent the methylation analysis 
wherein the black plot shows the proportion of disease free patients in the population with 
above median methylation levels, the grey plot shows shows the proportion of disease free 
patients in the population with below median methylation levels. The broken lines represent 
----- the_St. Gallen-classification of the^ampte set wherein the black plot shows the disease free 
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survival time of the high risk group and the grey plot shows the disease free survival of the 
low risk group. 



SEQ ID NOS: 1 to 61 represent 5' and/or regulatory regions and/or CpG rich regions of the 
genes according to Table 1 . These sequences are derived from Genbank and will be taken to 
include all minor variations of the sequence material which are currently unforeseen, for 
example, but not limited to, minor deletions and SNPs. 

Example 1 

DNA samples were extracted using the Wizzard Kit (Promega), samples from 278 patients 
were analysed, data analyses were carried out on a selection of candidate markers. 

Bisulfite treatment and mPCR 

Total genomic DNA of all samples was bisulfite treated converting unmethylated cytosines to 
uracil. Methylated cytosines remained conserved. Bisulfite treatment was performed with 
minor modifications according to the protocol described in Olek et al. (1996). After 
bisulfitation 10 ng of each DNA sample was used in subsequent mPCR reactions containing 
6-8 primer pairs. 

Each reaction contained the following: 

2.5 pmol each primer 

11.25 ng DNA (bisulfite treated) 

Multiplex PGR Master mix (Qiagen) 

Further details of the primers are shown in TABLE 2. 

Initial denaturation was carried out at 95°C for 15 min. Forty cycles were carried out as 
follows: Denaturation at 95°C for 30 sec, followed by annealing at 57°C for 90 sec., primer 
elongation at 72°C for 90 sec. A final elongation at 72°C was carried out for 10 min. 

* 

Hybridisation 
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All PGR products from each individual sample were then hybridised to glass slides carrying a 
pair of immobilised oligonucleotides for each CpG position under analysis. Each of these 
detection oligonucleotides was designed to hybridise to the bisulphite converted sequence 
around one CpG site which was either originally unmethylated (TG) or methylated (CG). See 
Table 2 for further details of hybridisation oligonucleotides used. Hybridisation conditions 
were selected to allow the detection of the single nucleotide differences between the TG and 
CG variants. 

5 fi\ volume of each multiplex PCR product was diluted in 10 x Ssarc buffer . The reaction 
mixture was then hybridised to the detection oligonucleotides as follows. Denaturation at 
95°C, cooling down to 10 °C, hybridisation at 42°C overnight followed by washing with 10 x 
Ssarc and dH20 at 42°C. Further details of the hybridisation oligonucleotides are shown in 
TABLES. 

Fluorescent signals from each hybridised oligonucleotide were detected using genepix 
scanner and software. Ratios for the two signals (from the CG oligonucleotide and the TG 
oligonucleotide used to analyse each CpG position) were calculated based on comparison of 
intensity of the fluorescent signals. 

Data analysis methods 

Analysis of the chip data: From raw hybridisation intensities to methylation ratios; The log 
methylation ratio (log(CG/TG)) at each CpG position is determined according to a 
standardised preprocessing pipeline that includes the following steps: For each spot the 
median background pixel intensity is subtracted from the median foreground pixel intensity 
(this gives a good estimate of background corrected hybridisation intensities): For both CG 
and TG detection oligonucleotides of each CpG position the background corrected median of 
the 4 redundant spot intensities is taken; For each chip and each CpG position the 
log(CG/TG) ratio is calculated; For each sample the median of log(CGZTG) intensities over 
the redundant chip repetitions is taken. This ratio has the property that the hybridisation noise 
has approximately constant variance over the full range of possible methylation rates (Huber 
et al M 2002). 

i 

Hypothesis testing 
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The main task is to identify markers that show significant differences in the average degree of 
methylation between two classes. A significant difference is detected when the nullhypothesis 
that the average methylation of the two classes is identical can be rejected with p<0.05. 
Because we apply this test to a whole set of potential markers we have to correct the p- values 
for multiple testing. This was done by applying the False Discovery Rate (FDR) method 
(Dudoit et al., 2002). 

For testing the null hypothesis that the methylation levels in the two classes are identical we 
used the likelihood ratio test for logistic regression models (Venables and Ripley, 2002). The 
logistic regression model for a single marker is a linear combination of methylation 
measurements from all CpG positions in the respective genomic region of interest (ROI). A 
significant p- value for a marker means that this ROI has some systematic correlation to the 
question of interest as given by the two classes. However, at least formally it makes no 
statement about the actual predictive power of the marker. 
Logistic Regression 

Logistic regression models are tools to model the probability of an event in dependence of 
one or more variables 

or factors. For example, if x denotes a specific methylation logratio, the probability that a 

patient responds to the 

applied therapy (Tamoxifen) is modeled as 

P(response I x) = exp{ . + x)/[l + exp( . + *)]. (1) 

If xl 9 . . .yXk denote the k methylation logratios measured for one amplificate, the model is 
P(response \xl,...jck) = exp{ . + lxl + ... + xk)/[l + exp( . + lxl + ... + xk)]. (2) 

Significance of the respective amplificate is assessed using a likelihood-ratio test. This test 
calculates the 

difference of 2Log(likelihood) for the full model and the null-model including just the 
intercept . which is 
approximately 2 

-distributed with k degrees of freedom under the null hypotheses 1 = ... = h=0. 

If additional covariates are considered, the model contains an additional parameter for each 

covariate and the test 

statistic is calculated as the difference of -2Log(likelihood) or the full model and the null- 
model including 

? i . ■ • ■■ 
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intercept and covariates. Again, given the null hypothesis, this difference is approximately 2 
-distributed with k 
degrees of freedom. 

Ranked Matrices 

For a graphical display of all group comparisons, ranked matrices are used. Each row 
represents one oligo pair, 

whereas each column of the matrix stands for one sample (or chip in the case of up- versus 
downmethylated 

Promega DNA comparisons). Oligo pairs are ranked according to their discriminatory power 
(Wilcoxon test, 

Fisher score or logistic regression), where the best "marker" is displayed on the bottom line. 
Low methylation is displayed in light grey, high methylation in dark grey, and the data are 
normalized prior to display. For each oligo pair, the median is subtracted and the differences 
are subsequently divided by the MAD. Values below 2 or above 2 are censored to 2 and 2, 
respectively. The color scale is then adjusted to the whole matrix. 

Cox Regression 

Disease-free survival times (DFS) are modeled using Cox regression models. These models 
are similar to 

logistic regression models, but instead of probabilities, the hazard is modeled. The hazard 
gives the 

instantaneous risk of a* relapse. The models 

h(t \x) = hO (0 exp( x) (3) 

and 

hit I jcl,. . .jck) = hO (f) exp( Ixl + ... + kxk) (4) 

are used for uni- and multivariate analyses, respectively, where t is the time measured in 
months after surgery 

and h0(t) is the baseline hazard. The official milestone criteria refer to model (4), whereas 
results from model (3) 

are used for a better understanding of the most relevant CpG positions. 

Likelihood ratio tests are performed similar to those used for logistic regression. Again, the 

difference between 

— 2Log(Likelihood) of foil model and null-model is approximately - 2 
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-distributed with k degrees of freedom 
under the null hypotheses 1 = ... = k= 0. 
Additional covariates can be included into the models. 

Stepwise Regression Analysis 

For both multivariate logistic and Cox regression models, a stepwise procedure is used in 
order to find submodels 

including only relevant variables. Two effects are usually achieved by these procedures: 

Variables (methylation ratios) that are basically unrelated to the dependent variable (response 
state or 

DFS, respectively) are excluded as they do not add relevant information to the model. 
Out of a set of highly correlated variables, only the one with the the best relation to the 
dependent 
variable is retained. 

Inclusion of both types of variables can lead to numerical instabilities and a loss of power. 
Moreover, the 

predictory performance can be low due to overfitting. 

The applied algorithm aims at minimizing the Akaike information criterion (AIC) which is 
defined as 

AIC = 2 maximized log-likelihood + 2 #parameters. 

The AIC is related to the predictory performance of a model, smaller values promise better 
'performance. 

Whereas the inclusion of additional variables always improves the model fit and thus 
increases the likelihood, 

the second term penalizes the estimation of additional parameters. The best model will present 
a compromise 

model with good fit and usually a small or moderate number of variables. 
Results 

Adjuvant setting 

Analysis of the methylation patterns of patient samples treated with Tamoxifen as an adjuvant 
therapy immediately following surgery (see Figure 1) is shown in the plots according to 
Figures 3 to 45. For each amplificate, the mean methylation over all oligo-pairs for that 
amplificate was calculated and the population split into groups according to their mean 
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methylation values, wherein one group was composed of individuals with a methylation score 
higher than the median and a second group composed of individuals with a methylation score 
lower than the median. 

The results are shown in figures 3 to 9, as Cox model estimated disease-free survival curves. 
Figures 10 to 34 show the disease free survival curves using the methylation analyses of only 
single oligonucleotide. 

In a further analysis the recurrance of distant metastses only was analysed in figures 35 to 46. 
The accuracy of the differentiation between the different groups was further increased by 
combining multiple oligonucleotides from different genes. Figures 52 show the combination 
of two oligonucleotides each from the genes TBC1D3 and CDK6, and one oligonucleotide 
from the gene PITX2. Figure 53 shows the classification of the patienst from the sample set 
by means of the St. Gallen method (the current method of choice for estimating disease free 
survival) on top of Figure 52, thereby showing the improved effectiveness of methylation 
analysis over current methods, in particular post 80 months. 
Metastatic setting 

Analysis of the methylation patterns of patient samples treated with Tamoxifen in a metastatic 
setting (see Figure 2) is shown in the matrices according to Figures 46 to 52). The subjects 
analysed in this classification had relapsed following an initial treatment, the subsequent 
metastasis being treated by Tamoxifen. 

In order to determine the ability of each gene promoter to predict success or failure of 
Tamoxifen treatment, the individual CpGs measured were combined per gene using 
Hotelling's T 2 statistics . Several genes were significantly associated with response to 
tamoxifen after correcting for multiple comparison with a moderate conservative false 
discovery rate of 25% (see Figure 52). The genes were ONECUT2, WBP1 1, CYP2D6, 
DAG1, ERBB2, S100A2, TFF1, TP53, TMEFF2, ESR1, SYK, RASSF1, PITX2, PSAT1, 
CG A and PC AF . 

Figures 50 shows the uncorrected p- values on a log-scale. P-values were calculated from 
Likelihood ratio (LR) tests from multivariate logistic regression models. Each individual 
genomic region of interest is represented as a point, the upper dotted line represents the cut 
off point for the 25% false discovery rate, the lower dotted line shows the Bonferroni 
corrected 5% limit. 

Figure 51 shows a ranked matrix of the best 1 1 amplificates of data obtained . P-values were 
calculated from Likelihood ^ratio (I^) tests fromnrndtivariate logistic xegressron^odels. The 
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figure is shown in grayscale, wherein the most significant CpG positions are at the bottom of 
the matrix with significance decreasing towards the top. Black indicates total methylation at a 
given CpG position, white represents no methylation at the particular position, with degrees 
of methylation represented in grey, from light (low proportion of methylation) to dark (high 
proportion of methylation). Each row represents one specific CpG position within a gene and 
each column shows the methylation profile for the different CpGs for one sample. The p- 
values for the individual CpG positions are shown on the right side. The p- values are the 
probabilities that the observed distribution occurred by chance in the data set. 



Figures 47 through 49 the analysis of a subset of shows the uncorrected p-values on a log- 
scale. 

Figure 47 shows the uncorrected p-values on a log-scale. P-values were calculated from 
Likelihood ratio (LR) tests from multivariate logistic regression models according to Example 
1 (metastatic setting) . Each individual genomic region of interest is represented as a point, 
the upper dotted line represents the cut off point for the 25% false discovery rate, the lower 
dotted line shows the Bonferroni corrected 5% limit. 

Figure 48 shows a ranked matrix of the best 1 1 amplificates of data obtained. P-values were 
calculated from Likelihood ratio (LR) tests from multivariate logistic regression models. The 
figure is shown in greyscale, wherein the most significant CpG positions are at the bottom of 
the matrix with significance decreasing towards the top. Black indicates total methylation at a 
given CpG position, white represents no methylation at the particular position, with degrees 
of methylation represented in grey, from light (low proportion of methylation) to dark (high 
proportion of methylation). Each row represents one specific CpG position within a gene and 
each column shows the methylation profile for the different CpGs for one sample. The p- 
values for the individual CpG positions are shown on the right side. The p-values are the 
probabilities that the observed distribution occurred by chance in the data set. 



Table 1 



Accession 


Gene 


Genomic 


Pretreated 


Pretreated 


Pretreated 


Pretreated 
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no. 


name/locus 


SEQ1D 
NO: 


methylated 
sequence 
(sense) SEQ 
ID NO: 


methylated 
strand 
(antisense) 
SEQ ID 
NO: 


unmethylated 
sequence 
(sense) SEQ 
ID NO: 


unmethylated 
sequence 
(antisense) 
SEQ ID NO: 


NM_U01965 


EGR4 


1 


62 


63 


184 


185 


xtx at f\r\r\f\'j o 

NM_U00038 


APC 


2 


64 


65 


186 


187 


JNM__UUUU77 


CDKN2A 


3 


66 


67 


188 


189 


XTA yf r\f\A OOC 


CSPG2 


4 


68 


69 


190 


191 


IN M._UU444o 


liKx3B2 


5 


70 


71 


192 


193 


iNJVl_UUDDuJ 


blJVUNl 


6 


72 


73 


194 


195 




olKll 


7 


74 

■ 


75 


196 


197 


\TA If CSf\ 1 1 1 £L 


CA9 


8 


76 


77 


198 


199 


JNM_UUloU4 


PAX6 


9 


78 


79 


200 


201 


NM_U0ol42 


SFN 


10 


80 


81 


202 


203 


XTA if f\f\Cf\*70 

NM_005978 


o 1 t\r\ a 

S100A2 


11 


82 


83 


204 


205 


iSIM_0U3225 


TFF1 


12 


84 


85 


206 


207 


NM_003242 


TGFBR2 


13 


86 


87 


208 


209 


X. TX Jf f\j~\f~\ C A S~ 

NM__000546 


TP53 


14 


88 


89 


210 


211 


x tx is r\r\^ a f\ t 

NM__005427 


TP73 


15 


90 


91 


212 


213 


NMJ)02658 


T"V¥" ATT 

PLAU j 


16 


92 


93 


214 


215 


XTA Jf f\ t s~ t r\s~\ 

NM_016192 


TMEFF2 


17 


94 


95 


216 


217 


NM_000125 


ESR1 


18 


96 


97 


218 


219 


NM_O03177 


SYK 


19 


98 


99 


220 


221 


JNM_UU154(j 


HSPB1 


20 


100 


101 


222 


223 


X TX AT Ol 

INM_UU7182 


RASSF1 


21 


102 


103 


224 


225 


KTA. if r\ 1 C/C ,1 1 

JNiVl_UlDD41 ; 


I rib 


22 


104 


105 


226 


227 


lNlVl_UUlo 2D 




23 


106 


107 


228 


229 


NM 000836 


GRIN2D 


24 


lvO 


1U7 






NM_021154 


PSAT1 


25 


110 


111 


232 


233 


NM_000735 


CGA 


26 


112 


113 


234 


235 


NM_000106 


CYP2D6 


27 


114 


115 


236 


237 
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NM_004718 


COX7A2L 


28 


116 


117 


238 


239 


NM_001437 


ESR2 


29 


118 


119 


240 


241 


NM_002658 


PLAU 


30 


120 


121 


242 


243 


NM_000638 


VTN 


31 


122 


123 


244 


245 


NM_001055 


SULT1A1 


32 


124 


125 


246 


247 


NM_003884 


PCAF 


33 


126 


127 


248 


249 


NM_006254 


PRKCD 


34 


128 


129 


250 


251 


NM_004852 


ONECUT2 


35 


130 


131 


252 


253 


NM_001706 


BCL6 


36 


132 


133 


254 


255 


NM_016312 


WBP11 


37 


134 


135 


256 


257 


NM_002462 


(MX1) 


38 


136 


137 


258 


259 


NM_138433 


N.N. 


39 


138 


139 


260 


261 


NM_000484 


APP 


40 


140 


141 


262 


263 


NM_002552 


ORC4L 


41 


142 


143 


264 


265 


NM_138999 


NETOl 


42 


144 


145 


266 


267 


NM_032258 


TBC1D3 


43 


146 


147 


268 

4o* X«* 


269 


NM_005310 


GRB7 


44 


148 


149 


270 


271 


NM 000106 


CYP2D6 


45 


150 

JL. V 


151 


272 


273 


NM 001259 


CDK6 


46 


152 


153 


274 


275 




(Chr. Ipl3.2) 


47 


154 


155 


276 


277 




(Chr. 17q25.1) 


48 


156 


157 


278 


279 \ 


NM 007168 


ABCA8 


49 


158 


159 


280 


281 




(Chr. 12ql4.3) 


50 


160 

JL V«J V«* 


161 


282 


283 




(Chr. 8ql2.1) 


51 


162 


163 


284 


285 


NM_017490 


MARK2 


52 


164 


165 


286 


287 


NM_005229 


ELK1 


53 


166 


167 


288 


289 




"Q8WUT3" 


54 


168 


169 


290 


291 


NM_000737 


CGB 


55 


170 


171 


292 


293 


NM_001728 


BSG 


56 


172 


173 


294 


295 


NM__005881 


BCKDK 


57 


174 


175 


296 


297 


NM_014587 


SOX8 


58 


176 


177 


298 


299 


NM_004393 


DAG1 


59 


178 


179 


300 


301 
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NM_020210 


SEMA4B 


60 


180 


181 


302 


303 


NM_000125 


ESR1 (exon8) 


61 


182 


183 


304 


305 



Table 2 Primers and amplificates according to Example 1 



Gene: 


Primer: 


Amplificate Length: 


EGR4 
(SEQ 
ID NO: 

1) 


AGGGGGATTGA 
GTGTTAAGT 
(SEQ ID NO: 450) 
CCCAAACATAAA 
CACAAAAT 
(SEQ ID NO: 451) 


294 


APC 
(SEQ 
ID NO: 

2) 


TCAACTACCATC 

AACTTCCTTA 

(SEQ ID NO: 452) 

AATTTATTTTTA 

GTGTTGTAGTGG 

G 

(SEQ ID NO: 453) 


491 


CDKN 
2A 
(SEQ 
ID NO: 

3) 


GGGGTTGGTTGG 
TTATTAGA 
(SEQ ID NO: 454) 
AACCCTCTACCC 
ACCTAAAT 
(SEQ ID NO: 455) 


256 


CSPG2 
(SEQ 
ED NO: 


GGATAGGAGTTG 
GGATTAAGAT 
(SEQ ID NO: 456) 
AAATCTTTTTCA 
ACACCAAAAT 
(SEQ ID NO: 457) 


414 


ERBB2 


GGAGGGGGTAG 
AGTTAT^AGTT ~ 


257 

! 


(SEQ 


* 

f 
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Gene: 


Primer: 


Amplificate Length: 


ID NO- 


fSEO ID NO- 458^ 




5) 


TATACTTCCTCA 






AACAACCCTC 






(SEQ ID NO: 459) 




PTA /f KT 


G AGTTTGTA1T1 1 


334 


1 


AAGTTGAGTGGT 




(SEQ 


T 




ID NO- 


fSEO ID NO- 4fi(Y1 






A A A A A A A A Ti 

AACAAAACAAT 






ACCCCTTCTAA 






(SEQ ID NO: 461) 




OT^X /XT 

STMN 


CCTCTTACTAAC 


A A 

454 


1 


CTCAACCAAC 




fSEO 


fSEO ID NO- 46^ 




LL) NO: 


GAAAGGTAGGG 




6) 


AAGGATTTTT 






(SEQ ID NO: 462) 




STK11 


TAAAAGAAGGA 


528 


(SEQ 


TTTTTGATTGG 




ID NO- 


rsFn rn wo- 4^a\ 




7) 


CATCTTA1T1AC 






CTCCCTCCC 






(SEQ ID NO: 465) 




CA9 


GGGAAGTAGGTT 




(SEQ , 


A.GGGTTAGTT 




ID NO- 1 






8) 


AAATCCTCCTCT 




( 


*~%tT*S AAA T* aaa m 

^CAAATAAAT 




( 


>iiQ ixJ JNO: 467) 




PAX6 ( 


3GAGGGGAGAG • 


374 


(SEQ < 


3GTTATG 




ID, NO: ( 


SEQ ID NO: 468) 




9) , il 


fACTATACACAC 





-50- 



\Gene: 


Primer: 


Amplificate Length: 




CCCAAAACAA 
(SEQ ID NO: 469) 




iSFN 
(SEQ 
pNO: 
10) 


GAAGAGAGGAG 
AGGGAGGTA 
(SEQ ID NO: 470) 
CTATCCAACAAA 
CCCAACA 
(SEQ ID NO: 471) 


489 


pi 00 A 

(SEQ 
IDNO: 

11) 


GTTTTTAAGTTG 
GAGAAGAGGA 
(SEQ ID NO: 472) 

ACCTATAAATCA 
CAACCCACTC 
(SEQ ID NO: 473) 


460 


TFFl 
(SEQ 
pNO: 
12) 


TTGGTGATGTTG 
ATTAGAGTTT 
(SEQ ID NO: 474) 
TAAAACACCTTA 
CATTTTCCCT 
(SEQ ID NO: 475) 


449 


[tgfb 

R2 
KSEQ 
pNO: 
13) 


GTAATTTGAAGA 
AAGTTGAGGG 
(SEQ ID NO: 476) 

CCAACAACTAAA 
CAAAACCTCT 
(SEQ ID NO: 477) 


296 


p>53 
(SEQ 
ID NO: 
14) 


ITGATGAGAAGA 
AAGGA1T1AGT 
(SEQ ID NO: 478) 
rCAAAlTCAATC 
f\AAAACTTACC 
[SEQ ID NO: 479) 


496 

* 


rP73— i 


\GTAAATAGTGG 


>07 



51 



\Gene: 


Primer: 


Amplificate Length: 


(SEQ 
JD NO: 


GTGAGTTATGAA 
(SEQ ID NO: 480) 

AAAACTACTCTC 
C 

(SEQ ID NO: 481) 




PLAU 

(SEQ 

pNO: 
h f>} 


GAGAGAGATAG 
TTGGGGAGTTT 
(SEQ ID NO: 482) 

PA A APA A A PTTp 

ATCTACCAAATA 
C 

(SEQ ID NO: 483) 


453 


[TMEF 
F2 

ttDNO: 
17) 


TGTTGGTTGTTG 
TTGTTGTT 

CTTTCTACCCAT 
CCCAAAA 
(SEQ ID NO: 485) 


319 


ESRl 
pEQ 

18) 


CTATCAATTCCC 
CCAACTACT 

^oriQ ID INvJ: 487) 

1'1'GTTGGATAGA 
GGTTGAGTTT 
(SEQ ID NO: 486) 


349 


KYK 
(SEQ 

rn wo- 

ILL/ XNv-J. 

19) 

| 


GTGGGTTTTGGG 
IAGTTATAGA 

rAACCTCCTCTC 
CTTACCAA 
:SEQ ID NO: 489) 


485 

■ 


HSPB1 
(SEQ ' 


2ctacctctacc : 
\cttctcaat 


216 

' .■ » 



J 



Gene: 


Primer: 


Amplificate Length: 


ID NO: 


(SEQ ID NO: 491) 




20) 


AAGAGGGTTTAG 






IT1TTATTTGG 






(SEQ ID NO: 490) 




RASSF 


AGTGGGTAGGTT 


319 


1 


AAGTGTGTTG 




(SEQ 


(SEQ ID NO: 492) 




ID NO: 


CCCCAAAATCCA 




21) 


AACTAAA 






(SEQ ID NO: 493) 




IES 


AGGTTGGGGATT 


448 


(SEQ 


TTAGTTTTT 




ID NO: 


(SEQ ID NO: 494) 




22) 


ACCTTCTTCACT 






TTATlTrCCA 






(SEQ ID NO: 495) 




P1TX2 


TCCTCAACTCTA 


408 


(SEQ 


CAAACCTAAAA 




ID NO: 


(SEQ ID NO: 497) 




23) 


GTAGGGGAGGG 






AAGTAGATGT 






(SEQ ID NO: 496) 




GRIN2 


ATAGTTTGTGGT 


435 


D 


1TGGATTTTT 




(SEQ 


(SEQ ID NO: 498) 




ID NO: 


AAAACCTTTCCC 




24) 


TAACTTCAAT 






(SEQ ID NO: 499) 




PSAT1 


GTAGGTGGTTAA 


500 


(SEQ 


1T1TGGGTT 




ID NO: 


(SEQ ID NO: 500) 




25) 

_ . ^- , w .„ 1— 


CTCATTCACACT 




1 


ATATCCATTGA 
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Gene: 


Primer: 


Amplificate Length: 




(SEQ ID NO: 501) 




PSATl 
(SEQ 
ID NO: 
25) 


TAAGAGAGAGG 
AGTTGAGGTTT 
(SEQ ID NO: 502) 
CCAAAATTAACC 
ACCTACCTAA 
(SEQ ID NO: 503) 


478 


CGA 
(SEQ 
ID NO: 

26) 


TAGTGGTATAAG 
TTTGGAAATGTT 
(SEQ ID NO: 504) 
TCCACCTACATC 
TAAACCCTAA 
(SEQ ID NO: 505) 


364 


CYP2D 
6 

(SEQ 
ID NO: 
27) 


CCTCCTAAACTA 
AATCCAACAA 
(SEQ ID NO: 507) 
GGGGTTAAGGTT 
TTTATGGTA 
(SEQ ID NO: 506) 


418 


COX7 
A2L 
(SEQ 
ID NO: 

28) 


AATCCTAAAAAC 

CCTAACTTTTAA 

T 

(SEQ ID NO: 509) 
GGAGGTGTAAG 
GAGAATAGAGA 
(SEQ ID NO: 508) 


398 


ESR2 
(SEQ 
ID NO: 
29) 

• 


AAACCTTCCCAA 
T AACCTCTT A ; 
(SEQ ID NO: 511) 
TAGAGGGGAGT 
AGTGTTTGAGT 
(SEQ ID NO: 510) 


471 

* 


PLAU : 


GTGATATTTGGG 


479 , , . • 
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Gene: 


Primer: 


Amplificate Length: 

* 


(SEQ 


GATTGTTATT 




ID NO: 


(SEQ ID NO: 512) 




30) 


ACTCCCTCCCCT 






ATCTTACA 






(SEQ ID NO: 513) 


* 


VTN 


GT1A1T1X3GGTT 


492 


(SEQ 


AATGTAGGGA 




ID NO: 


(SEQ ID NO: 514) 




31) 


TCTATCCCCTCA 






AACTTAAAAA 






(SEQ ID NO: 515) 




SULT1 


ATACTACCAAAC 


448 


Al 


CCACTCAAAC 




(SEQ 


(SEQ ID NO: 517) 




ID NO: 


GAATTTAGGGAA 




32) 


GGAGTTAGTTG 






(SEQ ID NO: 516) 




PCAF 


GGATAAATGATT 


369 


(SEQ 


GAGAGGTTGT 




ID NO: 


(SEQ ID NO: 518) 




33) 


CCTCCCTTAATT 






CTCCTACC 






(SEQ ID NO: 519) 




PRKC 


CTTAACCCATCC 


322 


D 


CAATCA 




(SEQ 


(SEQ ID NO: 521) 




ID NO: 


GATAGAAGGATT 




34) 


TTAGTTTTTATTG 






TT 






(SEQ ID NO: 520) 




ONEC 


riTG'lTGGGATT 


467 


UT2 


TGTTAGGAT 




(SEQ 


(SEQ ID NO: 522) 





Gene: i 


p rimer: I 


kmplificate Length: 


ID NO: , 
35) ( 

1 


AAACATTTTACC 
3CTCTAAACC 
;SEQ ID NO: 523) 




BCL6 i 
(SEQ 
IDNO: 
36) 


dATCACCACTTC I 
rAAAAACCC 
[SEQ ID NO: 525) 
GGGTAAGAAAG 
AAGGAATTAGTT 

r 

(SEQ ID NO: 524) 


156 


WBP11 
(SEQ 
ED NO: 

37) 

ft 
* 


AAGAGGTGAGG 
AAGAGTAGTAA 
AT 

(SEQ ID NO: 526) 
CTCCCAACAACT 
AAATCAAAAT 
(SEQ ID NO: 527) 


437 


(MX1) 
(SEQ 
IDNO: 
38) 


TGTAGGAGAGGT 
TGGGAAG 
(SEQ ID NO: 528) 
CCAAACATAACA 
TCCACTAAAA 
(SEQ ID NO: 529) 


341 


N.N. 
(SEQ 
IDNO: 
39) 


TAGGTTTAAGAG 
GAGAGGGAAT 
(SEQ ID NO: 530) 
AAACAACTACCC 
AAATCCAAC 
(SEQ ID NO: 531) 


433 

1 


APP 
(SEQ 
Dp NO; 
40) 


GAGTAAGGAAG 
GGGGATG 
(SEQ ID NO: 532) 
AACCCAAATCTT 


494 

i 

' ' * * " 
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Gene: 



Primer: 



Amplificate Length: 



TAATACAAAAA 
(SEQ ID NO: 533) 



NETOl GGAGTTTTTAGA 



(SEQ 
ID NO: 
42) 



395 



AGAGGAAGATT 
(SEQ ID NO: 534) 

ACTTCACAATAA 
ATACCCTCCC 
(SEQ ID NO: 535) 



TBC ID GGTAGAGGAAG 



490 



(SEQ 
ID NO: 

43) 



TAGTTGGTTTG 
(SEQ ID NO: 536) 
CTTTTATATTTCT 
CCCAATCTCC 
(SEQ ID NO: 537) 



GRB7 
(SEQ 
ID NO: 

44) 



AAAATCCATAAC ni6 
CACCAAAATA 
(SEQ ID NO: 539) 
ITAGGAAGTTTT 
AGGAATGAGG 
(SEQ ID NO: 538) 



CYP2D AATTTCCTAACC 



379 



(SEQ 
ID NO: 

45) 



CACTATCCTC 
(SEQ ID NO: 541) 
ATTTGTAGTTTG 
GGGTGATTT 
(SEQ ID NO: 540) 



CDK6 
(SEQ 
ID NO: 
46) 



(Chr. 



ACCTTAAACACC 

TTCCCATAA 

(SEQ ID NO: 543) 

GTGTAATGATTT 

TGGATTGAGA 

(SEQ ID NO: 542) 

AAGGAAGGTAG 



456 



499 



56 



-57- 



Gene: \Primer: 



jnplificate Length: 



lp 1 3 .2) kGGGTTG AGT 
(SEQ (SEQ ID NO: 544) 
ID NO: LaAATCCAAAATI 



47) 



AACACCATT 
|(SEQ ID NO: 545) 



(Chr. L\GTAGATGAAGT|500 
17q25. fTGGGGATTAG 
1) (SEQ ID NO: 546) 
(SEQ [TCCTACTATCCC 
ED NO: [TTCTCAAAAA 
48) (SEQ ED NO: 547) 



ABCA fTGATTGTGTAGA H99 



8 



TT ATTTTTG GTT 



(SEQ (SEQ ID NO: 548) 
tD NO: CAAACTCTCTAA 



49) 



.CCTCAATCTC 
|(SEQ ID NO: 549) 



(Chr. IACCCTAACATTC 
12ql4. fTCTAAACAACA 
3) |(SEQ ID NO: 551) 
(SEQ GATGAAAGTGG 
BD NO: UaAGATTATGG 



50) 



;SEQ ID NO: 550) 



(Chr. CTCCAACTCTCC (343 
8ql2.1)fTCACCTC 
(SEQ (SEQ ID NO: 553) 
ED NO: IaTTTGAAGGTTG 



51) 



GTTTGTAGA 
'SEQ ID NO: 552) 



IMARK frCACC ACTATCC |476 

CAATAATCA i 
(SEQ |(SEQ ID NO: 555) 
ED NO: (TAAAGTAGGAA 



-58- 



Gene: 
52) 



rimer: Wnplificate Length: 

' GTTTG GTTTG 



(SEQ ID NO: 554) 



ELK1 pCTCTAATTCCT 
(SEQ UtCAATCACC 
ID NO: (SEQ ID NO: 557) 
53) fTTAGAAGTGAAAl 
STAG AAGGGTTT I 
|(SEQ ID NO: 556) 



■35 



Q8WU jGGTTAGAAGTTA 
T3 GAGGGGTAGG 
(SEQ |(SEQ ID NO: 558) 
ID NO: pCATCCCATTAC 
54) CTATAAAAAT 
(SEQ ID NO: 559) 



^06 



CGB pCCACCCTATTT 
(SEQ pCTACCAA 

NO: (SEQ ID NO: 561) 
<5) JTTTGTTTTAGGT 
GGTGTGTAAT 
(SEQ ED NO: 560) 



17 



SG [TTATCTATCCCC 
|(SEQ IACACCCTAAT 



420 



NO: 



'6) 



CKD 



:seq 

NO: 
157) . 



(SEQ ID NO: 563) 
GGAGTAGGTGA 
GGAGTATTTTG 
(SEQ ID NO: 562) 



TCACCTCCTTTT i258 
ACAACCAAT 
(SEQ ID NO: 565) 
ITTGGGAGAGTT 
TTAGGATTTA 
(SEQ ID NO: 564) 



-59- 



Gene: 


Primer: 


Amplificate Length: 


(SEQ 
ID NO 

58) 


GGTTTGTT 

ACACACTCCTTA 
AAACTCTTCC 
(SEQ ID NO: 567) 




DAG1 
(SEQ 
ID NO- 
59) 


AATACCAACCCA 
AACATCTACC 

IT1GGTTATGTG 
GAGTTTATTGT 
(SEQ ID NO: 568) 


315 


ORC4L 
(SEQ 

IT) KfO- 

41) 


CACTCAAAACTT 
CCCTACCTAC 

GGTAATGGTGGG 
GGTAAAT 
(SEQ ID NO: 570) 


489 


SEMA 
4B 

ID NO: ( 
60) l 

( 


ACCAAAATACTA 
CTCCCAAATC 

f QV*C\ TT\ KT/'V <T2\ 
^OJZK^l MJ INU. O I D) 

3GGTAGAGGGA 
3GTTATTGTT 
;SEQ ID NO: 572) 


337 


ESR1 : 
(exon8) " 
(SEQ ( 
ID NO: ( 
61) 1 

( 


rATGATTTGTTG • 
ITGGAGATGT 
SEO ID NO* 574^ 

:ttaaaatccct 
[taactattccc 

SEQ ID NO: 575) 


388 



Table 3 Hybridisation oligonucleotides accordin g to Example 1 

Oligo: 
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Gene t 


Oligo: 


ONECUT2 
(SEQ ID NO: 35) 


rACGTAGTTGCGCGTT 
[SEQ ED NO: 800) 


ONECUT2 
(SEQ ED NO: 35) 


GTATGTAGTTGTGTGTT 
(SEQ ID NO: 801) 


ONECUT2 
(SEQ ID NO: 35) 


ITTTGTGCGTACGGAT 
(SEQ ED NO: 802) 


ONECUT2 
(SEQ ID NO: 35) 


nTTTGTGTGTATGGAT 
(SEQ ED NO: 803) 


ONECUT2 
(SEQ ID NO: 35) 


TTAAGCGGGCGTTGAT 
(SEQ ID NO: 804) 


ONECUT2 
(SEQ ID NO: 35) 


TTAAGTGGGTGTTGAT 
(SEQ ID NO: 805) 


ONECUT2 
(SEQ ID NO: 35) 


TAGAGGCGCGGGTTAT 
(SEQ ID NO: 806) 


ONECUT2 
(SEQ ID NO: 35) 


TAGAGGTGTGGGTTAT 
(SEQ ID NO: 807) 


BCL6 

(SEQ ID NO: 36) 


ATTTCGAAATATGTCGG 
(SEQ ID NO: 1004) 


BCL6 

(SEQ ID NO: 36) 


ATTTTGAAATATGTTGGT 
(SEQ ID NO: 1005) 


BCL6 

(SEQ ID NO: 36) 


ATTCGAGACGTTTTGT 
(SEQ ID NO: 1006) 


BCL6 

(SEQ ID NO: 36) 


TTTGAGATGTTTTGTTTA 
(SEQ ED NO: 1007) 


BCL6 

(SEQ ED NO: 36) 


TTCGAGTTTCGAATCGG 
(SEQ ID NO: 1008) 


BCL6 

(SEQ ED NO: 36) 


TTTGAGTTTTGAATTGGA 
(SEQ ID NO: 1009) 


BCL6 

(SEQ ED NO: 36) 


ATAGCGAAGGCGTCGA 
(SEQ ID NO: 1010) 


BCL6 

(SEQ ID NO: 36) 


TATAGTGAAGGTGTTGA 
(SEQ ID NO: 1011) 


"WBPT1 . 


TTACGAGAAGCGGGTA 
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Gene 



(SEQ ID NO: 37) (SEQ ID NO: 946) 



Oligo: 



WBP11 



lTTATGAGAAGTGGGTA 



(SEQ ID NO: 37) (SEQ ID NO: 947) 



WBP11 

(SEQ ID NO: 37) 
WBPlT 



lGGGGGCGATTTTCGG 
|(SEQ ID NO: 948) 

\AGGGGGTGATTTTTGG 



(SEQ ID NO: 37) (SEQ ID NO: 949) 



WBP11 



"AGCGTCGTTTGATT 



(SEQ ID NO: 37) (SEQ ID NO: 950) 



WBP11 



fTTTTAGTGTTGTTTGATT 



WBP11 



WBP11 
(SEQ ID NO: 37) 



(SEQ ID NO: 37) (SEQ ID NO: 95 1 ) 



lGTTCGTTTTATTGCGT 



(SEQ ID NO: 37) (SEQ ID NO: 952) 



GAGTTTGTTTTATTGTGT 
ksEQ ID NO: 953) 



(MX!) 

(SEQ ID NO: 38) 



.CGCGCGAAAGTAAA 
|(SEQ ID NO: 576) 



(MX1) 

(SEQ ID NO: 38) 



fTTGGGAATGTGTGAAA 
(SEQ ID NO: 577) 



(MX1) 

(SEQ ID NO: 38) 



n CGAGTTGGGTCGAGA 
|(SEQ ID NO: 578) 



(MX1) 

(SEQ ID NO: 38) 



TTTGAGTTGGGTTGAGA 
(SEQ ID NO: 579) 



(MX1) 

(SEQ ID NO: 38) 



"ATGCGCGGGAAGATT 
|(SEQ ID NO: 580) 



(MX1) 

(SEQ ID NO: 38) 
(MX1) 
(SEQ ID NO: 38) 



TATGTGTGGGAAGAT 
:SEQIDNO: 581) 

lTTTACGGTTGCGCGG 
|(SEQ ID NO: 582) 



(MX1) 

(SEQ ID NO: 38) 



ATGGTTGTGTGGGTTA 
;SEQ ID NO: 583) 



N-N. . 

(SEQ ID NO: 39) 



iGGCGTTTATAGTCGGT 
[SEQ ID NO: 584) 



-62- 



Gene 


Oligo: 


N.N. 

(SEQ ID NO: 39) 


AGGTGTTTATAGTTGGT 
(SEQ ID NO: 585) 


N.N. 

(SEQ ID NO: 39) 


nTCGAGTTCGGAGTA 
(SEQ ID NO: 586) 


N.N. 

(SEQ ED NO: 39) 


TTTTGAGTTTGGAGTAG 
(SEQ ID NO: 587) 


N.N. 

(SEQ ID NO: 39) 


TTGTCGGTCGTAGCGG 
(SEQ ID NO: 588) 


N.N. 

(SEQ ID NO: 39) 


TTTGTTGGTTGTAGTGG 
(SEQ ID NO: 589) 


N.N. 

(SEQ ID NO: 39) 


TTCGTTACGGCGGTAG 
(SEQ ID NO: 590) 


N.N. 

(SEQ ID NO: 39) 


AGTTTGTTATGGTGGT 
(SEQ ID NO: 591) 


APP 

(SEQ ID NO: 40) 


TGAAACGAGGCGGAGA 
(SEQ ID NO: 592) 


APP 

(SEQ ID NO: 40) 


TGAAATGAGGTGGAGA 
(SEQ ID NO: 593) 


APP 

(SEQ ID NO: 40) 


GACGTTGCGTTTTCGG 
(SEQ ID NO: 594) 


APP 

(SEQ ID NO: 40) 


GGATGTTGTGTTTTTGG 
(SEQ ID NO: 595) 


APP 

(SEQ ID NO: 40) 


TTTTTAGCGGGTCGGA 
(SEQ ID NO: 596) 


APP 

(SEQ ID NO: 40) 


TTTTTAGTGGGTTGGA 
(SEQ ID NO: 597) 


APP 

(SEQ ID NO: 40) 


GGACGTTCGTAAGCGG 
(SEQ ID NO: 598) 


APP 

(SEQ ID NO: 40) 


GGATGTTTGTAAGTGG 
(SEQ ID NO: 599) 


ORC4L 

(SEQ ID NO: 41) 


TTATACGCGTTGTTTAT 
(SEQ ID NO: 600) 


DRC4L ~~ 


TGTATTATSTGTGTTGTTT 



62 
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Gene 


Oligo: ~1 


(SEQ ID NO: 41) 


(SEQ ID NO: 601) 


ORC4L 

(SEQ ID NO: 41) 


AGCGTGACGGTTCGAG ~ 
(SEQ ID NO: 602) 


ORC4L 

(SEQ ID NO: 41) 


AGTGTGATGGTTTGAG 
(SEQ ID NO: 603) 


ORC4L 

(SEQ ID NO: 41) 


ATTAGGCGAGTTTCGT 
(SEQ ID NO: 604) 


ORC4L 

(SEQ ID NO: 41) 


1 T AGGTG AGTTTTGTTT 
(SEQ ID NO: 605) 


NETOl 

(SEQ ID NO: 42) 


TACGTTCGGTTTTACGA 
(SEQ ID NO: 606) 


NETOl 

(SEQ ID NO: 42) 


TTATGTTTGGTTTTATGAT 
(SEQ ID NO: 607) 


NETOl 

(SEQ ID NO: 42) 


TTACGTCGGTTTCGAT 
(SEQ ID NO: 608) 


NETOl 

(SEQ ID NO: 42) 


ITTATGTTGGTTTTGATT 
(SEQ ID NO: 609) 


NETOl 

(SEQ ID NO: 42) 


TTCGGTTTCGGGAAAG 

(SEQ ED NO: 610) 


NETOl 

(SEQ ID NO: 42) 


TTTGGTTTTGGGAAAGG 
(SEQ ID NO: 611) 


NETOl 

(SEQ ID NO: 42) 


rGTCGTACGTGTTTAT 
'SEQ ID NO: 612) 


NETOl 

(SEQ ID NO: 42) ( 


AATTTTTGTTGTATGTGT 
[SEQ ID NO: 613) 


TBC1D3 : 
(SEQ ID NO: 43) ( 


rATTCGCGGGCGGTTT 

SEQ ID NO: 988) 


TBC1D3 ] 
(SEQ ID NO: 43) , ( 


rAGTATTTGTGGGTGG 
SEQ ID NO; 989) 


rBClD3 / 
(SEQ ID NO: 43) ( 


VTTCGGCGGGAGATTA 
SEQ ID NO:, 990) , 


rBClD3 (■ 
(SEQ ID NO: 43) ( 

1 


VGTAAATTTGGTGGGA 
SEQ ID NO: 991) 
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Gene 
TBC1D3 



(SEQ ID NO: 43) (SEQ ID NO: 992) 



TBC1D3 



(SEQ ID NO: 43) (SEQ ID NO: 993) 



TBC1D3 



\Oligo: 

X3ATTAGTCGAAAGAGT 



rAGATTAGTTGAAAGAGT 



ATATTTCGGGGTTTTAA 



(SEQ ID NO: 43) (SEQ ID NO: 994) 



TBC1D3 



'ATATTTTGGGGTTTTAAA 



(SEQ ED NO: 43) (SEQ ID NO: 995) 



GRB7 



lTAGTTTCGTTATTTGTAT 



(SEQ ID NO: 44) (SEQ ID NO: 1062) 



GRB7 



GGTATAGTTTTGTTATTTG 



(SEQ ID NO: 44) (SEQ ID NO: 1063) 



GRB7 



(SEQ ID NO: 44) (SEQ ID NO: 1064) 



AGTACGGGGTGTA 



GRB7 



TTTTAGTATGGGGTGTA 



(SEQ ID NO: 44) (SEQ ID NO: 1065) 



GRB7 



GGCGTTATAGTTACGTTT 



(SEQ ID NO: 44) |(SEQ ID NO: 1066) 



GRB7 



(SEQ ID NO: 44) |(SEQ ID NO: 1067) 



rGGTGTTATAGTTATGTT 



GRB7 



GTTTATCGAAGGTAGA 



(SEQ ID NO: 44) (SEQ ID NO: 1068) 



GRB7 



GTTTATTGAAGGTAGAA 



(SEQ ID NO: 44) |(SEQ ID NO: 1069) 



CYP2D6 



GAGATCGCGTTTTCGT 



(SEQ ID NO: 45) KSEQ ID NO: 844) 



CYP2D6 



(SEQ ID NO: 45) KSEQ ID NO: 845) 



CYP2D6 



(SEQ ID NO: 45) KSEQ ED NO: 846) 



.GAGATTGTGTTTTTGT 



lTTCGCGGCGAGGATA 



CYP2D6 



[ATTTGTGGTGAGGAT 



(SEQ ED NO: 45) (SEQ ED NO: 847) 



CYP2D6 



rTCGTTTCGGGGACGT 



64 
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Gene 
(SEQ ID NO: 45) 



CYP2D6 



(SEQ ID NO: 45) 

CYP2D6 

(SEQ ID NO: 45) 



CYP2D6 

(SEQ ID NO: 45) 



CDK6 

(SEQ ID NO: 46) 



Oligo: 
(SEQ ID NO: 848) 



GTTGTTTTGGGGATGTG 
(SEQ ID NO: 849) 

TAAGTAGCGTCGATAG 
(SEQ ID NO: 850) 



AAGTAGTGTTGATAGGG 
(SEQ ID NO: 851) 



T>K6 



(SEQ ID NO: 46) 



:DK6 
(SEQ ID NO: 46) 



CDK6 

(SEQ ID NO: 46) 



TTTTGGAGTAGGTGAG 
(SEQ ID NO: 869) 



CDK6 

(SEQ ID NO: 46) 
CDK6 

(SEQ ID NO: 46) 
CDK6 
(SEQ ID NO: 46) 



TACGAATGCGTGGCGG 
(SEQ ID NO: 866) 

TATGAATGTGTGGTGGA 
(SEQ ID NO: 867) 



TTTCGGAGTAGGCGAG 
(SEQ ID NO: 868) 



TACGTTAGTTTCGCGG 
(SEQ ID NO: 870) 

ATGTTAGTTTTGTGGG 
(SEQ ID NO: 871) 

ATTGAGACGCGTTTGG 
(SEQ ID NO: 872) 



CDK6 

(SEQ ID NO: 46) 



GAGATGTGTTTGGGTA 
(SEQ ID NO: 873) 



(Chr. Ipl3.2) 
(SEQ ID NO: 47) 



(Chr. Ipl3.2) 
(SEQ ID NO: 47) 



TAAATTCGACGGGTTT 
(SEQ ID NO: 1054) 



ATTTGATGGGTTTTTGT 
(SEQ ID NO: 1055) 



(Chr, lpl3.2) 
(SEQ ID NO: 47) 



TnTCGTTCGGCGGAG 
(SEQ ID NO: 1056) 



(Chr, lpl3.2) 
(SEQ ID NO: 47) 



(Chr. Ipl3.2) 
(SEQ ID NO: 47) 



TTTGTTTGGTGG AGGTT 
(SEQ ID NO: 1057) 



TTCGCGTTTATCGTGT 

» r 

(SEQ ID NO: 1058) 
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Gene < 


Oligo: 


(Chr. Ipl3.2) 
(SEQ ID NO: 47) 


rGGTTTGTGTTTATTGT 
:SEQ ID NO: 1059) 


(Chr. Ipl3.2) 
(SEQ ID NO: 47) 


rTTCGCGGTTCGTAGT 
(SEQ ID NO: 1060) 


(Chr. Ipl3.2) 
(SEQ ID NO: 47) 


ITTGTGGTTTGTAGTTTA 

■ * 

(SEQ ID NO: 1061) 


(Chr. 17q25.1) 
(SEQ ID NO: 48) 


ITAGGTCGGGAGGAAA 
(SEQ ID NO: 614) 


(Chr. 17q25.1) 
(SEQ ID NO: 48) 


TTAGGTTGGGAGGAAA 
(SEQ ID NO: 615) 


(Chr. 17q25.1) 
(SEQ ED NO: 48) 


TTAGACGTGGGGCGAT 
(SEQ ID NO: 616) 


(Chr. 17q25.1) 
(SEQ ID NO: 48) 


TTAGATGTGGGGTGAT 
(SEQ ID NO: 617) 


(Chr. 17q25.1) 
(SEQ ID NO: 48) 


TAAGGTACGAGCGTGT 
(SEQ ID NO: 618) 


(Chr. 17q25.1) 
(SEQ ID NO: 48) 


AAGGTATGAGTGTGTG 
(SEQ ID NO: 619) 


(Chr. 17q25.1) 
(SEQ ID NO: 48) 


GTAGAGTACGAGAGATT 
(SEQ ID NO: 620) 


(Chr. 17q25.1) 
(SEQ ID NO: 48) 


GGTAGAGTATGAGAGAT 
(SEQ ID NO: 621) 


ABCA8 

(SEQ ID NO: 49) 


AT 1TGGTTTCGAAGTTT 
(SEQ ID NO: 996) 


ABCA8 

(SEQ ID NO: 49) 


TATTTGGTTTTGAAGTTT 
(SEQ ID NO: 997) 


ABCA8 

(SEQ ID NO: 49) 


ITITCGGAATTCGGGT 
(SEQ ID NO: 998) 


ABCA8 

(SEQ ID NO: 49) 


TTTTGGAATTTGGGTGT 
(SEQ ID NO: 999) 


ABCA8 

(SEQ ID NO: 49) 


TTTCGGTTTTT AACGGT : 
(SEQ ID NO: 1000) 


ABCA8 


TTTTGGrilFTTAATGGTG -- - — 
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Lrene 


Oligo: 


(SEQ ID NO: 49) 


(SEQ ID NO: 1001) 


AxJLAo 

(SEQ ID NO: 49) 


AAAA I'll ACGAGGGGA 
(SEQ ID NO: 1002) 


A "D A O 

(SEQ ID NO: 49) 


Tl AAAA 111 ATUAGGGGA 
(SEQ ID NO: 1003) 


(Cnr. Izql4.3) 
(SEQ ID NO: 50) 


ATG ACGATG A'l 1 GGCGA 
(SEQ ID NO: 622) 


(cnr. lZql4.3) 
(SEQ ID NO: 50) 


GATGATGATTGGTGAGT 
(SEQ ID NO: 623) 


^LJtir. lZql4.3) 
(SEQ ED NO: 50) 


TTATGACGTTTAATCGT 
(SEQ ID NO: 624) 


(^nr. izql4.3) 
(SEQ ID NO: 50) 


AGTTATGATGTTTAATTGT 
(SEQ ID NO: 625) 


(Cnr. lZql4.3) 
(SEQ ID NO: 50) 


AATCGAACU 1 1GGCGT 
(SEQ ID NO: 626) 


(Cnr. 12ql4.3) 
(SEQ ID NO: 50) 


AAATTGAAI G TTGGTGT 
(SEQ ID NO: 627) 


(Cnr. 8ql2.1) 
(SEQ ID NO: 51) 


TATTC'GGGTri CGCGA 
(SEQ ID NO: 1070) 


(Cnr. eql2.1) 
(SEQ ID NO: 51) 


AT1TGGG1T1TGTGAG 
(SEQ ID NO: 1071) 


(Cnr. oqlz.l) 
(SEQ ID NO: 51) 


TATlXil'lACGUGTCGA 
(SEQ ID NO: 1072) 


(Cnr. oqlz.l) 
(SEQ ID NO: 51) 


ATTGTTATGTGTTGATTT 
(SEQ ID NO: 1073) 


(Cnr. oqlz.l) 

(SEQ ID NO: 51) I 


GACGTGTAGGTCGTAT 
[SEQ ID NO: 1074) 


(^nr. oqlz.l j I 
(SEQ ID NO: 51) | 


LjATGTGTAGGTTGTATT 
;SEQ ID NO: 1075) 


(Chr. 8ql2.1) 

(SEQ ID NO: 51) ( 


ITCGGGAACGATTTTT 
SEQ ID NO: 1076) 


(Chr. 8ql2.1) ( 
(SEQ ID NO: 51) ( 


3GGTTTGGGAATGATT 
SEQ ID NO: 1077) 
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Gene < 


Oligo: 


MARK2 

(SEQ ED NO: 52) I 


ATATTTCGGGGGAAGT 
[SEQ ID NO: 628) 


MARK2 

(SEQ ID NO: 52) 1 


rATATTTTGGGGGAAGT 
^SEQ ID NO: 629) 


MARK2 

(SEQ ID NO: 52) 


ITTCGTATTTGTCGGA 
(SEQ ID NO: 630) 


MARK2 

(SEQ ID NO: 52) 


ITTGTATTTGTTGGAGT 
(SEQ ID NO: 631) 


MARK2 

(SEQ ID NO: 52) 


GGTTATATCGTAGGGTA 
(SEQ ID NO: 632) 


MARK2 

(SEQ ID NO: 52) 


GGGTTATATTGTAGGGT 
(SEQ ID NO: 633) 


MARK2 

(SEQ ID NO: 52) 


AGGGGGACGAATTAGG 
(SEQ ID NO: 634) 


MARK2 

(SEQ ID NO: 52) 


GAGGGGGATGAATTAG 
(SEQ ID NO: 635) 


ELK1 

(SEQ ID NO: 53) 


GGTCGGCGTTGATTTTA 
(SEQ ID NO: 920) 


ELK1 

(SEQ ID NO: 53) 


GGTTGGTGTTGATTTTA 
(SEQ ID NO: 921) 


ELK1 

(SEQ ID NO: 53) 


GTCGGGATTCGAACGG 
(SEQ ID NO: 922) 


ELK1 

(SEQ ID NO: 53) 


GTTGGGATTTGAATGG 
(SEQ ID NO: 923) 


ELK1 

(SEQ ID NO: 53) 


GTCGGAAGTTTCGGGA 
(SEQ ID NO: 924) 


ELK1 

(SEQ ID NO: 53) 


GTTGGAAGTTTTGGGAT 
(SEQ ID NO: 925) 


ELK1 

(SEQ ID NO: 53) 


ATATCGTAGGGTAGGCGG 
(SEQ ID NO: 926) 


ELK1 

(SEQ ID NO: 53) 


ATATTGTAGGGTAGGTGG 
(SEQ ID NO: 927) ' 


Q8WUT3 


TAGAACGGCGTGGGAT 
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(SEQ ID NO: 54) 


(SEQ ID NO: 636) 


08WUT3 

(SEQ ID NO: 54) 


TAG A ATGGTGTGOG A T 

*■ / iVJ/Vf\ X VJVJ 1 VJ 1 VJVJ VJ/"x. J. 

(SEQ ID NO: 637) 


08WUT3 

(SEQ ID NO: 54) 


GTCGCG ATGT A frTT A CdT 

(SEQ ID NO: 638) 


08WUT3 
(SEQ ID NO: 54) 


GTTGTG A TOT A CYTT A TriT 

VJ x x vj J. KJrx 1 VJ x <rA. VJ X X *\ X VJ X 

(SEQ ID NO: 639) 


08WUT3 
(SEQ ID NO: 54) 


TTAGTTTCGGG A TPGG 
(SEQ ID NO: 640) 


08WUT3 
(SEQ ID NO: 54) 


TTTAGTTTTGOfrATTOO 

V_J X X X X V_l V 1 V 1 X X VJVJ 

(SEQ ID NO: 641) 


Q8WUT3 

(SEQ ID NO: 54) 


TTCGTTTTTCnfrrJATA 
(SEQ ID NO: 642) 


08WUT3 

(SEQ ID NO: 54) 


TTTGTTTTTTnO n ATA A A 

XXX VJ X X X X X X VJVJVJ^VX Zi-iVc\ 

(SEQ ID NO: 643) 


CGB 

Jk^r 

(SEQ ID NO: 55) 


TT ACGTPGTGGTTTTT A 

-*- ^xv^vj X v-»vJ X VJVJ X X X X X ax 

(SEQ ID NO: 954) 


CGB 

(SEQ ID NO: 55) 


iTATnTTOTHfrTTTiT A CI 
A x vj x x vj x vjvj x x x x x /\vj 

(SEQ ID NO: 955) 


CGB 

(SEQ ID NO: 55) 


vJVJV^VJ 1 VJ-ttltV xxx V^VJ 1 VJVJ 1 

(SEQ ID NO: 956) 


CGB 

(SEQ ID NO: 55) 


VJVJ X VJ X VJ/V/-V X X X X VJ X VJVJ 1 

(SEQ ID NO: 957) 


CGB 

JL~M 

(SEQ ID NO: 55) 


XXX V^VJ/AVJ XXI Jttl X X V^VJvJ x 1 

(SEQ ID NO: 958) 


CGB 

(SEQ ID NO: 55) i 


FTTTG A GTTT A TTTHnTT 

x X X X VJAYVJ XXI xV XXX VJVJ X X I 

:SEQ ID NO: 959) 


CGB 

(SEQ ID NO: 55) ( 


rTATrTrPGATGTnPGT i 

ini V^VIV/VJAl VI 1 VJVVJ1 1 

[SEQ ID NO: 960) 


CGB 

(SEQ ID NO: 55) 


\TTATTGTGATGTGTGT 
;SEQ ID NO: 961) 


BSG : 
(SEQ ID NO: 56) 


rACGGTTCGCGTTGTT 
SEQ ID NO: 644) 
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Gene 
BSG 

(SEQ ID NO: 56) 



Oligo: 

GGAGTATGGTTTGTGT 
(SEQ ID NO: 645) 



BSG 

(SEQ ID NO: 56) 



IGTAAGGTTCGGCGAGA 
(SEQ ID NO: 646) 



SG 

|(SEQ ID NO: 56) 



GTAAGGTTTGGTGAGA 
(SEQ ID NO: 647) 



SG 

i(SEQ ID NO: 56) 



ITACGTTTTCGGGAAG 
(SEQ ID NO: 648) 



SG 

|(SEQ ID NO: 56) 



TTATGTTTTTGGGAAGG 
(SEQ ID NO: 649) 



SG 



(SEQ ID NO: 56) 



(TACGTTTCGAGGATCGG 
(SEQ ID NO: 650) 



SG 

;SEQ ID NO: 56) 



ITATGTTTTGAGGATTGG 
SEQ ID NO: 651) 



iCKDK 
|(SEQ ID NO: 57) 



iGGGCGTTAGGCGGATT 
(SEQ ID NO: 652) 



iCKDK 



(SEQ ID NO: 57) 



TGGGTGTTAGGTGGAT 
(SEQ ID NO: 653) 



iCKDK 

;SEQ ID NO: 57) 



lAGAGCGGTTAGCGTAG 
(SEQ ID NO: 654) 



iCKDK 
|(SEQ ID NO: 57) 



fTGAGAGTGGTTAGTGT 
(SEQ ID NO: 655) 



iCKDK 
|(SEQ ID NO: 57) 



1ATAGAGGGCGTGAATT 
(SEQ ID NO: 656) 



iCKDK 
|(SEQ ID NO: 57) 



AGAGGGTGTGAATTTT 
(SEQ ID NO: 657) 



iCKDK 
|(SEQ ID NO: 57) 



TAGGATTTACGAGGAAA 
(SEQ ID NO: 658) 



CKDK 
|(SEQ ID NO: 57) 



AGGATTTATGAGGAAAAT 
(SEQ ID NO: 659) 



IOX8 
|(SEQ ID NO: 58) 



ITTTCGGTTCGAAGTA 
(SEQ ID NO: 660) 



TTTTGGTTTGAAGTAGG 
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Oligo: I 


(SEQ ID NO: 58) 


(SEQ ID NO: 661) 


(SEQ ID NO: 58) 


AUUTCGT1T1TATCGA 1 
(SEQ ID NO: 662) 


CAYS 
pUAo 

(SEQ ID NO: 58) 


AGG 1 1 u 1TTTTA 1 1 G AGT 
(SEQ ID NO: 663) 


pUAo 

(SEQ ED NO: 58) 


Cj 1 AGTTACGGGGCGTT i 
(SEQ ID NO: 664) 


(SEQ ID NO: 58) 


ijl Aul 1 AIGGGGTGTT 1 
(SEQ ID NO: 665) 


p\JAo 

(SEQ ID NO: 58) 


lul Lu 1 A 1 AGGCGGTT 
(SEQ ID NO: 666) 


(SEQ ID NO: 58) 


1 1 <j 1 1 G 1 A rAGGTGGTT 
(SEQ ID NO: 667) 


Am 

(SEQ ID NO: 59) 


111 (_G 1 GGCGGAGAAT 
(SEQ ID NO: 820) 


(SEQ ID NO: 59) 


TTTTGTGGTGGAGAAT I 
(SEQ ID NO: 821) 


(SEQ ID NO: 59) 


1AL.GGA1 Al 1 1CGG1 1 
(SEQ ID NO: 822) 


(SEQ ID NO: 59) 


AATTATGGATAl l l l GGTT 
(SEQ ID NO: 823) 


(SEQ ID NO: 59) 


TTACGATTCGTAGGTT j 
(SEQ ID NO: 824) 


(SEQ ID NO: 59) 


1^ A r ¥* w I 1 A '1*1* A rrv" , i A mmrn/^rp a M/>«nn S 

F ATT ATT ATG A 1 1 1 G 1 AGGT 
(SEQ ID NO: 825) 


(SEQ ID NO: 60) 


AGTrTTGGGCGCGATTT j 
:SEQ ID NO: 668) 


(SEQ ID NO: 60) 


^GTTTTGGGTGTGATTT 
;SEQ ID NO: 669) 


SEMA4B 

(SEQ ID NO: 60) 


\GCGAATAGATTGCGGAT 
;SEQ ID NO: 670) 


SEMA4B 

(SEQ ID NO: 60) 


\GTGAATAGATTGTGGAT 
SEQ ID NO: 671) 



-72- 



Gene 


Oligo: 


SEMA4B 

(SEQ ID NO: 60) 


AGCGATTAGATTGCGGAT 
CSEQ ID NO: 672) 


SEMA4B 

(SEQ ID NO: 60) 


AGTGATTAGATTGTGGAT 
(SEQ ID NO: 673) 


SEMA4B 

(SEQ ID NO: 60) 


rAGGCGTTCGATTTTT 
(SEQ ID NO: 674) 


SEMA4B 

(SEQ ID NO: 60) 


GGGTAGGTGTTTGATT 
(SEQ ID NO: 675) 


APC 

(SEQ ID NO: 2) 


GGTTTCGTTTAATCGT 
(SEQ ID NO: 928) 


APC 

(SEQ ID NO: 2) 


GGGTTTTGTTTAATTGTA 
(SEQ ID NO: 929) 


APC 

(SEQ ID NO: 2) 


TTCGTATTTAGCGGAT 
(SEQ ID NO: 930) 


APC 

(SEQ ID NO: 2) 


GGTTTGTATTTAGTGGA 
(SEQ ID NO: 931) 


APC 

(SEQ ID NO: 2) 


ATCGGCGGGTTTTCGA 
(SEQ ID NO: 932) 


APC 

(SEQ ID NO: 2) 


AATTGGTGGGTTTTTGA 
(SEQ ID NO: 933) 


APC 

(SEQ ID NO: 2) 


ATTTTCGAGTTCGGTA 
(SEQ ID NO: 934) 


APC 

(SEQ ID NO: 2) 


ITTTTGAGTTTGGTAGT 
(SEQ ID NO: 935) 


CDKN2A 
(SEQ ID NO: 3) 


GGCGTTGTTTAACGTAT 
(SEQ ID NO: 676) 


CDKN2A 
(SEQ ID NO: 3) 


GGGTGTTGTTTAATGTA 
(SEQ ID NO: 677) 


CDKN2A 
(SEQ ID NO: 3) 


AACGTATCGAATAGTTACGG 
(SEQ ID NO: 678) 


CDKN2A 
(SEQ ID NO: 3) 


AATGTATTGAATAGTTATGG 

i 

(SEQ ID NO: 679) 


CDKN2A 


TACGGTCGGAGGTCGA 
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Uligo: 


(SEQ ID NO: 3) 


(SEQ ID NO: 680) 


CD KIN 2 A 
(SEQ ID NO: 3) 


1 A 1 CjCj 1 1 CjCj ACjCj I 1 Cr A 

(SEQ ID NO: 681) 


CSPG2 

(SEQ ID NO: 4) 


TTCGGTTAGTTTCGTAT 
(SEQ ID NO: 904) 


CSPG2 

(SEQ ID NO: 4) 


TTTTGGTT AG 1111 GT ATT 
(SEQ ID NO: 905) 


(SEQ ID NO: 4) 


1 1 CGGGlJ/Al 1 ACGTTT 
(SEQ ID NO: 906) 


(SEQ ID NO: 4) 


111 IvjCjvjI 1A1 IAICjI 111 

(SEQ ID NO: 907) 


Uor'vjrZ 

(SEQ ID NO: 4) 


1 1 1 ACjrlCvjrCOl ACjCCjI 

(SEQ ID NO: 908) 


(SEQ ID NO: 4) 


A rTTAGTTGTGTAGTGTT 
(SEQ ID NO: 909) 


CSPG2 

(SEQ ID NO: 4) 


AATTCGCGAGTTTAGA 
(SEQ ID NO: 910) 


CSPG2 

(SEQ ID NO: 4) 


/—^ A A A A A A A TTT/ — ' r T T /^' A /*if|vii 

GAAAAAAATTTGTGAGTT 
(SEQ ID NO: 911) 


ERBB2 

(SEQ ID NO: 5) 


TGTGAGAACGGTTGTA 
(SEQ ID NO: 912) 


ERBB2 

(SEQ ID NO: 5) 


TGAGAATGGTTGTAGG 
(SEQ ID NO: 913) 


ERBB2 

(SEQ ED NO: 5) 


TTAGGCG 1 1 1 CGGCGT 
(SEQ ID NO: 914) 


(SEQ ID NO: 5) 


1TTAGGTG TTTTGGl G r 
(SEQ ID NO: 915) 


CDDDO 

(SEQ ID NO: 5) 


TAGGTTTGCGCGAAGA 
(SEQ ID NO: 916) 


FRRF.9 

(SEQ ID NO: 5) 


(SEQ ID NO: 917) 


ERBB2 

(SEQ ID NO: 5) 


TAATTATCGGAGAAGGA 
(SEQ ID NO: 918) 
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Oligo: 


ERBB2 

(SEQ ID NO: 5) 


TAATTATTGGAGAAGGAG 
(SEQ ID NO: 919) 


STMN1 

(SEQ ID NO: 6) 


TTAGGCGGTTCGGATT 
(SEQ ID NO: 1012) 


STMN1 

(SEQ ID NO: 6) 


TTAGGTGGTTTGGATT 
(SEQ ID NO: 1013) 


STMN1 

(SEQ ID NO: 6) 


rATCGGTTCGGGAATT 
(SEQ ID NO: 1014) 


STMN1 

(SEQ ID NO: 6) 


TATTGGTTTGGGAATTT 
(SEQ ID NO: 1015) 


STMN1 

(SEQ ID NO: 6) 


TTTCGCGCGGAGGTTA 
(SEQ ID NO: 1016) 


STMN1 

(SEQ ID NO: 6) 


TTTTGTGTGGAGGTTA 
(SEQ ID NO: 1017) 


STMN1 

(SEQ ID NO: 6) 


GGTAAGAACGTATATAGT 
(SEQ ID NO: 1018) 


STMN1 

(SEQ ID NO: 6) 


TGGTAAGAATGTATATAGT 
(SEQ ID NO: 1019) 


STMN1 

(SEQ ID NO: 6) 


TTTCGGTTAATGCGGA 
(SEQ ID NO: 1020) 


STMN1 

(SEQ ID NO: 6) 


TTTTTGGTTAATGTGGA 
(SEQ ID NO: 1021) 


STMN1 

(SEQ ID NO: 6) 


TACGTTCGCGATTTGT 
(SEQ ID NO: 1022) 


STMN1 

(SEQ ID NO: 6) 


AGGGTTATGTTTGTGA 
(SEQ ID NO: 1023) 


STMN1 

(SEQ ID NO: 6) 


GATACGTCGGTGTCGG 
(SEQ ID NO: 1024) 


STMN1 

(SEQ ID NO: 6) 


rGATATGTTGGTGTTGG 
[SEQ ID NO: 1025) 


STMN1 

(SEQ ID NO: 6) 


ITACGGCGAGATTATT 
[SEQ ID NO: 1026) 


STMN1 


ITTTATGGTGAGATTATTT 
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Oligo: 


(SEQ ID NO: 6) 


(SEQ ID NO: 1027) 


OTI^I 1 

olivll 

(SEQ ID NO: 7) 


Ai I AAICvjILajI ILAjo 

(SEQ ID NO: 880) 


b IKll 

(SEQ ID NO: 7) 


uA 1 1 AA IIvjIIvjIII vjvjCj 

(SEQ ID NO: 881) 


CTT7" "1 1 

(SEQ ID NO: 7) 


IAAICCjI 1 ACjCCjCjCCjCj 
(SEQ ID NO: 882) 


b 1 Jvll 

(SEQ ID NO: 7) 


1 1 AA 1 1 vj 1 1 A(jr 1 vjIj 1 vjvj 

(SEQ ID NO: 883) 


o 1 Jvl 1 

(SEQ ID NO: 7) 


vj 1 v^VJ 1111 VxVJv^vj Avjvj A 

(SEQ ID NO: 884) 


O 1 JVl 1 

(SEQ ID NO: 7) 


vj 1 1 VJ 1 111 lul vjAvjvjAvj 

(SEQ ID NO: 885) 


i> 1 Jvll 

(SEQ ID NO: 7) 


T A A T/~* A i~~f~~ /^V^TT/^T A 

1 AA1 vjAvjCCjCvj 1 1 VjI A 

(SEQ ID NO: 886) 


o 1 ivl I 

(SEQ ID NO: 7) 


A TY"1 A PTPTPTTP'T A TTT 

Al VjAvjIvjI vjI IvjIAI 1 1 

(SEQ ID NO: 887) 


UA9 

(SEQ ID NO: 8) 


ATfT'TTTPP ATA A 'l»| v | ,l | v r 

AICjCjI 1 ICAjAI AA1 1111 

(SEQ ID NO: 682) 


CA9 

(SEQ ID NO: 8) 


ATGGl 1 1 10A1AA1 1 1 1 i 1 
(SEQ ID NO: 683) 


CA9 

(SEQ ID NO: 8) 


TGTACGTATAGTTCGTA 
(SEQ ID NO: 684) 


(SEQ ID NO: 8) 


TT A A T/^T A T/^T ATA /"""TTT/^T 

1 1 AAICjI Al Cj1 Al AvjI 1 lvjl 

(SEQ ID NO: 685) 


o a n 

(SEQ ID NO: 8) 


ATA T A TPPTPTPTTrPP 

A 1 A 1 A 1 Cvj 1 vj 1 Cj 1 1 vjvjvj 

(SEQ ID NO: 686) 


(SEQ ID NO: 8) 


A T A T A TTP T/^" 1 T/T 1 " 1 'Hnn 

AIAIAIIvjIvjIvjI 1 vjvjvj 

(SEQ ID NO: 687) 


CA9 

(SEQ ID NO: 8) 


ATAGTTAGTGGTATGGT 
(SEQ ID NO: 688) 


CA9 ; 1 
(SEQ ID NO: 8) 


ATAGTTAGTTGTATGGTT 
(SEQ ID NO: 689) ■ " " r 
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Gene 


\Oligo: 


PAX6 

(SEQ ID NO: 9) 


VTA 11 0 TTTCGGTTGTTAG 
(SEQ ID NO: 690) 


PAX6 

(SEQ ID NO: 9) 


iTATTGTTTTGGTTGTTAG 
(SEQ ID NO: 691) 


PAX6 

(SEQ ID NO: 9) 


GGCGACGCGGTTAGTT 
(SEQ ID NO: 692) 


PAX6 

(SEQ ID NO: 9) 


GGTGATGTGGTTAGTT 
(SEQ ID NO: 693) 


PAX6 

(SEQ ID NO: 9) 


[TAGGTCGCGTAGATTT 
(SEQ ID NO: 694) 


PAX6 

(SEQ ID NO: 9) 


AGTTTAGGTTGTGTAGA 
(SEQ ID NO: 695) 


PAX6 

(SEQ ID NO: 9) 


TAGCGTATTTTTCGGT 
(SEQ ID NO: 696) 


PAX6 

(SEQ ID NO: 9) 


TAGTGTATTTTTTGGTTG 
(SEQ ID NO: 697) 


SFN 

(SEQ ID NO: 10) 


AGTAGGTCGAACGTTA 
(SEQ ID NO: 698) 


SFN 

(SEQ ID NO: 10) 


AGAGTAGGTTGAATGTT 
(SEQ ID NO: 699) 


SFN 

(SEQ ID NO: 10) 


ITG CG AAGAGCG AAAT 
(SEQ ID NO: 700) 


SFN 

(SEQ ID NO: 10) 


rGTGAAGAGTGAAATTT 
^SEQ ID NO: 701) 


SFN 

(SEQ ID NO: 10) 1 


ITCGAGGTGCGTGAGT 
:SEQ ID NO: 702) 


SFN 

(SEQ ID NO: 10) ( 


ITTGAGGTGTGTGAGTA 
:SEQ ID NO: 703) 


SFN [ 
(SEQ ID NO: 10) i 


rGTGCGATATCGTGTT 
SEQ ID NO: 704) 


SFN r 
(SEQ ID NO: 10) [ 


rGTGATATTGTGTTGGG 
;SEQ»ID NO: 705) 


S100A2 : f 


rTTAATTGCGGTTGTGTG 
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(SEQIDNO: 11) (SEQ ID NO: 786) 



S100A2 



S100A2 

(SEQ ID NO: 1 1) 
S100A2 

(SEQ ID NO: 1 1) 
S100A2 



(SEQ ID NO: 1 1 ) (SEQ ID NO: 787) 



S100A2 



(SEQ ID NO: 1 1) (SEQ ID NO: 788) 



(SEQ ID NO: 1 1) (SEQ ID NO: 791) 



S100A2 

(SEQIDNO: 11) 
IS100A2 



(SEQ ID NO: 1 1) (SEQ ID NO: 793) 



TFF1 



(SEQ ID NO: 12) (SEQ ID NO: 794) 



rFFl 



SEQ ID NO: 12) (SEQ ID NO: 795) 



Oligo: 



TTTAATTGTGGTTGTGTG 



TATATAGGCGTATGTATG 



TATATAGGTGTATGTATG 
(SEQ ID NO: 789) 

TGTATACGAGTATTGGA 
(SEQ ID NO: 790) 

fTATGTATATGAGTATTGGA 



AGTTTTAGCGTGTGTTTA 
(SEQ ID NO: 792) 

AGTTTTAGTGTGTGTTTA 



AGAATTTATCGTATAAAAAG 



AATTTATTGTATAAAAAGGT 



TFF1 

SEQ ID NO: 12) 
fTFFl 



(SEQ ID NO: 12) (SEQ ID NO: 797) 



GGACGTCGATGGTATT 
(SEQ ID NO: 796) 

AGGGATGTTGATGGTA 



AACGGTGTCGTCGAAA 
(SEQ ID NO: 12) (SEQ ID NO: 798) 



TFF1 



AATGGTGTTGTTGAAAT 



(SEQ ID NO: 12) (SEQ ID NO: 799) 



TGFBR2 



AAAACGTGGACGTTTT 



(SEQ ID NO: 13) (SEQ ID NO: 896) 



TGFBR2 



rAAAATGTGGATGTTTT 



(SEQ ID NO: 13) (SEQ ID NO: 897) 



TGFBR2 

(SEQ ID NO: 13) 



fTGAAAGTCGGTTAAAGT 
SEQ ID NO: 898) 
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Oligo: 


TGFBR2 

(SEQ ID NO: 13) l 


rGAAAGTTGGTTAAAGT 
^SEQ ID NO: 899) 


TGFBR2 

(SEQ ID NO: 13) l 


ITGGACGTCGAGGAGA 
[SEQ ID NO: 900) 


TGFBR2 

(SEQ ID NO: 13) 


ITGGATGTTGAGGAGA 
(SEQ ID NO: 901) 


TGFBR2 

(SEQ ID NO: 13) 


rxTTCGGGCGGAGAGA 
(SEQ ID NO: 902) 


TGFBR2 

(SEQ ID NO: 13) 


AA.GGTTTTTGGGTGGA 
(SEQ ID NO: 903) 


TP53 

(SEQ ID NO: 14) 


TATTAGGTCGGCGAGA 
(SEQ ID NO: 858) 


TP53 

(SEQ ID NO: 14) 


AGGTTGGTGAGAATTT 
(SEQ ID NO: 859) 


TP53 

(SEQ ID NO: 14) 


TTCGGTAGGCGGATTA 
(SEQ ID NO: 860) 


TP53 

(SEQ ID NO: 14) 


TTTTTGGTAGGTGGAT 
(SEQ ID NO: 861) 


TP53 

(SEQ ID NO: 14) 


ATATTTTGCGTTCGGG 
(SEQ ID NO: 862) 


TP53 

(SEQ ID NO: 14) 


ATATTTTGTGTTTGGGT 
(SEQ ID NO: 863) 


TP53 

(SEQ ID NO: 14) 


TACGACGGTGATACGT 
(SEQ ID NO: 864) 


TP53 

(SEQ ID NO: 14) 


TTTATGATGGTGATATGT 
(SEQ ID NO: 865) 


TP73 

(SEQ ID NO: 15) 


TTCGTTCGCGAAGTTA 
(SEQ ID NO: 706) 


TP73 

(SEQ ID NO: 15) 


GGTTTGTTTGTGAAGTTA 

! 

(SEQ ID NO: 707) 


PLAU 

(SEQ ID NO: 16) 


AAGAGGTCGTCGGGAT 
(SEQ ID NO: 708) 


PLAIT 


AAGAGGTTGTTGGGAT 
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LsllgO. 


(SEQ ID NO: 16) 


(SEQ ID NO: 709) 


PT ATT 

(SEQ ID NO: 16) 


TT A TC* C±r*Cin CZT A TTTT 
1 1 /\ 1 v^vjtv^vJvJvJ 1 /\ 1 111 

(SEQ ID NO: 710) 


PT ATT 

(SEQ ID NO: 16) 


i l vjvjr i 1 /\ 1 l Ur 1 uuu 1 /\ 1 

(SEQ ID NO: 711) 


PI .AT J 

(SEQ ID NO: 16) 


l l v-^ur/\ 111 v^vjr l l /\ l l /V 1 vj 

(SEQ ID NO: 712) 


PT ATI 

(SEQ ID NO: 16) 


Tn A TTTTnTT A TT A TTl A i~l 
111 vJ/\ 1 1 1 lul 1/V1 1 /\ x VJ AO 

(SEQ ID NO: 713) 


PLAT J 

(SEQ ID NO: 16) 


nrpfrTf^ a nrr^ a tttt a 

VJ 1 v-^Vj 1 VJ/\VJv^vJ/\ 111 1A 

(SEQ ID NO: 714) 


PLAU 

(SEQ ID NO: 16) 


TTGOTTfTTf^ A ClTa A TT 
x x vjvjr i x VJ x \JrWJ l Vjr/\ l 1 

(SEQ ID NO: 715) 


TMFFF? 

X 1Y1 1 iX X w 

(SEQ ID NO: 17) 


1 r\ 1 V^VJ 1 /\VJ 1 1 v_xVjr 1 1 L-Vjvj 

(SEQ ID NO: 874) 


1 1YXX-/X X 

(SEQ ID NO: 17) 


A TTfJT A fiTTTnTTTnriT 
AI lul f\vJ 111 vJ 111 VjVj 1 

(SEQ ID NO: 875) 


TA/TFKF9 
x iv i sr ,d 

(SEQ ID NO: 17) 


AAA mTTT A TrnHTTP 
/VrVAA^ij 1 1 1A1 V^OVJ 1 1 O 

(SEQ ID NO: 876) 


(SEQ ID NO: 17) 


A ATnTTT AT^fZf^TTf^t^ A 

/\J\ lul 1 iJ\l 1 OO 1 1 OOA 

(SEQ ID NO: 877) 


j. xvxx-»jrrz^ 

(SEQ ID NO: 17) 


TTPnT A f~l A A f~Z A ATA f^/" 1 r^/^nn A 

1 1 V-Ajr i /\0/Y/\0/\/Y 1 /YUvjUvj 1 A 

(SEQ ID NO: 878) 


TMFFF9 

X XVXX-/X X jL* 

(SEQ ID NO: 17) 


TTTrJT A A A ft A ATA TV^'TV^T A 
1 1 1 1 /\O/v/\0/\/Y 1/vlOlOlA 

(SEQ ID NO: 879) 


(SEQ ID NO: 18) 


TOfr*r^TTriT ATA r*czrr A 

1 VJV^vJVJ 1 1U1A1 /VL-VJ 1 AO 

(SEQ ID NO: 962) 


ESR1 

(SEQ ID NO: 18) 


TGTGTnnTTCrTATATrTT 

X VJ X VJ X \JVJ X X VJ VJ 1 

(SEQ ID NO: 963) 


ESR1 

(SEQ ID NO: 18) 


ITCGTGTTAGATTTCGATAT 
(SEQ ID NO: 964) 


ESR1 

(SEQ ID NO: 18) 


ITTGTGTTAGATTTTGATAT 
(SEQ ID NO: 965) 
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Gene 


Oligo: 


ESR1 

(SEQ ID NO: 18) 


AACGCGAAAGACGGAT 
(SEQ ID NO: 966) 


ESR1 

(SEQ ID NO: 18) 


ATAAATGTGAAAGATGGA 
(SEQ ID NO: 967) 


ESR1 

(SEQ ID NO: 18) 


GGGCGTACGAGGATTT 
(SEQ ID NO: 968) 


ESR1 

(SEQ ED NO: 18) 


GGGTGTATGAGGATTT 
(SEQ ID NO: 969) 


HSPB1 

(SEQ ID NO: 20) 


AGGGTATTCGTCGGTT 
(SEQ ID NO: 888) 


HSPB1 

(SEQ ED NO: 20) 


AGGGTATTTGTTGGTT 
(SEQ ED NO: 889) 


HSPB1 

(SEQ ID NO: 20) 


GAATTCGAGAGCGCGA 
(SEQ ED NO: 892) 


HSPB1 

(SEQ ED NO: 20) 


TGAATTTGAGAGTGTGA 
(SEQ ID NO: 893) 


RASSF1 

(SEQ ED NO: 21) 


AGTAAATCGGATTAGGA 
(SEQ ID NO: 852) 


RASSF1 

(SEQ ED NO: 21) 


AGTAAATTGGATTAGGAG 
(SEQ ID NO: 853) 


RASSF1 

(SEQ ED NO: 21) 


TACGGGTATTTTCGCGT 
(SEQ ID NO: 854) 


RASSF1 

(SEQ ED NO: 21) 


ATATGGGTATTTTTGTGT 
(SEQ ID NO: 855) 


RASSF1 

(SEQ ED NO: 21) 


TGCGAGAGCGCGTTTA 
(SEQ ID NO: 856) 


RASSF1 

(SEQ ID NO: 21) 


TTGTGAGAGTGTGTTTA 
(SEQ ID NO: 857) 


GRIN2D 

(SEQ ED NO: 24) 


ATTTCGATTTGGAGGCGG 
(SEQ ID NO: 716) 


GRIN2D . 

* * 

(SEQ ED NO: 24) 


ATTTTGATTTGGAGGTGG 
(SEQ ID NO: 717) 


PSAT1 

1 

j 


TTCGTCGGTUTIACGT 

9 
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Gene 


Oligo: 


(SEQ ID NO: 25) 


(SEQ ID NO: 718) 


PSAT1 

(SEQ ID NO: 25) 


TTTTGTTGGTGTT ATGT 
(SEQ ID NO: 719) 


PSAT1 

(SEQ ID NO: 25) 


GGCGAGTTCGGGTAGT 
(SEQ ID NO: 720) 


PSAT1 

(SEQ ID NO: 25) 


GGTGAGTTTGGGTAGT 
(SEQ ID NO: 721) 


PSAT1 

(SEQ ID NO: 25) 


ATAGTAAACGCGAGGA 
(SEQ ID NO: 818) 


PSAT1 

(SEQ ID NO: 25) 


AGTAAATGTGAGGAGG 
(SEQ ID NO: 819) 


PSAT1 

(SEQ ID NO: 25) 


AAGTTTTCGCGAGCGG 
(SEQ ID NO: 722) 


PSAT1 

(SEQ ID NO: 25) 


AAGTTTTTGTGAGTGG 
(SEQ ID NO: 723) 


PSAT1 

(SEQ ID NO: 25) 


AGGAAGTTCGGCGAGG 
(SEQ ID NO: 724) 


PSAT1 

(SEQ ID NO: 25) 


AGGAAGTTTGGTGAGG 
(SEQ ID NO: 725) 


CYP2D6 

(SEQ ID NO: 27) 


TACGACGATTTTCGTT 
(SEQ ID NO: 726) 


CYP2D6 

(SEQ ID NO: 27) 


GAGTATGATGATTTTTGT 
(SEQ ID NO: 727) 


CYP2D6 

(SEQ ID NO: 27) 


TTCGTCGATTAAGTCGG 
(SEQ ID NO: 728) 


CYP2D6 

(SEQ ID NO: 27) 


TTTGTTGATTAAGTTGGT 
(SEQ ID NO: 729) 


CYP2D6 

(SEQ ID NO: 27) 


GTGGCGCGAGTAGAGG 
(SEQ ID NO: 730) 


CYP2D6 

(SEQ ID NO: 27) 


GTGGTGTGAGTAGAGG 
(SEQ ID NO: 731) 


CYP2D6 

(SEQ ID NO: 27) 


AACGTTTACGTGTTCGT 
(SEQ ID NO: 732) 1 
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\Gene 


Oligo: 


CYP2D6 

(SEQ ID NO: 27) 


GTAATGTTTATGTGTTTGT 
(SEQ ID NO: 733) 


COX7A2L 
(SEQ ID NO: 28) 


AATTCGATCGCGGGTA 
(SEQ ID NO: 1086) 


COX7A2L 
(SEQ ID NO: 28) 


ATTTGATTGTGGGTAGA 
(SEQ ID NO: 1087) 


PLAU 

(SEQ ID NO: 30) 


TATTTGTCGCGTTGAT 
(SEQ ID NO: 1044) 


PLAU 

(SEQ ID NO: 30) 


ATTTGTTGTGTTGATGA 
(SEQ ID NO: 1045) 


PLAU 

(SEQ ID NO: 30) 


TGTAATTCGGGGATTT 
(SEQ ID NO: 1046) 


PLAU 

(SEQ ID NO: 30) 


TTGTAATTTGGGGATTT 
(SEQ ID NO: 1047) 


PLAU 

(SEQ ID NO: 30) 


AGGAAGTACGGAGAAT 
(SEQ ID NO: 1048) 


PLAU 

(SEQ ID NO: 30) 


AGGAAGTATGGAGAATT 
(SEQ ID NO: 1049) 


PLAU 

(SEQ ID NO: 30) 


TTCGTTGGAGATCGCGT 
(SEQ ID NO: 1050) 


PLAU 

(SEQ ID NO: 30) 


TTTGTTGGAGATTGTGT 
(SEQ ID NO: 1051) 


PLAU 

(SEQ ID NO: 30) 


TTGCGGAAGTACGCGG 
(SEQ ID NO: 1052) 


PLAU 

(SEQ ID NO: 30) 


TTGTGGAAGTATGTGG 
(SEQ ID NO: 1053) 


VTN 

(SEQ ID NO: 31) 


TTCGGGTTCGCGAAAG 
(SEQ ED NO: 1028) 


VTN 

(SEQ ID NO: 31) 


TTTGGGTTTGTGAAAG 
(SEQ ID NO: 1029) 


VTN 

(SEQ ID NO: 31) 


TTTTGTTCGCGTTGAA 
(SEQ ID NO: 1030) 


VTN- 


ITG'ITTGTG'ITGAAGTA "" ~ " 
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Gene 


Oligo: 


(SEQIDNO:31) 


(SEQIDNO: 1031) 


VTN 

(SEQIDNO: 31) 


TGGGTCGCGAGGTAGT 
(SEQ ID NO: 1032) 


VTN 

(SEQIDNO: 31) 


TGGGTTGTGAGGTAGT 
(SEQ ID NO: 1033) 


VTN 

(SEQIDNO: 31) 


TTCGATGGCGGTTTCGA 
(SEQ ID NO: 1036) 


VTN 

(SEQIDNO: 31) 


TTTGATGGTGGTTTTGA 
(SEQ ID NO: 1037) 


SULT1A1 
(SEQ ID NO: 32) 


TTCGAGTCGTTTTGAT 
(SEQ ID NO: 734) 


SULT1A1 
(SEQ ID NO: 32) 


TTTGAGTTGTTTTGATG 
(SEQ ID NO: 735) 


SULT1A1 
(SEQ ID NO: 32) 


TTCGTCGTGTACGGTT 
(SEQ ID NO: 736) 


SULT1A1 
(SEQ ID NO: 32) 


Til GTI GTGTA'l GGTTT 
(SEQ ID NO: 737) 


SULT1A1 
(SEQ ID NO: 32) 


AGGATTTCGTTTTCGG 
(SEQ ID NO: 738) 


SULT1A1 
(SEQ ID NO: 32) 


AGGATTTTGTTTTTGGG 
(SEQ ID NO: 739) 


SULT1A1 
(SEQ ID NO: 32) 


TTTTCGGTTGAAGTCGG 
(SEQ ID NO: 740) 


SULT1A1 
(SEQ ID NO: 32) 


TTTTTGGTTGAAGTTGG 
(SEQ ID NO: 741) 


PCAF 

(SEQ ID NO: 33) 


AGCGTCGGTACGTATA 
(SEQ ID NO: 986) 


PCAF 

(SEQ ID NO: 33) 


GGTAGTGTTGGTATGT 
(SEQ ID NO: 987) 


PRKCD 

(SEQ ID NO: 34) 


A111LUUU Tl CUUA 'I 1 

(SEQ ID NO: 742) 


PRKCD 

(SEQ ID NO: 34) 


GATTTTGTGTTTGGATT 
(SEQ ID NO: 743) 
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Gene \Oligo: 


EGR4 

(SEQIDNO: 1) 


AAGCGTATTTATCGGA 
(SEQ ID NO: 744) 


EGR4 

(SEQ ID NO: 1) 


GGAAGTGTATTTATTGGA 
(SEQ ID NO: 745) 


EGR4 

(SEQ ID NO: 1) 


[TATCGGACGGTCGGTT 
(SEQ ID NO: 746) 


EGR4 

(SEQ ID NO: 1) 


ATTTATTGGATGGTTGG 
KSEQ ID NO: 747) 


EGR4 

(SEQ ID NO: 1) 


AGGCGTAGCGTTTTAG 
(SEQ ID NO: 748) 


EGR4 

(SEQ ID NO: 1) 


TGAGGTGTAGTGTTTT 
(SEQ ID NO: 749) 


EGR4 

(SEQ ID NO: 1) 


AACGTTATAGTTCGAGT 
(SEQ ID NO: 750) 


EGR4 

(SEQ ID NO: 1) 


AATGTTATAGTTTGAGTTT 
(SEQ ID NO: 751) 


TP73 

(SEQ ID NO: 15) 


GTGCGAGTTAGTCGGA 
(SEQ ID NO: 752) 


TP73 

(SEQ ID NO: 15) 


GTGTGAGTTAGTTGGA 
(SEQ ID NO: 753) 


TP73 

(SEQ ID NO: 15) 


rATCGGTTCGGAGTTA 
(SEQ ID NO: 754) 


TP73 

(SEQ ID NO: 15) l 


A.GGATATTGGTTTGGAG 
[SEQ ID NO: 755) 


TP73 [ 
(SEQ ID NO: 15) ( 


\GAGTCGTTCGGAATT 
[SEQ ID NO: 756) 


TP73 

(SEQ ID NO: 15) ( 


rGAGAGTTGTTTGGAAT 
;SEQ ID NO: 757) 


SYK ( 
(SEQ ID NO: 19) ( 


3AAGT1ATCGCGTTGG 
SEQ ID NO: 826) 


SYK / 
(SEQ ID NO: 19) ( 


VGAAGTTATTGTGTTGG 
SEQIDNO: 827) ' 


SYK ( 


jATCGATGCGGTTTAT 
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Gene 


OH so " 


(SEQ ED NO: 19) 


(SEQ ID NO: 828) 


SYK 

(SEQ ID NO: 19) 


GGGATTGATGTGGTTTA 

^— ' VJ vJ<tX A JL VJ X VJ J. \JVJ J. A J. xa. 

(SEQ ID NO: 829) 


SYK 

(SEQ ID NO: 19) 


GTTCGGCGGG AGG A G A 
(SEQ ID NO: 830) 


SYK 

■ JJL Jjk Jk 

(SEQ ID NO: 19) 


GTTTGGTGGG Arjfr AC? A 

v — * AAA VJVJ A. VJ VJ VJ/k\J\J < r\\J/\ 

(SEQ ID NO: 831) 


SYK 

(SEQ ID NO: 19) 


AGTCGATTTTCGTTTAG 

* * ^ — * A ^VJX 1 A Jl J. JL V — . VJ AAA iVVJ 

(SEQ ID NO: 832) 


SYK 

(SEQ ID NO: 19) 


TAGTTGATTTTTGTTTAGT 
(SEQ ID NO: 833) 


SYK 

(SEQ ID NO: 19) 


GGAAGAGTCGCGGGTT 
(SEQ ID NO: 834) 


SYK 

(SEQ ID NO: 19) 


GGAAGAGTTGTGGGTT 

^ VJiLi ivj^LVj A A VJ A VJVJVJ A A 

(SEQ ID NO: 835) 


HSPB1 

(SEQ ID NO: 20) 


AGTCGTGTTACGGTAG 
(SEQ ID NO: 890) 


HSPB1 

(SEQ ID NO: 20) 


AGTTGTGTTATGGT A GG 

* »VJ A A A VJ A A XV A VJ VJ A /IVJVJ 

(SEQ ID NO: 891) 


HSPB1 

(SEQ ID NO: 20) 


TTTTTTCrr TT A AOGA A Aft 

(SEQ ID NO: 894) 


HSPB1 

■*- JjLk_JA 1 # J, 

(SEQ ID NO: 20) 


TTTTTTTTGTTA Add A A AO 

AAAAAAAA VJ 1 A /AJTlAj vJ/Vt^JTAvJ 

(SEQ ID NO: 895) 


TES 

(SEQ ID NO: 22) 


TAGAAGTPGGTTPfrTO 

a nvj rViWj a V_^VJ VJ A A V^VJ A VJ 

(SEQ JD NO: 758) 


TES 

(SEQ ID NO: 22) 


AGAAGTTGfrTTTOTriO 1 

rLVirxrivVJ A A VJ VJ AAA VJ A VJ vJ 

(SEQ ID NO: 759) 


TES 

(SEQ ID NO: 22) 


GATf GGGCGGCGG A AG 
CSEQ ID NO: 760) 


TES 

(SEQ ID NO: 22) I 


(\TTGGGTGGTGGAAGT 
:SEQ ID NO: 761) 


TES 

(SEQ ID NO: 22) | 


rAGCGGAGTCGGAGGT 
:SEQ ID NO: 762) 
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Gene 


Oligo: 


TES 

(SEQ ID NO: 22) 


TAGTGGAGTTGGAGGT 
(SEQ ID NO: 763) 


TES 

(SEQ ID NO: 22) 


AATTCGGTCGTGGGAT 
(SEQ ID NO: 764) 


TES 

(SEQ ID NO: 22) 


AATTTGGTTGTGGGAT 
(SEQ ID NO: 765) 


PITX2 

(SEQ ID NO: 23) 


AGTCGGGAGAGCGAAA 
(SEQ ID NO: 970) 


PITX2 

(SEQ ID NO: 23) 


AGTTGGGAGAGTGAAA 
(SEQ ID NO: 971) 


PITX2 

(SEQ ID NO: 23) 


AAGAGTCGGGAGTCGGA 
(SEQ ID NO: 972) 


PITX2 

(SEQ ID NO: 23) 


AAGAGTTGGGAGTTGGA 
(SEQ ID NO: 973) 


PITX2 

(SEQ ID NO: 23) 


GGTCGAAGAGTCGGGA 
(SEQ ID NO: 974) 


PITX2 

(SEQ ID NO: 23) 


GGTTGAAGAGTTGGGA 
(SEQ ID NO: 975) 


PITX2 

(SEQ ID NO: 23) 


ATGTTAGCGGGTCGAA 
(SEQ ID NO: 976) 


PITX2 

(SEQ ID NO: 23) 


TAGTGGGTTGAAGAGT 
(SEQ ID NO: 977) 


GRIN2D 

(SEQ ID NO: 24) 


GAGAGTCGGGATGATT 
(SEQ ID NO: 766) 


GRIN2D 

(SEQ ID NO: 24) 


GGAGAGTTGGGATGAT 
(SEQ ID NO: 767) 


GRIN2D 

(SEQ ID NO: 24) 


TAGGGTCGAGATTTGG 
(SEQ ID NO: 768) 


GRBST2D 

(SEQ ID NO: 24) 


TTAGGGTTGAGATTTGG 
(SEQ ID NO: 769) 


GRIN2D 

(SEQ ID NO: 24) 


AGTGTGGCGAATATTG 
(SEQ ID NO: 770) 


GRIN2D 


GTGTGGTGAATATTGAA 
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Gene 


Ohgo: 


(SEQ ED NO: 24) 


(SEQ ED NO: 771) 


DO A T 1 

PI>A1 1 

(SEQ ID NO: 25) 


TTTCG A 1 1 CGGTTTAGA 
(SEQ ED NO: 808) 


DO AT1 

Pc^Al 1 

(SEQ ID NO: 25) 


AATTGTTTTGA 111 GGTT 
(SEQ EO NO: 809) 


no A T 1 *l 

PS ATI 

(SEQ ID NO: 25) 


TAATGGGGCGTCGATT 
(SEQ EO NO: 810) 


DO A T" 1 1 

riSAl 1 

(SEQ ID NO: 25) 


rTAATGGGGTGTTGATT 
(SEQ ED NO: 811) 


DO A T1 

(SEQ ID NO: 25) 


I AlCCrl ACjCOCjI 1 AGO 
(SEQ ID NO: 812) 


DO A TP 1 

rkAl 1 

(SEQ ID NO: 25) 


TATTGTAGTGG 1 TAGGAA 
(SEQ ED NO: 813) 


DO A T^l 

roAi 1 

(SEQ ID NO: 25) 


AGGAACGTTAGTCGTT 
(SEQ ED NO: 814) 


DO AT1 

PS ATI 

(SEQ ID NO: 25) 


TAGGAATGTTAGTTGTTT 
(SEQ EO NO: 815) 


DO A T 1 "! 

PSAT1 

(SEQ ID NO: 25) 


GGTCGTCGTATTATGGA 
(SEQ ED NO: 816) 


PSAT1 

(SEQ ID NO: 25) 


TGGTTGTTGTATTATGGA 
(SEQ ED NO: 817) 


CGA 

(SEQ ID NO: 26) 


ATAI 1 1A1 1 1 ICGGAAAI 1 1 

(SEQ ED NO: 836) 


(SEQ ID NO: 26) 


1 1 v 1 1 a rrnvprm • v-"* » AAA »'l»|vi< a TP a /"^T» 

TTATTTITGGAAATTTATAGT 
(SEQ ED NO: 837) 


(SEQ ID NO: 26) 


TG A' YY 1TGTCGTT ATT ATT 
(SEQ ED NO: 838) 


(SEQ ID NO: 26) 


r piy" 1 A r l ,, ^ ,^ I"*^p/^^p , ^p/^"T ,/ T , a r PT a tphp 
1 lOAI 1 1 l(jrl ICjrl 1ATTAI 1 

(SEQ ID NO: 839) 


CGA 

(SEQ ID NO: 26) 


TAAATTOACOTTATOfrTA 

(SEQ EO NO: 840) 


CGA 

(SEQ ID NO: 26) 


AAATTGATGTTATGGTAAA 
(SEQ EO NO: 841) 
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Gene 


Oligo: 


CGA 

(SEQ ED NO: 26) 


AATTGACGTTATGGTAAT 
(SEQ ID NO: 842) 


CGA 

(SEQ ID NO: 26) 


TAAAAATTGATGTTATGGT 
(SEQ ID NO: 843) 


COX7A2L 
(SEQ ID NO: 28) 


TTGTTCGAAGATCGTT 
(SEQ ID NO: 1078) 


COX7A2L 
(SEQ ID NO: 28) 


GTTGTTTGAAGATTGTTT 
(SEQ ID NO: 1079) 


COX7A2L 
(SEQ ID NO: 28) 


TAGCGTAAGGATTCGGT 
(SEQ ID NO: 1080) 


COX7A2L 
(SEQ ID NO: 28) 


TTAGTGTAAGGATTTGGT 
(SEQ ID NO: 1081) 


COX7A2L 
(SEQ ID NO: 28) 

• 


AGAGTTCGGTTTTTCGTA 
(SEQ ID NO: 1082) 


COX7A2L 
(SEQ ID NO: 28) 


AGAGTTTGGTTTTTTGTA 
(SEQ ID NO: 1083) 


COX7A2L 
(SEQ ID NO: 28) 


ATTCGTATTTGCGGGTTA 
(SEQ ID NO: 1084) 


COX7A2L 
(SEQ ID NO: 28) 


ATTTGTAT1TGTGGGTTA 
(SEQ ID NO: 1085) 


ESR2 

(SEQ ID NO: 29) 


ATTTCGAGGATTACGTT 
(SEQ ID NO: 936) 


ESR2 

(SEQ ID NO: 29) 


ATTTTGAGGATTATGTTTT 
(SEQ ED NO: 937) 


ESR2 

(SEQ ID NO: 29) 


AGATGGCGTTTTTCGTA 
(SEQ ID NO: 938) 


ESR2 

(SEQ ID NO: 29) 


TAGATGGTGTTTTTTGTA 
(SEQ ID NO: 939) 


ESR2 

(SEQ ID NO: 29) 


ATTTTCGA ATC GATTTTT 
(SEQ ID NO: 940) 


ESR2 

(SEQ ID NO: 29) ' 


GGAGTATTTTTGAATTGAT 
(SEQ ID NO: 941) 


ESR2 • 


AGTTCGACGGTTTTAG ~ — 



88 



-89- 



Gene 


Ohgo: 


(SEQ ID NO: 29) 


(SEQ ID NO: 942) 


ESR2 

(SEQ ID NO: 29) 


AGGGAGTTTGATGGTT 
(SEQ ID NO: 943) 


(SEQ ID NO: 29) 


ALrl 1 1 ACGTGATCGAG 
(SEQ ID NO: 944) 


h,SR2 

(SEQ ID NO: 29) 


AGTTTATGTGATTGAGTT 
(SEQ ID NO: 945) 


\ /TXT 

V1N 

(SEQ ID NO: 31) 


GGTGGTATCGATTGAT 
(SEQ ID NO: 1034) 


V llN 

(SEQ ID NO: 31) 


TGGTGGTATTGATTGAT 
(SEQ ID NO: 1035) 


V 1 IN 

(SEQ ID NO: 31) 


TAGTGATTCGCGGGGA 
(SEQ ID NO: 1038) 


V 1 JN 

(SEQ ID NO: 31) 


TAGTGATTTGTGGGGA 
(SEQ ID NO: 1039) 


VTN 

(SEQ ID NO: 31) 


TTATGTCGGAGGATGA 
(SEQ ID NO: 1040) 


VTN 

(SEQ ID NO: 31) 


ATI ATGTTGG AGGATGA 
(SEQ ID NO: 1041) 


VTN 

(SEQ ID NO: 3 1) 


ATACGGTTTATGACGAT 
(SEQ ID NO: 1042) 


VTN 

(SEQ ID NO: 31) 


ATATGGT1TATGATGATGG 
(SEQ ID NO: 1043) 


PCAF 

(SEQ ID NO: 33) 


GAGCGGTAGGTGTCGAA 
(SEQ ID NO: 978) 


PCAF 

(SEQ ID NO: 33) 


GAGTGGTAGGTGTTGAA 
[SEQ ID NO: 979) 


Dp A TT 

(SEQ ID NO: 33) 


fAAGATTTCGCGGGTA 
:SEQ ID NO: 980) 


PCAF 

(SEQ ID NO: 33) | 


rGTAAGATTTTOTHnnTA 
;SEQIDNO: 981) 


PCAF 

(SEQ ID NO: 33) ( 


\GTTCGTAGTTTCGAG 
[SEQ ID NO: 982) 
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Gene 


Oligo: 


PCAF 

(SEQ ID NO: 33) 


GTTTGTAGTTTTGAGGA 
(SEQ ID NO: 983) 


PCAF 

(SEQ ID NO: 33) 


TAGGGCGCGGAGTAGA 
(SEQ ID NO: 984) 


PCAF 

(SEQ ID NO: 33) 


TAGGGTGTGGAGTAGA 
(SEQ ID NO: 985) 


PRKCD 

(SEQ ID NO: 34) 


ATTTATTTTTCGTTGTAGG 
(SEQ ID NO: 772) 


PRKCD 

(SEQ ID NO: 34) 


lAriTATlTriTG'lTGTAGG 
(SEQ ID NO: 773) 


PRKCD 

(SEQ ID NO: 34) 


TTTCGGAAACGGGAAT 
(SEQ ID NO: 774) 


PRKCD 

(SEQ ID NO: 34) 


TAGTTTTGGAAATGGGA 
(SEQ ID NO: 775) 


PRKCD 

(SEQ ID NO: 34) 


GGACGGAGTTATCGGT 
(SEQ ID NO: 776) 


PRKCD 

(SEQ ID NO: 34) 


GGATGGAGTTATTGGTA 
(SEQ ID NO: 777) 


PRKCD 

(SEQ ED NO: 34) 


GTTTAGCGGAGGGATA 
(SEQ ID NO: 778) 


PRKCD 

(SEQ ID NO: 34) 


TGTTTAGTGGAGGGAT 
(SEQ ID NO: 779) 


ESR1 (exon8) 
(SEQ ID NO: 61) 


TTGTTACGGTTTGAGAG 
(SEQ ED NO: 780) 


ESR1 (exon8) 
(SEQ ID NO: 61) 


TTGTTATGGTTTGAGAGT 
(SEQ ID NO: 781) 


ESR1 (exon8) 
(SEQ ID NO: 61) 


TTTGTTATAGTTTGAGAGT 
(SEQ ID NO: 782) 


ESR1 (exon8) 
(SEQ ID NO: 61) 


TTTGTTACGGTTTGAG 
(SEQ ID NO: 783) 


ESR1 (exon8) 
(SEQ ID NO: 61) 


TTTGTTATGGTTTGAGA 
(SEQ ID NO: 784) ' 


ESR1 (exon8) 


ITTGTTA-TAGTTTGAGAG 
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Gene 


Oligo: 


(SEQ ID NO: 61) 


(SEQ ID NO: 785) 
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VW, data: 1 1 - 12 " 2003 

1 . A method for characterising a cell proliferative disorder of the breast tissues 
and/or a metastases thereof and/or predicting the disease free survival and/or response 
of a subject with said disorder to a treatment comprising one or more treatment which 
target the estrogen receptor pathway or are involved in estrogen metabolism, 
production or secretion, said method comprising 

a) obtaining a biological sample from the subject 

b) determining the methylation status of one or more CpG positions within at least one 
target nucleic acid comprising one or a combination of the genes taken from the group 
consisting of EGR4, APC, CDKN2A, CSPG2, ERBB2, STMN1, STK11, CA9, 
PAX6, SFN, S100A2, TFF1, TGFBR2, TP53, TP73, PLAU, TMEFF2, ESR1, SYK, 
HSPB1, RASSF1, TES, PITX2, GRIN2D, PSAT1, CGA, CYP2D6, COX7A2L, 
ESR2, PLAU, VTN, SULT1A1, PCAF, PRKCD, ONECUT2, BCL6, WBP11, (MX1), 
N.N., APP, ORC4L, NETOl, TBC1D3, GRB7, CYP2D6, CDK6, (Chr. Ipl3.2), (Chr. 
17q25.1), ABCA8, (Chr. 12ql4.3) 5 (Chr. 8ql2.1), MARK2, ELK1, Q8WUT3, CGB, 
BSG, BCKDK, SOX8, DAG1, SEMA4B, ESR1 (exon8) and/or their regulatory 
regions by contacting said target nucleic acid with one or more agents that convert 
cytosine bases that are unmethylated at the 5' -position thereof to a base that is 
detectably dissimilar to cytosine in terms of hybridization properties 

c) determining therefrom the prognosis of said subject, characteristics of said cell 
proliferative disorder, disease free survival and/or probability of response of said 
subject to said treatment 



2. The method according to claims 1 further comprising 
d) determining a suitable treatment regimen for the subject 

3. The method according to claims 1 and 2 wherein said suitable treatment 
regimen comprises one or more therapies selected from the group consiting of 
chemotherapy, radiotherapy, surgery, biological therapy, immunotherapy, antibodies, 
molecularly targeted drugs, estrogen receptor modulators, estrogen receptor down- 
regulators, aromatase inhibitors, ovarian ablation, LHRH analogues and other 
centrally acting drugs influencing estrogen production. 
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4. A method according to Claims 1 to 3, wherein said treatment is an adjuvant 
treatment and said genes are selected from the group consisting of ERBB2, STMN1, 
TFF1, TMEFF2, ESR1, HSPB1, PITX2, COX7A2L, PLAU, VTN, PCAF, 
ONECUT2, BCL6, WBP11, TBC1D3, GRB7, CDK6, (Chr. Ipl3.2), ABCA8 and 
(Chr. 8ql2.1) 



5. A method according to Claims 1 to 3, wherein said treatment is an adjuvant 
treatment and said target nucleic acid(s) are selected from the group consisting of 
SEQ ID NO: 5, 6, 12, 17, 18, 20, 23, 28, 16, 31, 33, 35, 36, 37, 43, 44, 46, 47, 49 and 
51. 

6. A method according to Claim 1 to 3, wherein said disorder is a metastatic 
disease and said genes are selected from the group consisting of APC, CSPG2, 
ERBB2, STK11, S100A2, TFF1, TGFBR2, TP53, TMEFF2, SYK, HSPB1, RASSF1, 
PSAT1, CGA, ESR2, ONECUT2, WBP11, CYP2D6, CDK6, ELK1, CGB and DAG1 

7. A method according to Claims 1 to 3, wherein said disorder is a metastatic 
disease and said target nucleic acid(s) are selected from the group consisting of SEQ 
ID NO: 2, 4, 5, 7, 11, 12, 13, 14, 17, 19, 20, 21, 25, 26, 29, 35, 37, 45, 46, 53, 55 and 
59. 

8. The method as recited in one of the Claims 1 through 7, characterised in that 
the genomic DNA is obtained from cells or cellular components which contain DNA, 
sources of DNA comprising, for example, cell lines, histological slides, biopsies, 
tissue embedded in paraffin or sections thereof, breast tissues, blood, plasma, serum, 
lymphatic fluid, lymphatic tissue, duct cells, ductal lavage fluid, nipple aspiration 
fluid, cerebrospinal fluid, bone marrow and combinations thereof. 

9. A method according to Claims 1 to 8, wherein said cell proliferative disorder of 
the breast tissue is selected from the group consisting of ductal carcinoma in situ, 
invasive ductal carcinoma, invasive lobular carcinoma, lobular carcinoma in situ, 
comedocarcinoma, inflammatory carcinoma, mucinous carcinoma, scirrhous 
carcinoma, colloid carcinoma, tubular carcinoma, medullary carcinoma, metaplastic — 
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carcinoma, and papillary carcinoma and papillary carcinoma in situ, undifferentiated 
or anaplastic carcinoma and Paget' s disease of the breast, 

10. A method according to Claims 1 to 9, wherein said subjects are estrogen and/or 
progesterone receptor positive. 

11. A method according to claims 1 to 10 wherein b) comprises 

a. converting cytosine bases in the genomic DNA sample which are 
unmethylated at the 5-position, to uracil or another base which is dissimilar to 
cytosine in terms of base pairing behaviour; 

b. amplifying at least one fragment of the pretreated genomic DNA, 
wherein said fragments comprise at least 8 base pairs of one or more sequences 
selected from the group consisting of SEQ ID NO: 206 to 449 and sequences 
complementary thereto, and 

c. determining the methyl ation status of one or more genomic CpG 
dinucleotides by analysis of the amplificate nucleic acids. 

12. The method according to claim 1 1 wherein ii) is carried out using one or both 
of MSP and/or Heavy Methyl. 

13. The method according to claim 1 1 wherein iii) is carried out by means of one 
or more methods taken from the group consisting oligonucleotide hybridisation 
analysis, Ms SnuPE, sequencing, Real Time detection probes and oligonucleotide 
array analysis. 

14. A nucleic acid molecule consisting essentially of a sequence at least 18 bases 
in length according to one of the sequences taken from the group consisting of SEQ 
ID Nos: 206 to 449. 

15. An oligomer, in particular an oligonucleotide or peptide nucleic acid (PNA)- 
oligomer, said oligomer consisting essentially of at least one base sequence having a 
length of at least 10 nucleotides which hybridises to or is identical to one of the 
nucleic acid sequences according to SEQ ID NO: 206 to 449. 
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16. A set of at least two oligonucleotides as recited in claim 15. 

17. A kit comprising a bisulfite (= disulfite, hydrogen sulfite) reagent as well as 
oligonucleotides and/or PNA-oligomers according to one of the Claims 15 or 16. 

18. A kit according to claim 17, further comprising standard reagents for 
performing a methylation assay from the group consisting of MS-SNuPE, MSP, 
Methyl light, Heavy Methyl, nucleic acid sequencing and combinations thereof. 



19. The use of a method according to one of claims 1 through 13, a nucleic acid 
according to Claim 14, of an oligonucleotide or PNA-oligomer according to Claim 15, 
of a kit according to Claim 17 or 18 or of a set of oligonucleotides according to claim 
16 for the treatment of breast cell proliferative disorders. 
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EPOBERLIN 

Abstract « A „ ft 

1 1 -12- 2003 

The present invention relates to modified and genomic sequences, to oligonucleotides 
and/or PNA-oligomers for detecting the cytosine methylation state of genomic DNA, 
as well as to a method for predicting the disease free survival and/or response of a 
subject with a cell proliferative disorder of the breast tissues, to endocrine treatment. 
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1 * -12- 2003 

seq_prot.txt 

Sequence listing 
<110> Epi genomics AG 

<120> Method and nucleic acids for the improved treatment of breast cell 
proliferative disorders 

<160> 1087 

<210> 1 

<211> 4200 

<212> DNA 

<213> Homo Sapiens 

<400> 1 

gaaaatgagg gttggctttg aagttggaca gatttgtttg acctgtctgt tcctagcctg 60 

ggttaactac attgttccag caagctatct acattgcttc cacatctttg aaatgaggta 120 

tatgcctgct ttatttggga atgcgaggat taaggagaat aatatataca taatgttgaa 180 

cacctacgcc ttttaaccac tttgaagttc cagaaacacc tccagccctt aggtgagctg 240 

tgattaaatt cgttcattaa cccaacacac atttactgaa tgcctactct gtgccgcagt 300 

tcctggcagg tgtgcctgac agtgggtgtg taatatgttc agggtctgta tcccatgagc 360 

gtggggatct cctttatctt ctggcacata tggtcctggg ggagaagcta aggggaaggg 420 

tcaggagctt acatggcaga ttcagtaagc ttttagcaca ataattttaa ttgcaaaaat 480 

aaacagtttt gtcaactgct tgagagtagc gtctgcttta caaaaattaa caacaacaaa 540 

aggaaaaaaa acccaaagca aaacgttaca tccaatggct gctggataag acactgctgt 600 

acaaagtccc gcgcgagacc gccttggcgc tgccccagcc tgtctctggg ggttatagag 660 

gaaggcgtgg ggcgtgtgcc agttacaaaa gactgtcttg aatcccaggg tagtgcctat 720 

tcactgggtg gtctcagaag acttccccca aagcgcggct gacaagggtg acagcgcctg 780 

gaccgcggga actgtccgcg gagctggtgc tgaatagggc gtgtgcggcg ggcgcttcaa 840 

ggaaactgga agcgggaccg gaggccggcc ctcgggcgtg cgaggaggag ttggaagaag 900 

agcggggagg ggaacggccc ggaactcgtg cgcgccgaac ggcggcgccc caacccataa 960 

acccatctct tgctcagaga gaagcgaagg agaggcccag cgagtaaaag ccgaggccct 1020 

tgagccgctc ctcggcgcgc gccttctgct tgaggtgcac cttgctgtgc cgtttcttct 1080 

catcgctgcg cgcgaagcgg cggccgcaca cgtcgcaagc aaaaggcttc tcgccggtgt 1140 

gggtgcgcac gtgcgtggtg aggtggtcgc tgcggctgaa gttgcggagg cagatgcggc 1200 

actggaaggg tttgtggccc gtgtggatgc gcaggtggcg attgagctcg tcggagcgcg 1260 

caaagctccg cacacaactc tccaccgggc aagcgaaggc cttggcgtgc ggccgcgggc 1320 

agaagcagcg cgtgctgcat ttgccgccgc ggcgcccctt gcgtcgcgcc ttggcctggg 1380 

ggaaaggggt gggcggcggc ggcggcacgg gtggtgcagc cacgccactg cttccaggga 1440 

tgtccgccac cagaggttta gggaagtccg ccgcggcggc gctgcgaagg cccagcgggg 1500 

aaagctgagg ctgcgtactg gccagaaact ctccgccgtc gccgctactc cctccctccc 1560 

cactaggagg ggtcaggagc ccagggaggc cctcagcccc ctcccctaag tcacccgggg 1620 

ccagcgggaa agcgtcatag gccccgctgg gatagagtct gttggctggg acggccggca 1680 

gttccgcagg gcagctgatg gacagcaagt cctcaatctt ggtccctatt acgggaaaac 1740 

gagcctccgg ggcggcctgg tagtctccct gtgacccaca gttccctggg gcccccacag 1800 

aaagcagctc ccagggcgcg tagggaccct tgaaggcaga gacagcgtcc agcgctggcg 1860 

aggcgggagg cgcccggagg ccgggcttga cgtcgggcgg ggagagctga ggctcataca 1920 

ggcactgcga gggggcaccc gcgcaaggcg aggcctccca gaacgcctct gggaaagggg 1980 

cagcgcccag atccggggag taaaggtccg gcggacccgg cagcaaggca tcggaccccg 2040 

caggaaaagg ggcatccagc ggggatctgg acgctgctgc ctctggaccg gggaagggtg 2100 

ccaggcctaa gatgcccgac atgaggttga agagtgcctc cgggtcgtgc gggtgttcgg 2160 

gcactgcctg aatgaagaag ctaccgctgt agctgaggcc gggagggggt gtgggcgcag 2220 

gcccctccag gaagcaggag tcggctaagt ccccacttgc gccgcagctg ttcaaagccc 2280 

agctcaagaa gtcgcctgct gaggagggag caggaatcag ctctgggcac atcaaagggt 2340 

gcacctgagt ccacagcccc tacctacaaa actcttggtg cccacggata tacacaaagt 2400 

gccttttgca cattctgatt cagcaggggc ccgcatacgt cccaaggagc acatagaaac 2460 

atgcacacgc aaaacacacg tgcacaggca aaagggcgct ccgataaccg cacaggttcc 2520 

tgcggaggcg ctggcggccc agtgtgggtg ggaatggggg tgcgcacccc aggactccta 2580 

agcttcccat cgcccctatt ctcatagctc caatgtccca gtccctcctg gttctcaggt 2640 

atggtgcgcc cccgcgctgc gccgtcgtcg tctgagcacc cctgctcgcc ctccttacct 2700 

ccagggtagc cggtggccgc gggagcgtcc ctggcaggca gccggggcaa ttcagcgctg 2760 

ggttcggcgc aacagccttc agtggacttg acgaggagcg cgtcgggttc ggaaaactcg 2820 

ctaaggtgga gcatggcgcg gcgccggctg tggggcgccc ggggcctcgc ccgctgggct 2880 

tgggggcgcg cgggtggcgg ggaggctggc ggtaggggtt ccccgcagcg cacagaccta 2940 

ggcgcccggg ctcctggctt cgggcactca cctctgggaa aggagtcggg gagccgcggc 3000 

gccctcgctc gcccgcaccg gcctcgggcg gcggctcctc gcctctccaa agcgctgccg 3060 

ggctccccca gcccacagcc cccccactta tatagctgcg gcggcgctgg ctccgggctt 3120 

serfte 1 



seq_prot -txt 

ccgactcccc gggccactgc cattttggga ggccccggcc ctgccgccgt gacgtaaatg 3180 

cccaaacatg gacacaggat gtgtgccggg gactcccgaa aaggaaagct cgagctgtga 3240 

cgttgttgtt gtcgccgccg ccgccaggct cgcgccgctg cgtgcgcgcg cgctccccag 3300 

gcgtggagcg ctggggccgc ggatgcgcca gtctggggcg ctgcgcctca ccgggccgac 3360 

cgtccggtag atgcgcttcc cacctggagg cagaggccga ggggcagcag gggaggaccg 3420 

tgttgggttt gaatctgccc tctgccacta actagcttga cactcagtcc cccttaacgt 3480 

cagtttctca cctgtgcatt tcctgtggag agaaactgag agcacataaa tgtccaggcg 3540 

cactcagtaa accacagaga ttattgccat tcatttcttt tgctcacgcc tgtgtgtttt 3600 

tccattttaa gttttttttt tttaaatttg tgctaggaaa aataatttaa attaaaaagg 3660 

actgtaatac ttgttaacag taggaggaaa gtcaactagc acaacagggt atattctaaa 3720 

ttgaaaagta aaagtccctc ctctccacct accttcattc ctagctcctg aggtagccac 3780 

atttaagttt cttgggtatt ctttcagaag aattttatgt atatccaacc atacacattg 3840 

ttctgcaata ttattattat tattattttt tcacttgaaa taacttagag attgtgccat 3900 

gtcctgacat aaagatcttg tcattctttt tcaaggctgt atagtactcc attgtaaggg 3960 

aggaccttga gttagtcgat ttgtccccta tccatgagca tgtaggcagt ttccaacttt 4020 

ttacttatac caatgttgtt ggagtcaaca tccttgacta tataaatgtg aatgtgtctg 4080 

tgagataaat tccaacgagt gtaattgtgt gccaaagtat atgtaaaatt ttgatagaca 4140 

actggcaaat tgtcttccag aaaagtaggg ccaattcctg otcactcctc cctcacccac 4200 

<210> 2 
<211> 4491 
<212> DNA 

<213> Homo sapiens 
<400> 2 

ttcagtaatt ttccatgctg aaatttggcc cagtatactt tgattatata tctatattta 60 

attatagttt taaaatagct aagcagtttt aaaaattcaa attaaacttt tgtggaatct 120 

ctaaaaataa gaaaaatatt actgagtaat tggcaacata aaatacaaaa tgtttaactt 180 

ggctttcttt tagatttgaa gcatagtgac taatgaacat ataatggtcg tagctttctt 240 

tacaactatg tcatttaagt cactgatgaa tatctatgat acatgcagta aaaatttaga 300 

caaacatttt ggctacaaac tggtttacta aataatagct tgtatcatag tctgagtcaa 360 

gagtaatagg cttacaattg tgtcttaggg tactaagaag ttcttttgag gaccttttaa 420 

aaggtattct ttgttttgaa agactgtttt ccttctggaa gttctaggcc ttatgagaaa 480 

ttacctattc tgaatttgtc acaaaaaatg acttgtaaat aagtcctgtg cagtgaatca 540 

tgtgctttca gtaagaggat tttgaaagcc ttttgtaaaa tgaagtcatt ctcaaaagag 600 

agttatttag aatataatca aattgattta catgctttta tttatttcaa atatgaaggg 660 

aaattgtttc taaatatatt taaactttta atagaacagt agtatgccat cagtgtggaa 720 

ataactcact tgttaaataa atatttgggg tgtatttgct gtatatcagg cattgtaaca 780 

cagggactaa tgaaagagaa aaaaaacaaa gcacgaaagg acagcagaaa tagctccata 840 

atctcacttt ccaggaataa cctctatgga cattttgttg tgtgtctatt gcttttctaa 900 

tatatacatt ttttaatcaa atggaattgc ctcaaatatg ctatttaata gcttgcttct 960 

tttcatacta tttgaaaatt aagaaacgat tataatatgc ttcatttaaa aactttaagt 1020 

ttaggccggg cacagtggct cacgcatgta atcccagtac ttggagggac cgaggtgggc 1080 

ggatcacaag gtcaggagat tgagaccatc ctggccagca tggtgaaacc ccgtctctac 1140 

taaaaataca aaacttagct gggcatgatg gcacgtgcct gtagtcccag ctacttggga 1200 

ggctgaggca ggagaatcgc ttgaacccgg gaggtggagg ttgcagtgag cggagattgc 1260 

accactgcac tcgcctggtg acagagcaag actctatctc aaaaaacaaa caaagaaaac 1320 

ttgaagtata gtatcctttt aaattttaaa tagataatag aaactggttt ccccccattt 1380 

aaaccagaat ttaagtttaa ctttatatat tcttgacagt ttggattttg tccttcaacc 1440 

tcataaaatt gggaatttaa gcatcacctg gttcgattta aatgcaatgt agaatttgca 1500 

ttaaaatact acattaaagc ctcagatttg tagtagctaa cagcacttct atgtatgtgt 1560 

cagggactgc tctaaatact tcatatatat taactcctct attctgtact tctgttcccg 1620 

ttttatacag caggaaattg aaacactgag aggttaagta actaaagtta cagagctaga 1680 

gtgacaggag taaagcttca actcaggcaa cccagacttc cagagttctg atctccacta 1740 

ctaagctgct agcatagctt ttctggtaac tatttttaat tcaaatataa ttcgagtgat 1800 

ctatctaaca agtcatcact ctgacaactc agtgacttgt aatgtaaaat tattcattgt 1860 

aattcattta atattattgt ttctctgtgc tgcaaaaatc atagcaatcg agatgtaatt 1920 

tattactctc cctcccacct ccggcatctt gtgctaatcc ttctgccctg cggacctccc 1980 

ccgactcttt actatgcgtg tcaactgcca tcaacttcct tgcttgctgg ggactggggc 2040 

cgcgagggca tacccccgag gggtacgggg ctagggctag gcaggctgtg cggttgggcg 2100 

gggccctgtg ccccactgcg gagtgcgggt cgggaagcgg agagagaagc agctgtgtaa 2160 

tccgctggat gcggaccagg gcgctcccca ttcccgtcgg gagcccgccg attggctggg 2220 

tgtgggcgca cgtgaccgac atgtggctgt attggtgcag cccgccaggg tgtcactgga 2280 

gacagaatgg aggtgctgcc ggactcggaa atggggtagg tgctggagcc accatggcca 2340 

ggcttgctgc ggggggaggg gggaaggtgg ttttccctcg cactgtctta aaccgatggc 2400 

ctttccttgg cacagggtcc actgcagcat gccaaacgag gaggcagggg cgtcgtcccc 2460 

cc ^ c l c " c ? f* 9 .^?* 1211 JL9_ag«ggat Jttcacgtact tcggatccag ggtttttgac 2520 
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agaagaggaa 
tttgaagcca 
tctaaggaaa 
ctxtacccca 
caaaagagag 
actggggatg 
tggtgaaata 
tgtgaacagg 
ggctttttgc 
tgcagaggaa 
atagtttata 
aatgtcaggc 
gaaagtgatg 
tgtatgaaac 
ttcaagagca 
tttgtaggcc 
tttagataac 
aagcaacatg 
cttgatataa 
ctttatgcat 
cagtggcttt 
atagaaagtc 
attgggctgg 
ctttaaaaaa 
cttaggagcc 
agctgaggca 
agaccctatc 
gctagaacta 
gttctttgaa 
attacctttt 
ctcctattca 
acttaagtct 
acaaaataat 



gaagggggag 
gaaggggttt 
cgggtagagg 
ttgaaaggct 
aggagacaaa 
agagaaagag 
ctggtcacca 
gtggcaaaca 
ggacgactgc 
taagaaacct 
aatgcagcct 
tatgaatgtt 
ctcaaattct 
agtggatctt 
aaccacttta 
taatatttaa 
actgaataaa 
gcttttcttc 
agtgtttttc 
acttatagta 
gtaaagccct 
ttaagatgac 
cttagataaa 
ttatttctgc 
gggcgtggtg 
ggaggatcac 
tcaaaaaaaa 
aagatgtcac 
ttccattagc 
accattttaa 
agtatgttgt 
tctattttat 
atataacaga 



gggtagaagt 
ttgtttttat 
acaacaaaga 
tcttttcctc 
accgctgcag 
gaggaggcag 
gtagtgtgcc 
gataccagtg 
agcagtgctt 
tgcagcaaat 
gaacagcaga 
ttgtcattgg 
ttgatttaac 
ttcttttgtt 
caacgtaaat 
gacctatagt 
tatttaagat 
tagatgtatt 
att gage tut 
gccagcctgc 
ccagggtcag 
ctattatatt 
taagactaca 
tttaagaatt 
gcccatgcct 
ttgagccttg 
gtacttagag 
attaaaattc 
atagtgtttg 
cagctcattg 
gactttatcc 
tctaattcta 
ggtaatttga 



seq_prot 
gttaagggga 
aatgccattt 
atggagcata 
cctggcgaca 
atggctgatg 
gtactgeaga 
tgcttttgta 
tctttgttag 
tttctccctc 
gctggggtag 
aaaaaaatta 
aatgtattgg 
ataaatccta 
agtgaagctt 
tactttgtca 
taagtaattt 
attaattcag 
tctgtttagt 
tttttccttt 
cacacctccc 
tgacagtgtc 
gtttcagtaa 
taagtttttc 
acttgatcat 
gtaatacaag 
gcgttcaaga 
tctctttttt 
caaatgacat 
agagtctata 
tagcattatt 
aaggtcaaat 
taaaactata 
aaaacagtgt 



txt 
gtctgetgag 
gacagagtgg 
ttcatggcga 
aggacacatg 
tgaatctagt 
gcgtgagtgg 
aaacatctaa 
ttacaaaatg 
tgttaggccg 
aagcccattt 
ctgtttttta 
acactttgat 
tacgaaatct 
ttatgecatt 
aaaattatgg 
tatatttctc 
tgtgcaaata 
gagtactcat 
acctaaatgt 
cactccctga 
tcttaaataa 
tattttcaag 
actgataaat 
ggatatcagg 
ctaaacagga 
ccagcctggg 
aaaatctgat 
ttaacagcta 
aaactattat 
tttattttaa 
attgtataac 
atgectttta 
aagtcactga 



aaaagctgtt 
aataacagta 
ggagcaaaag 
cattggtggc 
ggaaagagct 
tggtgttggt 
gtaaactccc 
cagtggtagt 
aaaagacaac 
acaagaagee 
aagtaggaat 
tctacatcac 
taataaatta 
aattaggtca 
tgaacaaatt 
tttggttgct 
ttttaaatta 
gagatatact 
aaaagectat 
acaaggatgg 
tgtttatgta 
ataatgegga 
ttaaatagtt 
gttaatagta 
ggctgaggga 
caaccaagca 
ggaaactatg 
atattactca 
ttttacataa 
atgttagaga 
aatgtaaaga 
ttacatacat 
c 



<210> 3 

<211> 4256 

<212> DNA 

<213> Homo Sapiens 



<400> 3 

acacttccca 
cctaaatcat 
tcatagcaac 
actgttattt 
ggtacatttc 
ttatattatg 
acaaagagaa 
gtatatttat 
aaaggttttt 
ttaaatcatg 
ggatgcttaa 
ctttaaaatg 
agccaacatc 
aactatccac 
tgtaggacta 
agttacccat 
gggagaggag 
ctatatcaag 
ggtaactaca 
ctacagcaaa 
cttctgaata 
tgtcacccag 
gggttcaagc 
ccacccccgg 
ggtctcgaac 
tacaaggegt 
cgatatttat 



ggtttatgat 
cttggaaaca 
cttaaaattt 
gttatattag 
atactaaaca 
gttattaaat 
agtataagca 
tgttttgtgt 
cttatatttc 
atagaaatcc 
acccatttga 
ttatataaat 
atttcccctt 
ttttcttttt 
caaagecaaa 
gtattaatta 
aaacacctct 
taagctaaag 
tattacctct 
catatattta 
atttttctct 
gctggagtga 
gattctcctg 
ctactttttg 
tcccgacctc 
gaggcaccgc 
tccaacatac 



ttgagagttc 
aagecattte 
gagtgetgea 
tttgaggatg 
caaattgeat 
cctgtcctga 
aaccaaattg 
attcaatgaa 
atagtatgaa 
aaagcaaaaa 
taatatatcc 
aaatgttcct 
ttctacatgt 
cctttttgtt 
ctcccgggtt 
cttttaaatc 
gttcaaaaat 
atagtaaaag 
ggaatgatca 
tttgccctac 
ctctctttct 
agtggctcca 

cctcaacctc 
tatttttagt 
aggtgattcc 
gcccggccgc 
aocacagatt 



attaaacaag 
cagagaggaa 
tggtggaagt 
gtggtgttaa 
aatttgecta 
gaaagtgaaa 
cagctacaaa 
gtccttcgtc 
aaatctaaaa 
gaaaatgaat 
atttgataat 
atattaaaca 
tcttctcccg 
tttaaacaaa 
caagctgttg 
ctcccctaac 
gaggaattga 
agcaaaaatg 
catgaatgtg 
tcagttaaaa 
ctctttcttt 
tctcgctgtt 
ccgagtagct 
agaggegagg 
ccccgccttg 
ttctgaataa 
tccactgata 
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agatggtcac 
ttttaaaata 
agacaattta 
agaggagtta 
aatcaaggaa 
ctgactcagt 
aagaaagaca 
ttggtcataa 
agtaacccat 
caattgaatt 
atactaatat 
ccaatgtagt 
tctccattaa 
taaggtctct 
gcaaaatttt 
tccctcacaa 
aaactcttat 
ttagcagata 
gctcattatt 
ataaacacaa 
ctttcgacaa 
cactacaacc 
gggattacag 
tttcacctgt 
atctcccaaa 
tttcgatcaa 
atccctccta 



ctctttggtt 
ctgtctgcag 
ttttaggata 
cttattttta 
ttatactaaa 
tttcaaagag 
aaatgttgca 
aactagcett 
atgtaaatat 
aaaatgtgta 
gaatttagta 
taggattcta 
aaattgtcaa 
tctaagatat 
agagatgeta 
aacaggagta 
cacaaataaa 
ttcccaaaat 
tcctaagttc 
tatgtagttg 
agtctcaetc 
tcagcctccc 
gcgcctgcca 
tggecagget 
gtgaajgggat 
aatfcfcatatt 
gtaagaaaga 



2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4491 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



seq_prot . txt 

taagctccat ccaggtatct gtgaattgga ggctaagtag tcccagcaca tcttacattt 1680 

ctttaagact ccctttttat cccaaacgtt cgtaaatttt gtatctgata aagagcatac 1740 

ttccatctaa tacaaatatg ttcccccctt cagatcttct cagcattcga gagatctgta 1800 

cgcgcgtggc tcctcattcc tcttccttgg cttcccaagc ccccagggcg tcgccaggag 1860 

gaggtctgtg attacaaacc ccttctgaaa actccccagg aagcctcccc tttttccgga 1920 

gaatcgaagc gctacctgat tccaattccc ctgcaaactt cgtcctccag agtcgcccgc 1980 

catcccctgc tcccgctgca gaccctctac ccacctggat cggcctccga ccgtaactat 2040 

tcggtgcgtt gggcagcgcc cccgcctcca gcagcgcccg cacctcctct acccgacccc 2100 

gggccgcggc cgtggccagc cagtcagccg aaggctccat gctgcrcccc gccgccggct 2160 

ccatgctgct ccccgccgcc cgctgcctgc tctccccctc tccgcagccg ccgagcgcac 2220 

gcggtccgcc ccaccctctg gtgaccagcc agcccctcct ctttcttcct ccggtgctgg 2280 

cggaagagcc ccctccgacc ctgtccctca aatcctctgg agggaccgcg gtatctttcc 2340 

aggcaagggg acgccgtgag cgagtgctcg gaggaggtgc tattaactcc gagcacttag 2400 

cgaatgtggc acccctgaag tcgccccagg ttgggtctcc cccgggggca ccagccggaa 2460 

gcagccctcg ccagagccag cgttggcaag gaaggaggac tgggctcctc cccacctgcc 2520 

ccccacaccg ccctccggcc tccctgctcc cagccgcgct cccccgcctg ccagcaaagg 2580 

cgtgtttgag tgcgttcact ctgttaaaaa gaaatccgcc cccgccccgt ttccttcctc 2640 

cgcgatacaa ccttcctaac tgccaaattg aatcggggtg tttggtgtca tagggaaagt 2700 

atggcttctt cttttaatca taagaaaaag caaaactatt ctttcctagt tgtgagagcc 2760 

ccaccgagaa rcgaaatcac ctgtacgact agaaagtgtc cccctacccc ctcaaccctt 2820 

gattttcagg agcgcggggt tcactaagtc agaaacccta gttcaaagga ttccttttgg 2880 

agagtcggac tgctctctcc ttcccctccc cttcccctcc tgcgtgtaaa acggctgtct 2940 

ggggcaaggg tttctcagac gtgtacattg cctggtataa gagcagactc tgaaaagatg 3000 

aggtttattt aatacggacg ggggagaatt ctgcctgtag gcagatagga aaatggggag 3060 

ggagtcattg gaaggacgga ctccattctc aaagtcataa ttcctagacc agaaaaagtg 3120 

ctcagtgttc tagaagcaga gttgcacagt gatccaaaga ccagcttcaa atactgtcct 3180 

gtctccttca cacttctcac atttctcttt cctactgaaa ataccttgca tttttcgtaa 3240 

ttataaaggg ggaagggaat atgagtgccc cctgctttat aggggttgtt gtgagtttaa 3300 

atgatgtatt aatacatata agccttaaga acagtgccac acatcctaag ctaatacctg 3360 

ttagctcttg aattatccgc tttgaggact ggcttgcaat cttgttttga ggcatagaaa 3420 

gaaaatgctt tggagcagga cgcggtggct cacacctgta atcccagcac tttgggaagc 3480 

cgaggcgggc agatcacctg aggtcaggag ttcgaggcca gcttggccaa aatggtgaca 3540 

ccccgtctct actaaaaata caaaaattag ctggccatgg tggcgcacgt gtgtaattcc 3600 

agctactcag gaggctgagg caggagaatc gcttgaaccc gggaggcaga ggttgcagta 3660 

agccgagatc gcgccaccac cctccagcct gggtgacaga atgagactcc gactcaaaaa 3720 

aaaaaaaaaa aatgctttgg atagaattat cactattaca taaaaggaaa gtccggatgc 3780 

ggtggctcac gtctanaatc ccagcattct gggaggccga gacaggcgga tcacctgagg 3840 

ccaggagttc gagacaagcc tgaccaacat ggcgaaaccc tgtctctact aaaaaataca 3900 

aaattagcgg ggcttggtgg cgcatgcctg taatcccagc tactcggagg ctgatgtagg 3960 

agaatcgctt gaacccagga gaaggcggag gttgcagtga gccgagatcg cgccattgca 4020 

ctccagcctg ggagacaaga gcgaaacttg gtctcaagaa aaaaagaaag aaagaaagaa 4080 

agaaagacca agaagaactt actxcctgaa aagattatgg gcaccctcca ccaccctcac 4140 

ttacaaagaa aagttaaaca gcactaaaga gtataacaag cgcaaggagg taaaagttct 4200 

aatttttcct gtgactacta ctttttaagc ttatcaaaaa catgtactac gtttta 4256 

<210> 4 
<211> 4414 
<212> DNA 

<213> Homo Sapiens 
<400> 4 

aatgtctgga gtatatartt caatzgaacat tcattttart ttatttctct ccattcctga 60 

atcaagcaat cttgaatcta aagttgctat gattagcact gaaaagacca ctggactatt 120 

aattgtgtga ctttgggaca gtaactttct gcaccttagt ttgtttacat gttatacatg 180 

aaggttgaag tctgattctg ctctgtgact atcattctaa acatctgatg aaatcaaatt 240 

tcagtgtttg gaatggtagt acaataaatt tactaagaat aaataattca ctgcaaaaac 300 

acattgattt ccaaatgatg taactgacag ttatattact gcagagggct gataaataac 360 

aaaagaaatg aaagatgcac atggtgagaa ctgaaattat cctgacaagt cttctacctg 420 

tttatcactt aaaatcaatg accatgctga atgcctacaa attacaaaat ataaaagaaa 480 

tcttataaat gcgcatgtac aggagtctaa gttactaaaa gttttaaagc ataagtttaa 540 

accaaactaa tcaaagaagt tgagaggaaa aattggcttt catctttaat cactactgtt 600 

ttgaggtcct atgtttaata taattttcta agtagaggct tcagagagaa gagttgtgag 660 

gatactttca tarttgtgta gaaggaaaag tttgccatcc attctagtat ccctagtgtt 720 

atactgatgt gcaccttgga tttattttgt tcctattgta taaactcata cttgacttca 780 

aagaaaagga aaatccaaag tccctctttt ctaaggggac agaaatcctt tgtgtcaact 840 

gtttgaccct tttctctgta aggtcctatt ggaaatcttt tgtaacacaa tgcaggggac 900 

tcttccatgt gttgatgctg tttacacagt ggggtgggcc tgactgaaga aaaaaaatcg 960 

Selte 4 



seq_prot.txt 

catatacgca tgaaagatta tggtcttatt tccggaaagc atgaaaggtg attgatactt 1020 

ccaagaagtc cctgttactc aggaaaatta tcaaatattc tactcagaga tacttggaaa 1080 

gactgaagga aaggaagaac gaagaaagca gaatctagac ttatgtgggg agagatttgt 1140 

ggcagaggaa aagtattctc tttgaatccg acaagggatt tgcctggggg aatttcctgt 1200 

ccagcctttt attaccaggg tcttttgaag ccgggctccc cattgggcag ttccctggga 1260 

gtgcagtggg gaattcttac actttccctc taggtccccg aaggatctcg ttttctcagt 1320 

gtctctttca ggttggcagg agccttgagc ctgacacttc cctttgatgg gacaggcaag 1380 

ctctgtgggc gcgtaaacac gctgtaacca agttctttgc tgattttaca gttttgtgtg 1440 

ctcccgagaa gaagtgatcg tactcaattg tctattgctg gcctgccccc taagagcctg 1500 

ggggctcctt tcccctaacc cagaactagc tgcacggggg gcggggaaat gggggtgggg 1560 

aaggagtggg agggcagtgg tttccgcgag cagagcgatg ttactgagtg agtccctgaa 1620 

tggggagcgc tgctgtcccc aagccgattg gtacttcttg tcaggaagaa acgccaagag 1680 

gtgggagtgc ctggggaggg aggcaggcgg tccctaccgc aggcgcgggg agctgccttt 1740 

ccgcccctcc gcctgctttc caagcctgga ctcttaggag tggctgaagc tgcggagcgc 1800 

ttttggagcc tgtgaatgaa ccctcctcct ctccctcctc cttcttctcg ctgagtctcc 1860 

tcctcggctc tgacggtaca gtgatataat: gatgatgggt gtcacaaccc gcatttgaac 1920 

ttgcaggcga gctgccccga gcctttctgg ggaagaactc caggcgtgcg gacgcaacag 1980 

ccgagaacar taggtgttgt ggacaggagc tgggaccaag atcttcggcc agccccgcat 2040 

cctcccgcat cttccagcac cgtcccgcac cctccgcatc cttccccggg ccaccacgct 2100 

tcctatgtga cccgcctggg caacgccgaa cccagtcgcg cagcgctgca gtgaattttc 2160 

cccccaaact gcaataagcc gccttccaag gtaatcacgt ttcttttgtt ccccccttaa 2220 

aaaacaaaaa caaaaaactt atagaaaaaa acccgcgagc ttagaaaaaa gaagcaattg 2280 

gtagaaggct ttaattaagg caaagagctg taaggcgaag ttaagaaaat gtaggcactt 2340 

aaaaaatgca ggtaactttc ataagggctt ttggggagag gcatacagag ggaccttggt 2400 

gttgaaaaag attcagacaa aagaaaccca gggtggggtg gggggtaaaa tgactaacgg 2460 

aattggggga agggagggaa taaattgtaa agaaatcata gaaaagtggt gggttcttga 2520 

gctggagaga agagagggac ctttggcact ttgatttttt tgttgttgtt gttcttaaca 2580 

cgctcgaggc aaaagtttga atggggacta ccaagacttg ccacagacaa gtccccgaag 2640 

ccgccttggt gcaggccacc tggtttccca gcccctggtg tgtggtcagt gcctggtgtt 2700 

cctggaaagc cactcccggg cagctcctga cagtgcgacc cggcgcccaa gcagcctggg 2760 

accttgcgcg gacctgaccc cttcagaccg caggcagtct gggaggaggt ccggccgggg 2820 

gaggngcagg atccccgccg tgtctctttg acgacttggg gactgtcacg gttctctccc 2880 

ggcgcccctg ggttcttttg tcctgcacgc ggtgcgaagg ggccagcagg gaaggagcag 2940 

aggatggggg gtggggttgt tggagccccg cggaggtctg ggaggcccct gggcgggaaa 3000 

agcctgttct gaatcggcag ggatgtgcaa caatttttct caccctgaag agtgaaatag 3060 

ggtctgtcgc tcccatctca acaagcaaac cggcacccag agcgcactgc agacaaaggt 3120 

ggctcgggga cccgaatcag gggcctctgg gtcagtcctc tcgcccagac tacaggagtc 3180 

ctccgtttcc ttacatattc cccaccctct cctagttctc cgctctagtt gagcaacttt 3240 

actggcaggt: ctagcggcag cggcgcgcgc gtgtgccggg agccccgggg gacgtcctcg 3300 

gctggagcgc cccaccgtcc tgcagaggcg tgggcgctgt agggcgacat ccctggtgcg 3360 

tgcagaccta gggcatccgg gttgttctgg cccgcggtct gtgtcactgg gcgtggagcg 3420 

gtctgggtgt taggcagagg agagcggggt agaaaaatag tgtcatagcc taaactattt 3480 

gctcuccaar ccaactgcgt ctcggcaagt cctgcatgtc cctgggagat gctgcgggaa 3540 

gggggagaaa tctacacggc gcctggagag ttgtcctgcc gcgcgcacac ccgcggcaga 3600 

gtctctctgg gtcgcgttcc tcatctttgt gtctccctct gtctccatct tctctgcctc 3660 

ccttcccctc tcccagcctc tgttctctct ctctagcttc tgcgtcccct cccccacagc 3720 

tgacaaatga atggctagtg tgaaatccct gcctttcccg tgctccaagg cagcagggag 3780 

ggaggagcga gggaggcggt gcgctccttc ggatctgccc ccagttcagc tcacaagact 3840 

tgcagaacct agatgtctag gaattgggag ttttgcggcg ggtgtgggcg ccccctgatg 3900 

gagaagiiccc gcacaggcgg agaaaaacaa gccccccaga ccaagcgagc atctttcaca 3960 

accttgtgtt caaggatgga aggccctagt tttcccttaa gtcattcatt ttgcactcca 4020 

caatctgcgg cgtacatcta ggagtttgga ggaccctgaa aaaaggtctt ggtctgtgca 4080 

aagtgcagag atgccttctc gcgagtcggc tggaacgcac gcggcgccct gggtccagtt 4140 

cgccgcttag tggacgaacg cacatggcca gggtggagcc aggtttcgga ggtctagacg 4200 

cgccaccctg ggcgtcttta agaaagataa tacacatacc gtgctctcaa aacccatgac 4260 

cacttgaacc caacgctagg gccccgcccc aggatattgc aaaagaaggg ctcctaatcc 4320 

aaaacttaaa cttcctcaac ttcagggcgg gcgtcggacg ggaaagtgga gagaaggcgg 4380 

gcagtgggag gaaaaaagaa agggaaggaa ggga 4414 

<210> 5 
<211> 4001 

<212> DNA t 

<213> Homo Sapiens 

<400> 5 . ; <' 

ctgatatgga atagagtcaa gatttttttt tttttttttg acacggagtc tcactctgtc ; 60 

tcccaggctg gagtgcagag gcgcaatctc agcucactgc aagctctgcc tcccaggttc <120 

Sei te 5 



seq__prot.txt 

acgccattct cctgcctcag cctcctgagt agctgggact acaggcaccc gccaccacac 180 

ctggctaatt ttttgtattt ttagcagaga cagggtttca ccgtgttagc caggatggtc 240 

tcgatctcct gacctcgtga tctgcctgcc tcggcctccc aaagtgctgg aattacaggt 300 

gtgagccacc gcgactggcc agattcaaga tttgaaccca ggtcctcttg gtcccagagg 360 

cccctgtttc tcaactccct aggatggcat agcaacctgt cccacaagag gtgcctgctt 420 

taagtgtgct cagcacatgg aagcaagttt agaaatgcaa gtgtatacct gtaaagaggt 480 

gtgggagatg ggggggaggg aagagagaaa gagatgctgg tgtccttcat tctccagtcc 540 

ctgataggtg cctttgatcc cttcttgacc agtatagctg cattcttggc tggggcattc 600 

caactagaac tgccaaattt agcacataaa aataaggagg cccagttaaa tttgaatttc 660 

agataaacaa tgaataattt gttagtataa atatgtccca tgcaatatct tgttgaaatt 720 

aaaaaaaaaa aaaaaagtct tccttccatc cccaccccta ccactaggcc taaggaatag 780 

ggtcaggggc tccaaataga atgtggttga gaagtggaat taagcaggct aatagaaggc 840 

aaggggcaaa gaagaaacct tgaatgcatt gggtgctggg tgcctcctta aataagcaag 900 

aagggtgcat tttgaagaat tgagatagaa gtctttttgg gctgggtgca gttgctcgtg 960 

gttgtaattc cagcactttg ggaggctgag gcgggaggat cacctgaggt tgggagttca 1020 

agaccagcct caccaacgtg gagaaaccct gtctttacta aaaatacaaa aaattagctg 1080 

gtcatggtgg cacatgcctg taatcccagc tgctcgggag gctgaggcag gagaatcact 1140 

tgaaccaggg aggcagaggt tgtggtgagc agagatcgcg ccattgctct ccagcctggg 1200 

caacaagagc aaaagttcgt ttaaaaaaaa aaaaaagtcc tttcgatgtg actgtctcct 1260 

cccaaatttg tagaccctct taagatcatg cttttcagat acttcaaaga rtccagaaga 1320 

tatgccccgg gggtcctgga agccacaagg taaacacaac acatccccct ccttgactat 1380 

caattttact agaggatgtg gtgggaaaac cattatttga tattaaaaca aataggcttg 1440 

ggatggagta ggatgcaagc tccccaggaa agtttaagat aaaacctgag acttaaaagg 1500 

gtgttaagag tggcagccta gggaatttat cccggactcc gggggagggg gcagagtcac 1560 

cagcctctgc atttagggat tctccgagga aaagtgtgag aacggctgca ggcaacccag 1620 

gcgtcccggc gctaggaggg acgcacccag gcctgcgcga agagagggag aaagtgaagc 1680 

tgggagttgc cactcccaga cttgttggaa tgcagttgga gggggcgagc tgggagcgcg 1740 

cttgctccca atcacaggag aaggaggagg tggaggagga gggctgcttg aggaagtata 1800 

agaatgaagt tgtgaagctg agattcccct ccattgggac cggagaaacc aggggagccc 1860 

cccgggcagc cgcgcgcccc ttcccacggg gccctttact gcgccgcgcg cccggccccc 1920 

acccctcgca gcaccccgcg ccccgcgccc tcccagccgg gtccagccgg agccatgggg 1980 

ccggagccgc agtgagcacc atggagctgg cggccttgtg ccgctggggg ctcctcctcg 2040 

ccctcttgcc ccccggagcc gcgagcaccc aaggtgggtc tggtgtgggg aggggacgga 2100 

gcagcggcgg gaccctgccc tgtggatgcc ccgccgaggt cccgcggccg gcggggccag 2160 

aggggcccgg acgagctctc ctatcccgaa gttgtggaca gtxgagacgc tcagggcagc 2220 

cgggccctgg ggccctcggg cgggaggggg cagttacacg gcagcggctc gagatggccc 2280 

atccaagaga ctggcgctrt ccaggctccg aggggctccg ggaacttgtc aaagaagttc 2340 

tctgaaattg ttcagaaagt tttcccgcaa agggtgtatt gcgtagagcg cgcgcgcgcg 2400 

tttcccccct tcttgagccc cctcaagctt tctcaaagcc tttccagttg gcagcctccg 2460 

cctccggact ggcctgggct ggattccttg ggggggtcct ctgccctgcc cctcctccag 2520 

cccctccccg ctccccttca gacgattttg gtttggttgc tcctgcttct ggcggggtcg 2580 

ggtgtgtgtg tgtgtggtgg agtggagggt ggcatagcaa cctgtcccaa ccagagccgg 2640 

ggaggaaagg gtggcccgga gggtggcctc ttgctggggt ctgggttggg ggcgggggag 2700 

acgtttgctt tgaacagatt cttggggcca gcttagggac tgtgctctgt gacttttgga 2760 

gcgcgtggac catggagggg tgggggtggg tttcttgggg tgtaaagtgg gagagttccc 2820 

agagaaggaa gctaagaaat aaggccagat gggagcctag ggagggctgc gttgttctgc 2880 

tgccttttcc tuggtgctgt gcgtggggaa gggtgagtgg gggcagtgtg tatcctgacc 2940 

catctgtcca cctgtgtgca ttaatcataa aagctaacat atagcctggg ccaggtatac 3000 

tctgccagga actgtttgtg gtgttttgca tgcattctcc tttaatccta gaacacccct 3060 

atagtggaag ttctgccagc attctggact gagtagcagt ccagaggttg agtagcagct 3120 

agtaagtggt ggggtcaaga tgggacccca ggcagtgcga cccccaacca tgcattcgaa 3180 

atcgctatat ggatgagtgc acctggagca atgagggaca ctgctccctg agtcactggg 3240 

ctgcagggga gacaaaatga aagtgttctg ggagtcgtgg gtggtctcca taggtcagag 3300 

ggtctgggga gggagtgggt gtxatcgtgg ctgtgtgttg cccgaggggc cctctgtgag 3360 

tgagtgcatg gccgtgttat ctctgcaggt ctacgccagg gtgttcctca gttgtgtggt 3420 

ctttgtattt gtgtgtctgg gctttgtgtt gccaaacagc agtctctctg ctgacttggg 3480 

gacacaggct gaactctgtc ctctgcagga actcccttaa ggtgctgggc cagatctgcc 3540 

ataaacagag ggaggtagcc ttctatggcc acgccttctt gctgaggaag aaggttcctc 3600 

tcttccaggg agtacatcct tgccctccct gtttcccaga caagcatctt cacctctcat 3660 

cttctgatga gaagggtgag gccatactga gctgtcaggc tgagctgctg cccttcctca 3720 

ccttgggctg ggagttgatc agggaatggc agttgctgca gagctggatt tgagggctgg 3780 

gttctctgga tggggcctcc tcatgtcctc acccctcaac ctgcactatt gattgtgttg 3840 

tgcaggagtt agttaaaaag tcattgcaca gcctgggcaa caaggcaaaa ctctgtacaa 3900 

aaaatacaaa aattagttgg atgtgattac acgtgcctgt agtcccagct actccggagg 3960 

ctgaggcagg aggatcacct gagcccagga agttgaggct t 4001 

<210> 6 . / . : , 

<211> 4334 . ' . 

_______ _ 



<212> DNA 

<213> Homo Sapiens 



seq_prot.txt 



<400> 6 

aaattaacca 

aatatgatat 

gatttttaaa 

catttcaccg 

tcttaatact 

ctgacgtggt 

tgtccctcag 

ttttgactta 

ttaacattat 

aaaagctagg 

gcagcaccgg 

cccccggcca 

gcgccttttc 

gggtgggcgg 

cggacgcccc 

cagttcagcg 

gccaccgcct 

ggccgcgaag 

gcggtcgcca 

taaagccgct 

gcccgcaggg 

ctcgacggca 

gcgcccccgg 

gacccccgcc 

gtcgctgcca 

cgccgcctcc 

agcaccaaca 

gggctccgca 

gagcgccgcc 

ggaggacaat 

ctaggggggt 

ccagctcccc 

tccctacctt 

cgccccaagg 

gcctaagacg 

ctgcccccgg 

ttctaatcca 

gccagcatcc 

ctggaatttg 

ttcttccaag 

agattgcagt 

ttctttttaa 

cagaacctca 

tttctctcac 

ttctctgacc 

atcttgacta 

agaaacaaat 

ctagttagtc 

aaaaactgga 

ttgcctgaaa 

ctccccccca 

tcagatactg 

tgcttccgtc 

gatgggcatg 

acccaagccc 

tgatctggag 

aattgccaca 

aatctttcaa 

otccctagiag 

tcatgttcat 

gcctggttct 

agggctgggt 

ggattcctgt 



cgaaaaaaat 

tggttccaga 

atcagacaaa 

aaactgtaag 

gcctacaact 

ggataagaaa 

gtttaagctt 

tatttgctgc 

gacttttgta 

aattaaaatc 

acccaggatg 

atcagagcac 

gaatctcccc 

ggctaacggt 

ccccaactcc 

ggtggggcag 

cccgccgtcc 

acagaagaac 

ggggacggtc 

ggtccctggg 

aagggaggga 

ccccggagcg 

agagcgggga 

caccaacccc 

cggcccagcc 

tccgagcccc 

gacccgggcg 

ttggccgaga 

ccccgggacc 

gccgggggtt 

ggtctcgccg 

attctcgaga 

tccaaacttt 

gcagcactcg 

cccgagcccc 

caccctacac 

ctcagcccac 

tcaggttccc 

atcgcctttg 

ccttctgcgg 

taagtgttgc 

aaatcgaaaa 

gcaaaggcac 

aaaaagggga 

aagaccgtgt 

agcagaagag 

gtgaccaacc 

accagggatg 

ncatacaagc 

tatgcttgtt 

agccaatcga 

agccttgaaa 

tgtcattttt 

agacccctca 

aagaggtcct 

gttcacagag 

caaaagtctc 

aacactgata 

cagatgtcta 

ttttaatctt 

gccagtfctgt 

taaagtttcc 

ccttttafta 



tatcaaaatc tactaaataa 



aaatcaagtt 

gtcacaaggg 

aagatattcc 

cattgtgcaa 

acgcccaagc 

ccttgcccaa 

cgttatgtca 

gacgccagct 

agaggcagcg 

ggcgtggccc 

gccttgcgct 

gaactcaaca 

ccaatccggg 

cgccaaaaac 

gccccgggaa 

ccatgcccaa 

cctxgtgggg 

gcagggaggg 

gcaccgcccc 

gggagggagg 

gacgcccggt 

caaaggcgag 

tccgaggccc 

cctcccgccg 

gcacccacct 

cgcacaaaag 

gcggcggcac 

ccgaagcacc 

gcagggcgcc 

tggaagaggg 

acaagggcag 

ggccgcctgg 

gacagagcag 

tgtgacagca 

cgacaccggc 

aggtcgcggg 

gagccggtga 

gtctaacaaa 

atgatagggg 

ttttgttctc 

aggagccaca 

aaaggaaatt 

aaaaaataaa 

cctcggttgg 

atatctttat 

ttcctgaaac 

tgccagagat 

agcagcacac 

gcggcatttt 

agtggttttc 

ggcttcttct 

ctgaacagaa 

ggaaagcgac 

tacccactga 

gaaggttcca 

acattcctga 

atttgaaagc 

tttccatgga 

ttttagcaat 

gacaatgttc 

cctagaatga 

ttattattat 



aactgagctt 

tgaagaattt 

agattcaact 

atgaaatacc 

taatagtaac 

cagcaagcaa 

tgaaaaatgc 

agtaccaaaa 

acaagattta 

tggtccgggc 

ccggcccctg 

ccccggagcc 

taactccgcc 

acctccaggc 

caggcccccg 

cctgccgcca 

cccgggagcg 

tcgcggccct 

gtcccttcag 

taaacgaggg 

gatcgcccag 

gctccgcccg 

gcagccccgc 

accgcgtccc 

gctcagtccg 

cgccaaacag 

gtgccgcccc 

tcgggaagtg 

tggggagcat 

cgtggcagtg 

ggcggagcag 

actccacctt 

tgccccttct 

caatgccact 

gtgtcaccca 

cgtggccctt 

ccagggacct 

gccactcagc 

aggaagagca 

agagcacgtg 

aggtcactgt 

ggtctgaccg 

ugaaatgaac 

tggtcaatac 

tttattaatg 

aacagtccta 

aaaatacaac 

tacacaaagc 

ttccaacaat 

acttagcctg 

cagaccaaac 

tctcagacat 

agataggcag 

acaggctttt 

ctcatttaat 

ctattaatgt 

acttacactg 

aaccatagca 

gccacttgge 

caagctcctc 

gcctttgaat 

tattattatt 

Sei'te 7 



acattgggaa atatgacctg 
tctcggtcat aaaactcttg 
ttcccccata aactctttaa 



caacagtgcc 
cacccctgct 
caataatgtt 
aatgtttatg 
tgtatctctt 
acgtaggatg 
agaaaagcag 
gaggaaaaga 
gccccgcccc 
cgcgcgcgtg 
cctgcctgac 
cgccgccacc 

ggtggtcccg 

cccacgctgt 
ccgcccctca 
ggctgcccgc 
acaatgggga 
cccacgcccc 
ccccctgccc 
agccacacac 
ccgccgccct 
tcttgctggc 
agccgcctga 
aggcacgtga 
cgccccccac 
tagtcctgtc 
ccccaggtgt 
gaccccggac 
gcggagggcg 
gtagagccac 
agtgcgccac 
ccacgcccct 
tattgtgaca 
ccctcactgt 
gtgcgggaaa 
ttaaatgcag 
atagccgttc 
ctccccagag 
tctccctgac 
tgccttagaa 
atggtccccg 
accaagacac 
agctgcatct 
atttagaaag 
taaggggtgc 
aaacaattag 
cgcccttgtg 
tgagcaggac 
ttagcttcaa 
cattagaaaa 
gcaacacaaa 
tatcttgaag 
gcattttttt 
tgtgacagaa 
cttcccccca 
aggaacgtag 
ttctgggttt 
acagctgctt 
aaaaaggtgc 
attattattfa 



cccggagcca 
ttgtctgtgt 
gaactcctat 
aacctctaga 
gttgatcttt 
acaggttccc 
agaattagac 
ggaccggtcg 
cttcctcccc 
ggaaggggag 
tccccctgcg 
acccccttca 
cggtgagctc 
ccgcgcgtcc 
atcaccagct 
agtttggtcc 
acccggcggg 
ctattgtctc 
acgaacagcc 
aaagcggagc 
gcagcagggg 
ctcagccagc 
ccacactctg 
cctccgcgcg 
aacccgcgtg 
ccggagggca 
gggcctggga 
taggtccgaa 

ggggggtcct 

atgcccgcct 
ccacatgggt 
gcagggcccc 
tcatcgtaat 
gtgcgttctt 
gcgtggggct 
actcca-tctg 
tagactacac 
cagcactggg 
ctggatagct 
taactggccc 
cctxctccca 
ctgctttgag 
ccccaaatca 
actctcatgc 
ccaagacaat 
caatgcgctg 
tggggtcggc 
caggaataat 
aatggctagg 
gttggagaag 
tgagcaagga 
tctttgtatt 
aagtacaatza 
aaaagaaata 
cattttcttt 
atgttagaac 
gttgggagat 
gaggactgag 
ttrttgaggtg 
ttattattat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 



seq_prot.txt 

tatttgactt aaaaaaaatt gagacagggt ctcactatgt tgcccaggtt ggcctcgaac 3840 

tcctgggttc aagcaatcct cttgtctcag cctcccaaag tgctgggatt acaggcatga 3900 

gccaccacac ccagtctgat ccttgttctt tagggtgggg ctgccttccc aagagccaga 3960 

tcacagctca ttattgtcag cttttgtggg tgtcctatat aagccttgga ataggaaggg 4020 

ctttcaactg taaggagaga caacagtttt gccacttcac cttggagagg ggcagaatcg 4080 

cccttctcga aagcttccta aaacgaactt caagatccac tctcttctga agtggcagca 4140 

aggagttaat gttcacatct tggaactgtc tatctttgcc cataagaatt atttttgctt 4200 

ctgctcagat ggtttggttg gattgttatg gagctggctg tgactatgga gccacacaac 4260 

gaagtggaga tagggcagtc atccagctta trttgatctgt gtagacagct aaaggagagc 4320 

acttcaggct tcaa 4334 

<210> 7 
<211> 4528 
<212> DNA 

<213> Homo Sapiens 
<400> 7 

cgcgcagacc tgccaggaag agcacaagaa gaaacacccg gactcttctg tcaatttcgt 60 

ggaattctcc aagaagtgtt tggagagatg gaagaccacg tctgcaaagg agaagtgaag 120 

tttgaagaga aggcaaaaag tgacaaagct cgctgtgaca gggagattaa aaattacatt 180 

cctccgaaat gtaagaaagg gtaagaaagg aaagaaaaag gatcgcaatg ctcctagaag 240 

gccaccatct gccttcttcc tgttzttgctc tgaacatcgc ccaaagatca aaagtggaca 300 

cccaggccta tttgtcgtgg aaactgcaaa gaaactgggt gaaatgtggt ctgggcagtc 360 

agccaaagat aaacaaccat atgagcagaa agcagttaag ctacaggaga gatatgaaaa 420 

gggtattgct gcatatxgtg ctaagggcaa aagtgaagca ggaaagaagg gctcaaagaa 480 

gaacaaacca gaagatrgagg aggaggagga ggagaaagaa gatgaagatg aggaggaaga 540 

gggtgaagat gaagaataaa tggctatcct ttaatgatgc ctgtgcagtg ggcttgtttt 600 

gctaagaatg tgaattctag tacagctcag tattagcttc agtataaaac tgtacaaatt 660 

ttcgtatagc tcataagatt ctctgtacag aaaatacttt tcttttcttt tctttttttt 720 

gagacagagt ttcgctcttg ttgcctaggc trggagtgcaa tggcgtgatc tcggctcacc 780 

gcaacctccg cctcccgggt cctggttcaa gcagttctcc tgcctcagcc tcctgagtag 840 

ctgggattac aggcacatgc caccacgctc agctaatttt tgtattttta gtagagatgg 900 

ggtttcacca tgttggccag gctgctttca aactcctgac ctcgtgatcc gcctgcttcg 960 

gcctcccaaa atactgggat tacaggtgtg agccaccgca ccctgcctaa tgtccctaaa 1020 

tarttaatgg tttttaaaaa atttattgtg tatggcagca cagcacactt gtaggaatta 1080 

gtatcaacag tacatcttgc gttttttaag atgctgcatt ttttaacatt ttgtaataaa 1140 

attatgcgta tcaaaaaaac aaagaaattc cgtgtgtagt tcacactcac agcacatctc 1200 

cgtccaggca cttgagagaa tgactaggag gggttcttgg aggaggtggt ctttgaacgg 1260 

agaatccatc ttcaaggatt ctgtctgtaa tggtcaccaa gtatttcctg agtcacttcc 1320 

atgtgtcctg cagttctctg aaggggcgtg ggacctaccg atgccaatta tccagcatta 1380 

tctccagatt ccaagaagtt ggggtgtgag ccagcaatca gtacagaaaa gagataccaa 1440 

aataagtttg agttggggag tgttccttca acttcagttt tctggaagag atcttttttt 1500 

tttttttgag acagagtttc gctcttatirg cccaagctgg agtgcagtgg cacgatctcg 1560 

gctcaccgca acctcctcct cccgggttca agcgattctc ctccctcagc cttctgagta 1620 

gctgggatta cagacatgca cctgtaattt ctactaaaaa tacaaaaatt agccgggcgt 1680 

ggtggcgcac gcctgtiaatc tcagctactg gggaggctga ggcaggagaa tcgcttgaaa 1740 

ccaggaggcg gagattgtac caagatagtt tgttccagct aaacaacctg gcgctagtgc 1800 

aggaaaaggt ggaaggcacg gggctagcac aggagggttc aatattttca accttatcaa 1860 

gccatatttt ggcaactctt gtttttcacg agaagccccc gctgggcttg tcccagcgct 1920 

gtcctgaggc ttcccccatg agttccgana gggcagaggc cgccctgagc gtttctcttt 1980 

cccctggtcc aagagtggct caaaagaagg atttttgact ggaattggcc actttgtgtt 2040 

actttttgac ccrtgacctc gccccaaagg gggatgcggg ggaggggctc tggtaggggt 2100 

ggccccgctc cttccaggtc cgcaagccca ggttcccgcc caccgggctc agcccaccct 2160 

gcggccgttc agggaggccg ttggcacccg tgacctacga cccccttccc gagccccacc 2220 

gaggtcacag ccgtggcctc gtctccccat gcctgcttcc cgccccctgc ccgtgacggg 2280 

cgtctccgag gaccaatgag cgcgctgtat ccacccctcg ggcggggcca agcgccgacc 2340 

aatcgccgct cgggcgcccg gccgggtcca aacgctccaa tcgtcagcgg cggcggggcg 2400 

ggcagagggc cggggatggc aggttcaacc aacgggtggg cacgtcgtcc tcgcgaggag 2460 

gcgtgccctg cggccgggcg tgcggtgtcc gcggcggcgc agggaggggg agggaggtaa 2520 

acaagatggc ggcggcgtgt cgggcgcgga agggggaggc ggcccggggc gcccgcgagt 2580 

gaggcgcggg gcggcgaagg gagcgcgggt ggcggcactt gctgccgcgg ccttggatgg 2640 

gctgggcccc cctcgccgct ccgcctcctc cacacgcgcg gcggccgcgg cgagggggac 2700 

gcgccgcccg gggcccggca ccttcgggaa ccccccggcc cggagcctgc ggcctgcgcc 2760 

gcctcggccg ccgggagccc cgtggagccc ccgccgccgc gccgccccgc ggaccggacg 2820 

ctgagggcac tcggggcggg gcgcgcgctc gggcagacgt ttgcggggag gggggcgcct 2880 

gccgggcccc ggcgaccacc ttgggggtcg cgggccggct cggggggcgc ccagtgcggg 2940 

ccctcgcggg cgccgggcag cgaccagccc tgagcggagc tgttggccgc ggcgggaggc 3000 



seq_prot.txt 

ctcccggacg cccccagccc cccgaacgct cgcccgggcc ggcgggagtc ggcgcccccc 3060 

gggaggtccg ctcggtcgtc cgcggcggag cgtttgctcc tgggacaggc ggtgggaccg 3120 

gggcgtcgcc ggagacgccc ccagcgaagt tgggctctcc aggtgtgggg gtcccggggg 3180 

gtagcgacgt cgcggacccg gcctgtggga tgggcggccc ggagaagact gcgctcggcc 3240 

gtgttcatac ttgtccgtgg gcctgaggtc cccggaggat gacctagcac tgaaaagccc 3300 

cggccggcct ccccagggtc cccgaggacg aagttgaccc tgaccgggcc gtctcccagt 3360 

tctgaggccc gggtcccact ggaactcgcg tctgagccgc cgtcccggac ccccggtgcc 3420 

cgccggtccg cagaccctgc accgggcttg gactcgcagc cgggactgac gtgtagaaca 3480 

atcgtttctg ttggaagaag ggtttttccc ttccttttgg ggtttttgtt gccttttttt 3540 

tttctttttt ctttgtaaaa ttttggagaa gggaagtcgg aacacaagga aggaccgctc 3600 

acccgcggac tcagggctgg cggcgggact ccaggaccct gggtccagca tggaggtggt 3660 

ggacccgcag cagctgggca tgttcacgga gggcgagctg atgtcggtgg gtatggacac 3720 

gttcatccac cgcatcgact ccaccgaggt catctaccag ccgcgccgca agcgggccaa 3780 

gctcatcggc aagtacctga tgggggacct gctgggggaa ggctcttacg gcaaggtgaa 3840 

ggaggtgctg gactcggaga cgctgtgcag gagggccgtc aagatcctca agaagaagaa 3900 

gttgcgaagg atccccaacg gggaggccaa cgtgaagaag taagtatggc ttgctggggt 3960 

cggggccggg ccgggccagt cacggtgctg atggttctgt cttccttcct tctctcctcc 4020 

ctccctccct tacttcctct taacaccctg agctggaccc guctggcgcc tgtgtcctcc 4080 

gtgccaggga gagcgtggtt gggggcctgc gttacggact ttcactcagg caaggccagt 4140 

tgtcgcagcg gggcgtgcgt ttgcatgggc tcttggactc cagttaaaat gccctggtag 4200 

cgaaaccctc ctgagaaggg agcggccccc aatcccctaa gactagcccc ttggctcccc 4260 

cagctgtcca aggagcagag gcgcccagtg gaatcagcct gtgtttgttt gggccccgag 4320 

agtttgtgtg cggccgccaa cacgttttct gcgcagtgtg tggccgttac cggggccagg 4380 

cgaaatgtga tttgtttatc ctgtcagagg ggaaccctgg gctgccaaaa ataactgttt 4440 

gcaccggctt atcagtcagc aggagggaaa cgtagcctrt cctcatttgc cagggatgtg 4500 

acgctggaag catccctggc ccccgggg 4528 

<210> 8 

<211> 4616 

<212> DNA 

<213> Homo sapiens 

<400> 8 

tgaaatggcc ttattctctt ttagtttgtt taagccaatg gagccttact agaatgtgag 60 

ctacaaaaca gtagagctat tctctattca atgctgcatc tgtagggcct agaacagcac 120 

ctgaaggact ggcggaggct taataaattg ttaaaagggc agactagcca ggtgtgatgg 180 

cgtgtgcctg tagtcccagc tacttgggag gctgaggcag gaggatcact tgatcccagg 240 

atttctgggc tgtagtgtag tataccagtc aggtgtctgc actaagttct gcatcaacct 300 

ggttgcctaa ggatggttga actgacccag g-tggaaattg gagcaggtca aaactcaatg 360 

ctgatcagta acagggtcgc acctgtaaat agccaccgcc ctccagcctg ggcaacacag 420 

tgagatccca tctctaaaat aaattttaaa aataattaat tagaaaaaaa aaccagcctg 480 

taatcccagc actttgggag gccaaggcgg gcagatcacc tgaggtcagg agttcaagac 540 

aagcctggcc aacatggcaa aaccccaatc tctaccaaaa atacaaaaat tagctgggcg 600 

tggtggcagg tgcctgtagt cccagctacc caggaggctg aggcaagaga atcgcttgaa 660 

cctgggaggt ggagattgca gtgggccgag atcgcgccac ttcactccag cctzgggtagc 720 

agagtgcgac tctgtctcaa aaaaaaaaaa aaagaaagaa aagaaaagaa aaagaaaaga 780 

aaagaaaaaa actgggaggc ctaagcccat tcttggtgtc ctcacactct tctgccccat 840 

ctccttgcac ccagtcttcc cctgcaatct gtgctccata ctagccctca agccctcaac 900 

gtgacctagt atgagaactg gattctgtca ctctcctgct cataatattt tatgccttcc 960 

ctttgcccag aanatcattc tccctattgt tuttaccaan ggaactggta tttcttcaag 1020 

gacatgatca aatttgccta cttctatatt atcttctaaa gcagaattca tctctcttcc 1080 

ctcaatatga tgatattgac agggttitgcc ctxactcact agattgtgag ctgctcaggg 1140 

caggtagcgt tttttgtttt tgtttttgtt tttctttttt gagacagggt cttgctctgt 1200 

cacccaggcc agagtgcaat ggtacagtct cagctcactg cagcctcaac cgcctcggct 1260 

caaaccatca tcccatttca gcctcctgag tagctgggac tacaggcaca tgccattaca 1320 

cctggctaat ttttttgtat ttctagtaga gacagggttt ggccatgttg cccgggctgg 1380 

tctcgaactc ctggactcaa gcaatccacc cacctcagcc tcccaaaatg agggaccgtg 1440 

tcttattcat ttccatgtcc ctagtccata gcccagtgct ggacctatgg tagtactaaa 1500 

taaatatttg ttgaatgcaa tagtaaatag catttcaggg agcaagaact agattaacaa 1560 

aggtggtaaa aggtttggag aaaaaaataa tagtttaatt tggctagagt atgagggaga 1620 

gtagtaggag acaagatgga aaggtctctt gggcaaggtt ttgaaggaag ttggaagtca 1680 

gaagtacaca atgtgcatat cgtggcaggc agtggggagc caatgaaggc ttttgagcag 1740 

gagagtaatg tgttgaaaaa taaatatagg ttaaacctat cagagcccct ctgacacata 1800 

cacttgcttt tcattcaagc tcaagtttgt ctcccacata cccattactt aactcaccct 1860 

cgggctcccc tagcagcctg ccctacctct ttacctgctt cctggtggag tcagggatgt 1920 

atacatgagc tgctttccct ctcagccaga ggacatgggg gccccagctc ccctgccttt 1980 

ccccttctgt gcctggagct gggaagcagg ccagggttag ctgaggctgg ctggcaagca 2040 
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gctgggtggt gccagggaga gcctgcatag tgccaggtgg tgccttgggt tccaagctga 2100 

gtccatggcc ccgataacct tctgcctgtg cacacacctg cccctcactc cacccccatc 2160 

ctagctttgg tatgggggag agggcacagg gccagacaaa cctgtgagac tttggctcca 2220 

tctctgcaaa agggcgctct gtgagtcagc ctgctcccct ccaggcttgc tcctccccca 2280 

cccagctctc gtttccaatg cacgtacagc ccgtacacac cgtgtgctgg gacaccccac 2340 

agtcagccgc atggctcccc tgtgccccag cccctggctc cctctgttga tcccggcccc 2400 

tgctccaggc ctcactgtgc aactgctgct gtcactgctg cttctggtgc ctgtccatcc 2460 

cxagaggttg ccccggatgc aggaggattc ccccttggga ggaggctctt ctggggaaga 2520 

tgacccactg ggcgaggagg atctgcccag tgaagaggat tcacccagag aggaggatcc 2580 

acccggagag gaggatctac ctggagagga ggatctacct ggagaggagg atctacctga 2640 

agttaagcct aaatcagaag aagagggctc cctgaagtta gaggatctac ctactgttga 2700 

ggctcctgga gatcctcaag aaccccagaa taatgcccac agggacaaag aaggtaagtg 2760 

gtcatcaatc tccaaatcca ggttccagga ggttcatgac tcccctccca taccccagcc 2820 

taggctctgt tcactcaggg aaggagggga gactgtactc cccacagaag cccttccaga 2880 

ggtcccatac caatatcccc atccccactc tcggaggtag aaagggacag atgtggagag 2940 

aaaataaaaa gggtgcaaaa ggagagaggt gagctggatg agatgggaga gaagggggag 3000 

gctggagaag agaaagggat gagaactgca gatgagagaa aaaatgtgca gacagaggaa 3060 

aaaaataggt ggagaaggag agtcagagag tttgagggga agagaaaagg aaagcttggg 3120 

aggtgaagtg ggtaccagag acaagcaaga agagctggta gaagtcatct catcttaggc 3180 

tacaatgagg aaattgagac ctaggaagaa gggacacagc aggtagagaa acgtggcttc 3240 

ttgactccca agccaggaat ttggggaaag gggttggaga ccatacaagg cagagggatg 3300 

agtggggaga agaaagaagg gagaaaggaa agatggtgta ctcactcatt tgggactcaq 3360 

gactgaagtg cccactcact tttttttttt ttttttttga gacaaacttt cacttttgtt 3420 

gcccaggctg gagtgcaatg gcgcgatctc ggctcactgc aacctccacc tcccgggttc 3480 

aagtgattct cctgcctcag cctctagcca agtagctgcg attacaggca tgcgccacca 3540 

cgcccggcta atttttgtat ttttagtaga gacggggttt cgccatgttg gtcaggctgg 3600 

tctcgaactc ctgatctcag gtgatccaac caccctggcc tcccaaagtg ctgggattat 3660 

aggcgtgagc cacagcgcct ggcctgaagc agccactcac ttttacagac cctaagacaa 3720 

tgattgcaag ctggtaggat tgctgtttgg cccacccagc tgcggtgttg agtttgggtg 3780 

cggtctcctg tgctttgcac ctggcccgct taaggcattt gttacccgta atgctcctgt 3840 

aaggcatctg cgtttgtgac atcgttttgg tcgccaggaa gggattgggg ctctaagctt 3900 

gagcggttca tccttttcat ttatacaggg gatgaccaga gtcattggcg ctatggaggt 3960 

gagacaccca cccgctgcac agacccaatc tgggaaccca gctctgtgga tctcccctac 4020 

agccgtccct gaacactggt cccgggcgtc ccacccgccg cccaccgtcc caccccctca 4080 

ccttttctac ccgggttccc taagttcctg acctaggcgt cagacttcct cactatactc 4140 

tcccacccca ggcgacccgc cctggccccg ggtgtcccca gcctgcgcgg gccgcttcca 4200 

gtccccggtg gatatccgcc cccagctcgc cgccttctgc ccggccctgc gccccctgga 4260 

actcctgggc ttccagctcc cgccgctccc agaactgcgc ctgcgcaaca atggccacag 4320 

tggtgagggg gtctccccgc cgagacttgg ggatggggcg gggcgcaggg aagggaaccg 4380 

tcgcggcagt gcctgcccgg gggttgggct ggccctaccg ggcggggccg gctcacttgc 4440 

ctctccctac gcagtgcaac tgaccctgcc tcctgggcta gagatggctc tgggtcccgg 4500 

gcgggagtac cgggctctgc agctgcatct gcactggggg gctgcaggtc gtccgggctc 4560 

ggagcacact gtggaaggcc accgtttccc tgccgaggtg agcgcggagc tggccg 4616 

<210> 9 
<211> 4374 
<212> DNA 

<213> Homo sapiens 
<400> 9 

ccggcgtagc agtgggggag gggaccggcg agaggggagg aaggaagggg ggaggaaggg 60 

ggagacctgt ctgaatattg caataaaaat aaagcgagaa gaaagaagcg gactcacctt 120 

tatgaggcat cctttctggt tgtcacagct tctgtcaaga gttttgtttg gttggggttt 180 

tttgtgctgc tgttgttgct tgaagaccac aatggtttga aatgacggtg ttttaaagga 240 

gttgctggtg agagttttct ccacggatgt tgctgggttg gtgtgtgaga gcaattctca 300 

gattcctggg aaggagacag agattgacaa taaaatgggc tgtcagcggc tggagagtga 360 

gagataaaga gtgtgggtga gggaagtggc tgcagccagc acacctatgc tgattggtga 420 

tggctcaagt gtgttaatgt gtgtgtgccg gcgcccggcc tcgcctccac cgctcctcac 480 

tggcccatta gcgaagcctg acctctgtca tcatcctcca gcaaaacact tcctcctgcg 540 

cptgaaccag agcgggaaat gaggccgagc cacggttccc ttttcaaacc cactaatcac 600 

tccgcaacat gcaaatgcac cgctcgctct cacacaaaat attgctttat gcaaagcagc 660 

gccggggcct cgcgccagcc gattggatgc tccctctccc cgcctggtgc aattggtccg 720 

tttgtttgaa tatgaataca cttgttttcg gctccgccgg tgggtcccgg ctcctctccc 780 

cggcgcaggg cccccgagcc cgggctctgg cgctcgcctg catctccccg ctccgcgcac 840 

gccgagattc ggcgcggccc gccctgctgc cgcgaacttg agcggtcaag tgaaggtttc .900 

Ugggttgggg gtcgcgcccc cactccccct aagctggact cgggactccc agtcctcgga ; 960 

gggtgcaagt tt otgtgctg tcctttcttg ccaactccag gacagcgtgg cctgccgccc _1020_ 

seTte 10 



cgcccccgcc cgcccccacc ccatcccagc 
gccgcacccc agcccacctg gccccttgga 
gtcctggcag ctccattcca gctgcggagg 
gccgggccga gaactaaacg aggccagagg 
caaatcctcc accccgctct ccccgccctt 
ggtgcgcagc ggagcagagg cacagctctg 
actttgtgta cagagggttc ttgtgaaaag 
acttctccca cgtgccattt gttgactaag 
ggggaagcca ggggagataa gaggggaggg 
accaatgctg ggagggattc tctctctctc 
tgtgtgtgtg tcccactggc gatgtcggta 
atctgtgtgt ggtgcgtgtg tctccgcacc 
acggaggcag ctccccttca gctgttgcca 
ttttctcttc tgaccacaca ccagaagtcc 
cctcagtaca ctccaaaggc gcccctcgga 
ggctgcctgg tcctgcgctg tcccgagctc 
aggagaagga aggcctgggc tgctgtacac 
tgaagagctg aggcagggag gctgaaatga 
gacttaggga aagtgctgaa agtcagagct 
ctgctgactg gccgcgtcgc cgcagcccct 
cgccctgggc cgccgggcca catgtgggcc 
agccctgcga ccaaggagcc cgggcagggc 
gccagaggtg gggtcgcata accctctccc 
aacccttccc cgggagctcg cagcgccggg 
gcgccgggtg tccaggtccg cggcctcctg 
ttcggaggat gccggcaagt ggggccgcgg 
ttctctttct ctctcctttt actttccaag 
caagtccgct gtgcatgagc gagggacttg 
gtccgaactg gaggcgagta tcatttttac 
caccttctta agctttgcct cggaaagaag 
gggcactggc tctttgaatc agctccttct 
agaaaagttt taacaaaaaa tcagaaaagg 
cattgggtgg ggaagggggc tccgggaaag 
aacaaaaagg actrggctgc ttcccatgtg 
gaatgaggcc gagttttcaa agtttaagga 
tctctggact ggtctcaagg aaggtggggg 
cctctgccaa gtaccggctc tgggggcgaa 
ggtgcgggcg cggttgccag aggtgtgtac 
taattgcggc taccaacacg cccgaggccc 
gcggggattg cgctttaggt tctgtctcct 
ccctaccctc cgccccctgt ggccagaggg 
gcgggggccg ccggggtcgt cccacccggt 
ctgggggacc gtggctcgcc agagcctgcc 
catgaggtga ggcaagcatc tcacccgcac 
atcaggcctt cggggaggca aagaagtgcc 
agcttcagaa ccaagagcct cccatccgga 
ctctttcccc ggtcaagcag aggcgcagcg 
ttttttggag aaggaaagtc ttggcagaca 
tctctttttg cagaccctgg gaaggtggga 
cagtgagccc taaaggcaaa ggctcccatc 
cttccaggcc tttgtccctc tcctttctcc 
gcctcgccga gcccagagcc tagaaaagcc 
ttcaggctgt ggccactgct cggcttgggg 
ctcctgccaa atggccagtg ctaggggctc 
cgctctggga ggccagcctg tctctctctc 
tctctctctg tcttctccct gcccaccccc 

<210> 10 

<211> 2534 

<212> DNA 

<213> Homo Sapiens 

<400> 10 

aggcaggaaa gtgggacagc cggggagctg 
ctgatggcat gtggcttgga gagggcaggt 
ctcaaacaag tggcaacagg ccaccaactt 
gtgtCcaaca aacetactgg gtgactaatt 



seq_prot.txt 

cggggcccca agacacagag cacagcggcg 1080 

agaagaggaa aggggcggga gcgtcactgg 1140 

ccgcggcgaa gcctaggccg agaggaggtt 1200 

cctccccagc ccaagcctcc agggcctgtc 1260 

tcttctctcc tttctccttt cggaagccgc 1320 

gctggagagg cccgagtaaa cacgccactt 1380 

ccctgaagag tcctcaccaa acactcacca 1440 

tgcggctgct tttcgcagcg cccgagagga 1500 

ggagtggaag cctgggtggg ggggcggatg 1560 

tctgtgtgtg tgtgtgtgtg tgtgtgtgtg 1620 

tgtcagcgtg tgtcgtgtgt gccaccggtc 1680 

acgactccct ctcctgggtc tgctcagtcc 1740 

gaaaaccggg gaggaaagta agtaaatggc 1800 

atttgttgaa tgccgcacga ttaggacaca 1860 

gcctgcagtt aaggtgacag gagagcgagc 1920 

tctcggcggc ttcgggccac ggagccggga 1980 

accccaggac agggcgaggg ggaatagggc 2040 

agcagccgaa gggtacgcta gactcttctt 2100 

catcaataaa atctgcgcga cctgaactcg 2160 

ggcagccggg acagtgagga cgctgcaagc 2220 

tcaccctctc gttcctgcct gggacccagc 2280 

tcccgggaag caaaggcctc cccggcccta 2340 

ctccccgggc tggggagggc tgccggctct 2400 

aggaaacctg ggtcagaggt gaaggaggtg 2460 

ggcgaggttt catttcgcag gagcgcgact 2520 

ccctgagtcc cagaccagcg aaagtcggct 2580 

cgagcattaa agaagcttgg actcaagact 2640 

cgggggggcg gggggggggg aagtgattct 2700 

aacctctcaa ttagcgaccg agtggattag 2760 

cagccgtaat cttgttgcta tttcccccca 2820 

cccacccctc caacccccat ctcgctgcgg 2880 

caaggagcga ggagtgaatg ccactgacgt 2940 

actcctggaa actcctccgc acaaacaaaa 3000 

tcaatatttg gggaggggag ttcaacccca 3060 

gagaggggtg agagtgagat ttcctcgttc 3120 

tgctgttcta cctcgtgggg agcctcactc 3180 

gcggcctagg ggtgtggggg ccgagactga 3240 

aaaaccgaga caactcgcag gtcgccattt 3300 

acttgctgcc ctccaagacg cgggggggcg 3360 

ttccccgcct ccccacaggc ccctgttccc 3420 

aggacctctc aggccggcgg gtgcgttccc 3480 

agcggggctg tctgcgtgga gggagtcccg 3540 

cagagcttct agccagtatt gatgcgtcaa 3600 

cgatccctgg cattcccctg tgcccacccc 3660 

cacttctggg gggaagtgcc tccagctccc 3720 

gttctgacag ttctgacacc tccaaactct 3780 

ccggcgcgct ggaaagggtg aagactggca 3840 

cctgcgtcac tttgtagtgc cccaaacgtc 3900 

tcccttgtcc gcggagctgg caaacgggtc 3960 

ccggccgccc agtccctggt gcctccccca 4020 

agcaacctgc cggttcccgc taacccggca 4080 

aagaagatcc cagagccgcc ccgtggcacc 4140 

ggaagccgcg gtgccctcgc ccttcctcgc 4200 

cctgggttca gtcctcagtc ctcttccgtt 4260 

tctctctctc tctctctctc tctctctctt 4320 

aaaatacctt taaaataaca ttgt 4374 
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gaccccaccc tctgtgagcc ccgctggtac 60 

gacctggcgt ggagggccag agggtaaatc 120 

gaaagggaaa attgtgtagt gatgggaaat 180 

acaaaggctg ggctggagct tcagaggctg 240 
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cttgttaaac acttcattaa 



cagaggggac 

gaaggagcct 

gggagaaaag 

actttgagca 

gtgtcacaga 

ccatggccct 

aggattagga 

gggctaaagc 

gccgtttcct 

agcctggaaa 

tgatgattca 

gggaagagag 

ccttctctgg 

ccggcattgg 

gagagagcca 

atggcagcct 

aacctgctct 

ctgtccagta 

cgtgagtacc 

ctgctggaca 

aagatgaagg 

cgcatcattg 

atgccgccca 

gagatcgcca 

atggctgatc 

ctgctgcgag 

gctccccagg 

tcccccaccc 

gctgttcttg 

ctggggatcc 

ccctgctctc 

tgcctccctc 

gctgagaact 

tgtgtgcgcg 

ccatgtttcc 

ggcgatgggc 

gttagggccc 

tgaggccggg 



cctgaggggg 

ggatctcact 

ctaaaggaag 

aatctcaccc 

gacatggtgg 

acctaaacat 

accccatcct 

cagtccttag 

gatgtgggca 

ggtgccagtg 

cccccacatc 

gagagggagg 

gcctggccac 

tcccaggcag 

gtctgatcca 

tcatgaaagg 

cagtagccta 

ttgagcagaa 

gggagaaggt 

gccacctcat 

gtgactacta 

actcagcccg 

ccaaccccat 

acagccccga 

tgcacaccct 

acaacctgac 

agccccagag 

tgccgagagg 

ctccaaaggg 

cactcttctt 

cgcacccgct 

ctgcccctgc 

ggacagtggc 

cgcgccagtg 

tctcaataaa 

tgggagtggg 

tgaactccct 

gtcc 



gcggcactct 

aatgaggcct 

tccaaacaca 

agggtgcaga 

ctctctgaga 

ggattaaatt 

gagttatcct 

cctgaacctt 

cccctggaag 

gccatgtgat 

caggtggggc 

ccagcctttt 

caggtcttgg 

ccagccaaaa 

cagttagccc 

gaaggccaag 

cgccgtggag 

taagaacgtg 

aagcaacgag 

ggagactgag 

caaggaggcc 

ccgctacctg 

gtcagcctac 

ccgcctgggc 

ggaggccatc 

cagcgaggac 

actgtggacg 

ctgagtgttg 

actagtatgg 

ctccgtggag 

gcagctgttg 

tcctcccgac 

tgcctctgat 

aggggctgga 

caagaccgag 

gttcccctgt 

actggaatct 

aggtggctca 



seq_prot . 
gaaagctgcc 
ggaggangga 
gtctggagct 
aaggagtttc 
cttagtgttc 
cgatcgtgat 
cacctgaacc 
tatgggctct 
ggcactgtga 
gccagccccg 
ccacgcccag 
tacctttact 
gccctggtcc 
ggcaggccaa 
gccgcccgcc 
ctggcagagc 
aagggcgagg 
gtgggcggcc 
gagggctcgg 
ctccagggcg 
ggggacgccg 
gccgaggtgg 
caggaggcca 
ctggccctga 
tctctggcca 
tcctacaaag 
gccgacaacg 
cccgccaccg 
ggtgggaggc 
agggactggc 
agcgcaccta 
cccaggacca 
cgtaggaatt 
gatgggtgtg 
actgagggaa 
gacactcctc 
gacttagaga 
gtggcccgca 



txt 

acctgcgcat 

accatcttca 

cataggtcag 

agggaattgg 

ccatctctat 

atgaaagtgc 

aaggggggaa 

gtcgaggctg 

aagtggatct 

aacaagaggg 

atttctcctg 

gcagagccgg 

cgccccctgc 

gagcaggaga 

tgtgtgtccc 

aggccgaacg 

agctctcctg 

agagggctgc 

aggagaaggg 

tgtgcgacac 

agagccgggt 

ccaccggtga 

tggacatcag 

acttttccgt 

agaccacttt 

acagcaccct 

ccggggaaga 

ccccgccctg 

cccacccttc 

agagctgagg 

accactggtc 

ggctacttct 

gaggagtgtc 

tgtgtgtgtg 

agcatgtctg 

ctgtctctct 

ccctgacttt 

cgcaagactt 



tctgggagct 

ggtagactga 

aggcctcaat 

tggctatgtg 

ggtcctgtgt 

ttgggaaact 

gttacctggc 

aagcagccag 

gatttgagaa 

ggcagcctgg 

ctgactgttc 

aaagggtgtg 

tcctccccac 

gacacagagt 

cagagccatg 

ctatgaggac 

cgaagagcga 

ctggagggtg 

gcccgaggtg 

cgtgctgggc 

cttctacctg 

cgacaagaag 

caagaaggag 

cttccactac 

cgacgaggcc 

catcatgcag 

ggggggcgag 

ccccctccag 
tcccctaggc 
ccacctgggg 
atgcccccac 
cccctcctct 
ccgccttgtg 
tgtgtgtgtg 
ctgggtgtga 
tccagttctt 
ggacctctga 
tgagtccagg 
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tcaagggatc 
gctcggtcta 
atctcttgag 
ctttattctg 
aatgatagtg 
gtcagcaacc 
ctaaagcata 
ggatccagga 
actgagcaat 
gcatcaacag 
tcggtctggg 
agcatactcc 
cttcagcccc 
ccagccccaa 
gctctccctc 
agagggccca 
ctcctcctgg 
cacaggagcc 
tgctgctccc 
aaggtcttgc 
tcaacctcct 



tgcctgcctt 
acaaggaatc 
tagatcctgg 
aaattatacc 
atggtggatg 
tatctcgatt 
gctctggcct 
ggccctcatc 
taaaatatta 
tcctgggccc 
cagccctgga 
tggaagtcca 
tcctcatcca 
ccccagctcc 
cccactgggc 
gtgagtaagg 
acacagggag 
cagcgaggta 
caacaccagc 
tctgttgccc 
gggctccagc 



ggcctcccaa 
ttttaaataa 
tacaactcat 
tttcctcctt 
gtatttgcct 
tccaattttt 
tggagagatt 
tcccagcact 
tcaacagaca 
aagagatcca 
agaagtcatt 
cctgctggtc 
ctttctcctg 
accaaacacc 
agctgggtat 
agagagacag 
gacactggat 
ggtatcatca 
caggtagatt 
aggctggagt 
aatcctccpa 



cgtgctggaa 
agctttgcgg 
tacttgcagt 
tattggaatt 
tttttaatat 
tgctggtgtt 
tccaggagag 
ccagctgagc 
aaaaaagttt 
tggcaggaag 
gcacatgaca 
actgttctca 
aaagtgacag 
cagaccctca 
aaggtgggca 
atactcattg 
gctcctctga 
ccccattctg 
cagaaaagaa 
gcagtggcac 
cctcag cctc 
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ttacaggtgt 
ggtcgattag 
attgaatgtt 
gaaattttat 
ctttatttga 
tgaaattcca 
tcagagccca 
cagccgggtt 
attgaataca 
tcaagagttc 
gtgatgagtg 
tccaggctgc 
gaaatacact 
cacattcagg 
ggaggccctg 
gtgagcatct 
gcatgcatca 
ctgatgagga 
cactttttat 
aatcacaact 
ccaagtagct 



gagccaccgt 
actaattcat 
aaggtctgtc 
ctttttatga 
caaaataaaa 
aaattgagac 
gaagggagca 
atggaacatc 
aaactcaaag 
tgcttcaggg 
ccaggaaaac 
ccatcagctt 
catcaccaag 
ttggttccca 
ggagctacca 
ccagtgtgcc 
gctcgaatcc 
aaccgaaggt 
tttttaagac 
cacptgcagcc 
gggaccacag 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2534 



.60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



gctcatgcca ccacacccag ctaattgttt 
tgcctaggct ggtcttggca accagagtga 
ctcactgtaa ccttgaactc ctgggctcaa 
cgtatgccac catgcctggc caatttttat 
tgttgcccag gctggtcgaa ctcctgggct 
ctgtggggat tacaggcata ggccactgca 
ccgaaataat tagaaaaatt taaccagcca 
acctgcctca tgtgttcaaa gcctgacacc 
agggagggtg agggacaaat tgaaggtcga 
atatcttagc tgggcttttc tatccctaca 
aagaggtcaa gcacagtgcc tcacatctgt 
aggattgctt gaacccagga gttcaagccc 
tacaaaaata aaaattggcc aggcatggtg 
aggattgctt gagcccagga gttcgagatt 
gagtgagatc ttgtcccccc accacaaaaa 
tggctcacgc ctgtaagccc agcactttca 
aggagttcga gaccagcctg accaatatga 
atcacttggg aagccgaggc aggtagatca 
aatatggtga aaccctgtct ctactaaaaa 
gcctgtagtc ccagctactc aggaggctga 
gaggttgcag tgagccgaga tcgtgccatt 
tccatctcaa aaaaaaaaaa aagaaaagaa 
tgatcttggg acaatcactt ttcccttctg 
gattaaacag atggactcgg aaggccctca 
aacagccatt aattaagcac tcccactcgg 
gctaaagtgg ggagggggct gggccccacc 
tgggggagag atttaacctg cacccccaca 
gccactcacc cccacaaagc tgggcagctc 
actcagcttg aacttgtcgc cctcttggca 
agccagcgcc tgctccagag aactgcacat 
gatgggtaca ctgctgaggc tcttggggcc 
ctatgccccc aagccacctc acaaaacccc 
ccccctcaga ctgtggttcc ctgaggccgg 
gacgctaaag cttcccttgt gtctgggagt 
acccccctga gggcagggct ccccatttta 
ccagagtccc attttgatgg catctgcgca 
ctggcagatg ggggctcctg tgcagtctgg 
aagctccctg aaggagtcgt ttccctcctt 
tctctctgcc atctaactgg aagctccctt 
agctcccaga gggtggggaa tgtgtctccc 
agacaggggc atggataggg tgggggcaac 
ttggaattag aattaaagat atgaaatgtt 
tcagcggcaa ggcctctgag aatactcaga 
gtgaggccag aggagctatg ccagaggcag 
gccttcaccg aggcaggggg cagagctttg 
tgagacccct gcctcttgca ccttagtcct 
aaaaaaaaaa aaaaaaatct caggattctt 
gctcctaggc ctagaccccc aaagaagggc 
cagagcaacc cctgcactga gacctcacag 
cgggcgcggt ggcttactcc tgtaatccca 
cgagatcagg agatcgagac catcctggct 
tacaaaaaaa ttagccgggc atggtggtgg 
gaggcaggag aatggcgtga acctgggagg 
ctgtactcca gcctgggcga cagagcgaga 
aaacccttca gtcaacactt cagcacttca 
agaggcccag gttgatccca ccagggaagc 
gagacccacg gggatcctcc ccaccctgcc 
ggtggcagag acagacccag gaaggagagc 
atatgtgggc cccactggcc cccaacttgg 
tgatcccacc aaaccctggc ct£ttgccat 
cccagagagt gccagctcca ccaccagctg 
cccctgagcc cacccaggcc acagtgggaa 
ggaccggggt gaggggaggc aacccctgct 
ttctcatgac atgcctgtga gtcacagccc 
tgagacttca tggtccagtg ctcgtg { taca 
aacctggggc cacccttagg aagctggggg 
gagaaggaca ggactgactt cagggcatgg 
ccagaatacc tcttaatgag gtaatgaggt 



seq_prot.txt 

atttttagta gagacaaggc ctcgctatgt 1320 

tctcactttg tcacccaggc tgtattatag 1380 

gcaatccacc caagtagctg ggactacaag 1440 

ttttattttt gtagagatgg agtcttgcta 1500 

taagagatcc tcctgcctta gcctcccaca 1560 

cccagctgaa aaacactttt ttaatactag 1620 

caaaaacaaa attaaaaaaa ggaaagaagg 1680 

taactttggg aatcaccaat gagtcagtat 1740 

tgcatttctc cacaaaatgt cccagactat 1800 

tactttctta gtttggcata taaaaatgga 1860 

aattccaata ctttgggagg ccgaggtggg 1920 

agcctggaca acatagcaag actccatctc 1980 

acaatactcc tgtagttcca actacttggg 2040 

acagtgagct atgatccagc ctgggtgaca 2100 

aaaaaaaaaa aaaaaggagt ccgggcacgg 2160 

gaggccgagg agggcagatc acctgaggtc 2220 

tgaaaccccg tctctactaa aaatacaaaa 2280 

cgaggtcagg agttcgagac cagcctggcc 2340 

tacaaaaatt agctgggcgt agtgggatgc 2400 

gacaggagaa ttgcttgacc cctggaggca 2460 

gcactccagc ctgggcaaca agagggaaac 2520 

aagagagaga gagaacgtgc cagtgctgtg 2580 

ggctctggtt tctcaagtct ccaatgacag 2640 

cagcacatgg ttctctggaa tgctgcagga 2700 

gcctcatttc cactccagtg gagccatgaa 2760 

ctggcacctg catcgacagg cacccggaac 2820 

ctcacaccag gacctccccc acaggcctgt 2880 

cttgtgcaga agttccttca tttccccctt 2940 

ggagtacttg tggaaggtag tgaccagcac 3000 

catggatctg tggctgcaga ggtgccaggt 3060 

ttagctcagc ctgtaggatc cacttctgcc 3120 

tctctgaatg ggaaggcagg ggctgcatct 3180 

aggcgatacc tccatctcac cctgagggca 3240 

caccaagcct cctctgatgg cacttgggac 3300 

gtgtctgcca gaagggaaat gacagggtga 3360 

ggctctgggg agcagggcca catgcaccgg 3420 

caggagccat ggctacagcc taggagcagg 3480 

agactcggga ctcctgaagc cagaggcatc 3540 

gggcaggagt gtttcctctc tcagactagg 3600 

ctcaccctga aggcccctga gcctctaggc 3660 

ttaacattct ctccaccagg aaggatcctg 3720 

cattgccttg ggatcaggat gggacagacc 3780 

ccagcctgga aaggttaggc aggtttcagg 3840 

ggcatgccct gggcccatca tgccagccct 3900 

ctgcttgggg ccatgaccca ggcagagtct 3960 

gctagaatcc aggactttgc aaaaaaaaaa 4020 

tttgcagcaa cgcccacatg cacacctcct 4080 

ccctagactg agatatccag gggagggcct 4140 

aggaacctgg agaaaaaccc ttcagtcggc 4200 

gcactttggg aggccgagat gggtggatca 4260 

aacatggtga aaccccatct ctactaaaaa 4320 

gcgcctgcag tcccagctac tcgggaggct 4380 

gggagtttgc agtgagctga gatcacacca 4440 

ctccatctca aaaaaaaaaa aaaaaaaaaa 4500 

cagagagttc ttccaagggg gaaggcccag 4560 

tggaatgaga gaactgtgga gacaaagaca 4620 

ctcagcagga cagaggtact caccagacca 4680 

aaggcagcca ggctcccagg gtgaggattt 4740 

cattttaagg aaaccaaacc tgcctcaacc 4800 

tcccactcat tccctcccag ccccctccct 4860 

ggggagggct ctaggccagg tcaagggcag 4920 

gtgggaggtg tcgtggggac tgggcatc<;t 4980 

ctggggcctg gagactggtg ccaggcctca 5040 

actcctgtgt gagaggctgt gagcacacgc 5100 

tacatgcgcc tgtgtgtgtg agagagaagc 5160 

ccagaggagg cagctggggg acatcaggag 5220 

atgaagtgga gtgactttta atggccagca 5280 

aactaagtgt gct^agaa^t acctgggaag 5340 
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gtccctctgt gtaggaaggt gtctggaagg atggggaagc agtgagaaat ggggttgctc 5400 

tacacacagc cgcagttggg ctgggccaca gtctgtgggg agcataagga agtcccaagt 5460 

agttaaaggc tgaacaggaa aactttggcc tcctcttctc cagcttgagg gctgggtggg 5 520 

tgggtagggg tacatcccta cccaaggctg aaaacaccag ctggacagct tcccaagcca 5580 

gaatccaggc ctgaagcaca gccctcagcc acagccctct ctcactgccc cccacacaca 5640 

tgtcctcccc agacccggag aaaggcaaac actgtggcca ttcagagccc agcccaaggg 5700 

agggctgccc cctcccaggg ccccttcctc cttggccctt tgtctatccc ggcccgtctg 5760 

ggcatcaggg caccgccctrt cctgtggccc cagctcccct ccctgctcca gctgtcgagt 5820 

agcttcttcc cctcccccag cccctctacc tcagccctaa gtgactcacc cgaggttcca 5880 

ggcccagatg cccaggcagt gactcagcgg cacaccctcc tcccccaggg ccagggcagc 5940 

agcagtccca gagcagtagg tggaagcccc agcccagcct ggcagggaaa gcactggcac 6000 

agagacccca aacaggccct gccaggagga gctttggata atgacatggc aggaggatca 6060 

gactgagaac aggctggtcc cttcctagcg gtcaccgtgt gacttggggg ccagcctgag 6120 

cagtgtacat gaagtcaggc cttggggccc ccagtgtcct cttctacacc ttggcacagc 6180 

caccgagtgt cccttgctcc cctcagtact tgaacccctc atgctacttg tcttcccatc 6240 

gaggcctact tcaagtagaa ctcggaataa aaacccttca cactgagggt gagggacaag 6300 

ggagcgcaat gcatcgatrc ncacagccat tctttccttt taactaaggg cctccacagg 6360 

gccttccctg ctcagaaact gcagtgcatc cccatggctg tcccttaaag ccaggcggta 6420 

tgttctcact cataagtggg agttgaacaa tgagaacata tggacacagg gaagggaaca 6480 

ccacatacca gggcctgtgg gcagttgcgg ggagaaggat gggggcaagg ggagggagag 6540 

cattaggact aatacctgat gcatgcgggg cttaaaacct agatgatggg ttgataggtg 6600 

cagcaaacca ccatggcaca tgtataccta tgtaacaaac cacattctgc acatgtatgc 6660 

cagaacttaa agtaaaaata aaataaaatt aaattaaaat taaaaatttt taaaaagcct 6720 

ggcagtgagg cgctccagac tcggcctgct gctgcgtgtc ccttaaacac accggcacat 6780 

gcaactagtg tggccaagac cctcggcttg cgctgacccc tctgttcaga acgtgcccac 6840 

acatgcccat gccctctatc ctgctgaaat ccaacccccc cttcaaaggc cactgcaggg 6900 

agccttccca gggcccaacc ggattcagtg tctacctgtt ctgtcctgta caaagcgccc 6960 

cttttgccct tttcgtgtgt acttagagaa ttctgatttn gactgaatca cctgtgtact 7020 

tgttttctct cagcctcacc cccacatggt gagctcaccg gtcttactca tttctgcatg 7080 

aggtaagggt ctcgcccaca ccaggtgcca accagtgctr acggctacca catggcaaca 7140 

ggggtcactg gaggagtgct taggagcaag ggagtccttc tcaccccgtg aaatcacacc 7200 

actgactatg ctacgaaata atcacctctc ctctggatcc caagagtgga accgagacaa 7260 

cgggtcggaa aagagcttca gagagctaca aatccctgag gagtgacagc aggaaactgc 7320 

ggctcccagg acccagtgag ggagttgagc aaacccccca gcgggcagag acggagacag 7380 

gcaggcagac gaagaagaca aagagtcagt cagatggaaa agccagggag aaaaacagag 7440 

acagcagcag agccgcgggt cactagggtc agctcctgaa ggcctgtcct taatcaggtg 7500 

tctctgtctc aggaactgag gctgcgactg cgcccatcaa atggccatat acctgagact 7560 

ggtgataatc gtattcattt gccactaact tgctgcgtaa ccttgggcaa ctccctcccc 7620 

cacggcaggg caggagagtt gtacggaatg gcccctagag tccgatccca atcccactct 7680 

cacattctcc agctgtctac aaaggcagag tagtcaagac acaagcaggc cctgcggaca 7740 

tccaaagcag tgactctgcc ccggcccagc ctcaggaagc tgggtcctca cattccgtcc 7800 

accccatctc cgggctccag gtgccagaaa ttrtccacca gcggccccgc gggattttgc 7860 

ccagctgctt cgrgccctct ggtggctaag gcgtgtcart gcagtgccgg cctcctgtca 7920 

tcctcccttt ctrgtccgcc agaccctctg gcgccctgct tacgactcaa acaggagaca 7980 

gtgctgattc atttccaagc ggccttccta cacccacacc tgcttcacat agatgaggtt 8040 

tcccggacag tccctgccca gaagcccagg tggatgtctg agacggcagc cagatacctg 8100 

gaagccattt taaaagccta gggatatggg tactcgggag ataggaggcg gaaagaccac 8160 

ttgagcctag gagtttgagg ctgcagtgag ccatgatcgt gccactgcac tccagcctgg 8220 

acgacagagc aagaccctgt ctcaatcaat caatcaatgc aagcctagag ataagggcag 8280 

gagaaaacca ggggacagaa gtgagggatt ggccactgcc aagttctgcc cacccacccc 8340 

ctttccccag gccgttcagg ggaaagagaa atggcctcat tgtgggggcc acaggtttag 8400 

tttaagtcct gacattgcct ctggcttact gtrgtgactgt: cgaccttggt tgagtggtgc 8460 

cacgaacctg agcttaaacg ttttcagcag aaggaccccc tttattatgc cccgccccac 8520 

aaaataagtt ctgaggtatt ctgtcttagc aaaatgcatt aaataattta tttcccctgt 8580 

tgtactaatt ggatxcataa gtgtctcccg gcagcatgac cagcatgcga ggacccacgt 8640 

tatxacattg ggttcaggtc atcctggatt tgtcactcac aatgggcgtg tgcatatcct 8700 

ttgctggggg gtgcagactg aaattttgaa gcactcgctg tgcgtcaggt gctctccgta 8760 

cattacccca tttaattcac acaacaatcc caggagttgg aataagatga ccccacttca 8820 

c.agaggagaa cactgagact caaagtctgc caaattcaca caattagtaa ggggtgggga 8880 

agcatgacga acacaggtgg tcacaacgtt aaacctttcc tagaagctcc aattattccc 8940 

cgataaccct gagagcccta acatttcatt tccattcaga ccagcctcag tctctctgcc 9000 

catcctctct ccaccccaca ctcagcctat ctgtctcatt tctcaacctc agcaggcgcc 9060 

agaagcccat cttctcccca tctctgattc tggcatcttt tcctcttgcc ttcaaatgcc 9120 

ctgtggactg gattcttttt aatttttgtt tacttactta tttgagacat agactcactg 9180 

tcgccaaggc tggagtacag tggcaccatc acagctcact gcagcctcat gtgatcctcc 9240 

cacattaacc tcctcagtaa ctggggctat aggtgtgcgc cactgtgcgc atctttttgc 9300 

atcttttgtg gtgatggagt tttgccatct tgcccagacg ggtcttgaac tcttgagctc 9360 

aagcaatcct cctgcctcgg cctcccaaag tgctgggata ccaacgtgag ccaccatgcc 9420 

— i — ----- - - seite 14- ■ -------- 



seq_prot.txt 

cagcctagcc cggattaaat ataaatataa aagagactgg gcatggtggc tcacacctgt 9480 

aatcccagca ttttgggagg ctgaggcagg cagatcacct gaagccaaga gttcaagacc 9540 

agcctgccta acatggcaaa accccatctc tactaaaaat acaaaaatta gccaqqcqtq 9600 

gtggcacaag cctgtagtcc cagctactca gggaggctga gccaggaaaa tcgcttgaac 9660 

ctgggaggca gaggttgcag tgacccgaga tcatgccact gcactccagc ctgggtgaaa 9720 

gagcaagatc ctttctcaaa aacaaattaa aaatgaacat gccacttaac ctaatatttc 9780 

tactgccaaa catttacctt atgttgttat ttttccacca gtatgtaggt atacaaacag 9840 

atatttgcaa gaaggaagaa atggacataa gaaaaatttc catcaataaa ggattggqct 9900 

gggcatagtg gctcatgcct gtaatcccaa cactttggga ggccagggca ggaqqatcac 9960 

ttaagcctag gagttcaaga ccagcatggg caatataatg a yy««.««- ^ou 

<210> 12 

<211> 4449 

<212> DNA 

<213> Homo Sapiens 

<400> 12 

ttccttgcac atctgagagg taagaggagg tgtctgtggc ccccctgaca catacgtgac 60 

atatgtatta ttacggctca tgtgtgtgct gctgtgtgca gggggcaggg ggcttcccgt 120 

tcctgatggt tcttcttggt gggtgtctcc cggcctctca ggcctcttta tctgctgaac 180 

gtggtagcca agggtttggg aatgatttgc cagactggtt aatgtggaaa gagctggccc 240 

cgctcagttc tgcacctccc tcttcctccc tggaaccgtc tggctgcaga gagcaggagc 300 

tttccagagc tcccggggga tcccatctcc agtataactg tgcaacgtgt gcccagcaca 360 

agagcaccag atggagggac aggtgggagg aggaggaacc cagggtgcat ttagcttggg 420 

gcttgctgag ctgaggtccc cagagcaagg acgactaggt tggggggaca ccctctccca 480 

atccttgcca agctacagag gggccagtgg gaataaagtg acattttggc ttctgtatcc 540 

ctcctctgct gttctatttt ctagttcgtt cagatctgag gttggggagg ggttttcctg 600 

gaggggtcag tccatgccac cctctgcaat ttttattaca catgctttca qacattctqc 660 

tacaggtttc atctcctacg ccaatttcta tttgggaaag caaataaacg gaaagctaac 720 

ttgtgtcact tggtcgctgg ggcatccgca gtgagtagtt aagaaatgcc aggggagtcg 780 

gtgtccattt tcatgcctgg acaagagtgc gtcttggact tgcgtcctgc tgcacacccc 840 

accgtcccgc tgcacacccc accgtcctac tgcacacccc actgtcctac tgtacacccc 900 

accgtcctac tgcacacccc actgtcctac tgcacacccc actgtcctac tgtacacccc 960 

accatccctg ctttctgcac tcagctgccc acttgtatcc cagctcagga agcccagaag 1020 

atgcagaacc tctgcgagag cccagggtga aacgctgcgc ttcactttca aagaaaggaa 1080 

aatcattcat attctttaaa agaatgaaca gcacagatta atatcgatct cttttaattt 1140 

ttaggccaat tttgagtagt caaagtcaga gcagtcaatc tgtgttgtga gccgaggcac 1200 

agctgcagaa gcgtgtctga ggtgtccggt ggaggtggca gccgagctct gggactaatc 1260 

accgtgctgg ggacggcacc gcgtcaggat gcaggcagat ccctgcagaa gtgtctaaaa 1320 

"S?"" 00 tctacagggg tgagggggag ggagaaagag atgctttagt gaggataaac 1380 

attttctttc acatttaaac aattacagag tttttacttt aaagtatcca taggcacatt 1440 

ctttagaaaa catgaactgt cagccgggca cggtggctca cgcctgtaat cccagctctc 1500 

tgggaggctg aggcgggcag atcacctgag gtcaagagtt caagaccggt ctggccaaca 1560 

tggtgaaacc ccgtctatac taaaaataca aaaaaaattt agctgggcgt agtggcacac 1620 

acctgtaatc tcagctacta ggaagctgag gttgaactca ggaggcagag attgcagtga 1680 

gccaagatcg caccattgca ctccagctca ggggcaacgg agcgagactc catttcaaaa 1740 

aaaaaaaaaa aaaaagaata catgaaatgt cttcagattt cgtcatgcct tccccctcta 1800 

tcctaggcaa gctagaaagc gttaccaaca gtggctcttc ccaggctttt ggttagagat 1860 

gtgaagagaa gccgggggga aatcaggttt cttcccaagt cccttagccc tgcctttcta 1920 

ttcctggacc tgaatgcagc ctgactcagg ctaccccatt gcaccaccac tggcggccgt 1980 

gactctgtgt aaaggcatag ctggtgatgc tgatcagagc ctctgtagtc ttaaatgact 2040 

tttctaacta attctaaatc ttcagaaccc atcgtataaa aaggccatac cttctggagg 2100 

gacgtcgatg gtattaggat agaagcacca ggggacccca cgaacggtgt cgtcgaaaca 2160 

gcagccctta tttgcacact gggagggcgt gacaccagga aaaccacaat tctgtctttc 2220 

acggggggcc actgtacacg tctctgaaag tgcacaggta agaagcaaag taagttgtgg 2280 

gctgaattcc ttgatgttat catgcacaca cccatccagc ttccttctcc aatgacatca 2340 

gcaactgtcc agtgaggcgg atataaaacc ctcaggacat gagagggaga cgtggtcctc 2400 

acatcctgat gtgcaaacat tacgctcagg gaaaatgcaa ggtgccccag gtttgtggac 2460 

tttgcatctt tctaggtaac ttatttattc actttttaat ttcaacaaat gattattaaa 2520 

ttttactcaa tacataaata tttactgagc accatttgtg tgcatgagaa gtgggagcta 2580 

gcatggcaaa agccaggcac tgtgccaggt gagagagacc cagaaattaa aaccagagaa 2640 

gtcattaata agagtctaaa tatctgggcc caggctcatg cctgtaatcc cagcactttg 2700 

ggaggctgaa ggaggtgaat cacttgaggt caggagttca agaccagcct gaccaaaatg 2760 

gtgaagcccc atctctacta aaaatacaaa aaattaggcg ggcattgtgg cacacgtctg 2820 

taatcccagc tacttgggag gctgaggcag gagaatcact tgaacccagg aggcagaggt 2880 

tgtagtgagc caagatcgca ccattgcact ccagcctgag tgacagagca agactccatc 2940 

tcaaaaaaaa aaaaaaaaga gtctaaggat ctgatggagg agaaaggcaa gaacatgtgc 3000 
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seq_prot.txt 

gagacaacgc aaggccatcg tcccagggtg cccagggtaa ccacgggggt agggcactcc 3060 

ccggagaggc caatgacaag caggttgaac aaagcagggg ggctccctgc aggaggaggt 3120 

tcaccaggtg aagatggagc cgcatgggca aaggccattc cagagaccca ggtgtgttca 3180 

ggaggtggaa acccattgca ggtaaggtga gaggaccggg tggggtggtc taggaggagc 3240 

cgacagaggg tacaagctgt gaaacagctt gaagcagggc agtgaggaaa gggatctaga 3300 

ggaggaagac acgtggacag atggggctgg ctgggggctg ccgcaggatc ttatgcaaga 3360 

ggttctaaca ccagagcttc aggctctgag ctccgcggaa tcaaaggtct cagaaagcaa 3420 

cctattagga tctggtggtt gacagcctgc agcagggggt gaaagaggag cccagagcac 3480 

cctccggtcc ctgtccctgc cttggggcat aggaggggag gaactcagtc tggccacact 3540 

ggctcaggtg agggcgcccc aggggaggcc gagaggggct gttctcttgt ctgctttgct 3600 

atcagggact gcctcgagat gtctttggag aaagtgttcc tggcttgctg ggaaggatcc 3660 

gtgttcagct cccgctgccc agcagcttcc atgggaacct tgccctcctg gggtctggat 3720 

ccatcgcgac gtgaaggtga tcatcgccca cccccgggac actgtggggg tcaagagagg 3780 

ccccgtggtg agggaggatc atcatcctgg gggtcggggg ggtttcctcc tcagggagga 3840 

agatttccag ccccggtgct ctgcctccgg cagactctgt cttgcccacc cgctttgtct 3900 

ttgtcgcatg atggaaataa atggaaatgg tttccacaca cgaatgcact aaactgtaat 3960 

cacaattttc tagatttcag ttgtaacagg atcatggacc tgagtgaccc gcaaagacgc 4020 

tctgagcctt gagccagagg gtgtgtgggg tggggagggg agtttgccac ggcctctgtg 4080 

atcccacagc acagggggca gagctggggg ctggggtggg ggcaagggcg caggcagatg 4140 

ggcctggggg tggccagcac ggggacccac ccagttcgtt ctgtacaccg aggccacttc 4200 

ctctctcggg ttccaaagct cccccgctcc ctgccatggg cttgggggtc ttcccactgc 4260 

agcccaatgc tggctgctct tctacgctgc ccaagcccac ccctcaggcc cggtacccca 4320 

cagcagagat caagaggtgg ctggagggca gtgggggcac ggacagcacc actgggcgct 4380 

ccctcttcag gcctctctgg aaacccttgc tttggaaacg tagaaagccc tttctcctgc 4440 

ccccacctt 44 49 

<210> 13 
<211> 4296 
<212> DNA 

<213> Homo sapiens 
<400> 13 

agaaacccac atacttctcc attcttaaga aaaggaagta gtggtgacat cattctcggc 60 

taaaatgtag tctgtttata tctcaaatat tcggtaacaa tgttttctcc agagttatga 120 

agcaactaga acaatcaaat acaattttct tctaaatctt tattgcctga ttcattcatt 180 

catccaacaa atatttatta aacatcgatt atgtgtttga tgcttaggga cacaatagta 240 

agtggaggga aagatacata atacctgccc tcaagaaatt tggagttgag tggaggatag 300 

aaatataaat taaagaatga cacaaataat tataaagtta cagctgttaa aagaaaagca 360 

tatggtgcca agagaacgtg taatacaaga tctactcatg gaggtgaggg aaagcttgcc 420 

catcaaagaa gttatgattc aatccacgaa gaccaggagt tggctgggtg aagaaaaaaa 480 

ggtcagagga aggaagtcca cactggggaa ggctctaagc ataaagggta ggaggattac 540 

agaggcatat tcacgaaatt tggagaaggc tttcagtaag caaggagaag ccaaatgaaa 600 

gtttacggga gagttggagg cttgaagaca cgttcaagga tctggttttt atcttctctt 660 

tatctcaaga gcagtgggaa gccattaaat gattttaatc agagggttgg tataactagt 720 

tttgtatttt gaaaagctga attcagctct cgtttgagaa actgagtgaa agagcccaga 780 

acggccgtgg ctgagggtga ctcgtgggag actcctacac aagccatggc agtggcatgg 840 

gctggtggca gaagagggaa tagggagaag atttggaact caatcttcct ccattgacaa 900 

agtcactcca gctttggcaa ggcaattaat tggtgggaaa gaagatgcct agccctcctg 960 

atttcactgc actttctgca tcttcaacat gagtactggg aagtggcaaa acatccagag 1020 

gcagcttggg tgctaggtgg agcatgagtt aaaattccag gatgaagcaa atigaacactt 1080 

agaatgacag gaaagatttg ggagttgggt ttgggggagg gctatttacc tttartccct 1140 

ggagaccctg gcacaaaccc ttgcctctgc aatcttcctc tcaggtaaag gaattcatta 1200 

aatgaattgc tagaagatct actgaccaga gggctgtiaca gaatcatatc tttgagagtg 1260 

ggaagtaggt tgatcacata gtttattatc caatcaggac atatctgaaa gagaaagggg 1320 

gttctattaa tatttaaact acaaaacatg tacaccagga atgtcttggg caaatctggt 1380 

tgccctagca agaaaggaaa tttgaaagtt tatactgttc tgctcccatg ttaccccgtt 1440 

tgcacatgag agggtaagta ttctctttct tcacctgcat taagggaata aaagcacaag 1500 

cattcaggtg actcccaacc cacttttaat tttacagttt ctgctatact ctatacattx 1560 

tgaaaattac atttcccacc actatcactt cgtgataggt gatcatttac aattactcac 1620 

tgactcagtc ccgggaagag gcggtgcaaa atgggacgct ctatccaggt gctcattaga 1680 

aatgcagaat ctctgcctgc ctcctagacc tactgaatta gaatctgcat ttttaaataa 1740 

gatttccagg tgatcaatat gtacattaaa acttgagaaa aacctctaga cttcgaccta 1800 

aagaaaaaca ttttacaact tgacagtgta tgcacataca tadatgcata tagacacaac 1860 

tgaagcacaa atttaatgaa gtagaattta ccgttactat tttatttggg aaagaaatgt 1920 

gctcgcgact caatagattg gagtattcac tcctiggatct caacttgcaa tttgaaaacg 1980 

catctctaaa gcacctaggA gcaatctgaa gaaagctgag gggaggcggc agatgttctg 2040 

atctactagg gaaaa cgt gg acgttttctg jttgttacttt gtgaactgtg tgcacttagt 2100 

~ Senate 16 



cattcttgag 
gcagctgaaa 
acgagagagc 
cctgcccagc 
tgagtcactc 
gccggactcc 
caggccccct 
cccggcgcgg 
ccgtccgctg 
gctgctcagg 
cccaccgcac 
gggtcttcct 
ccccgggcgg 
tgcagggatt 
ggcacgtttg 
agcccccacc 
agcggcccgg 
catggcgatg 
ctcagtcctt 
gtttttggaa 
ctgacgcctt 
cactgaatgc 
cttcagggag 
ctaactaggc 
aaaaaatata 
atataatatt 
agtgtaaaac 
tctgtcacca 
ttcaagcgat 
cgcccagcta 
tctcgatctc 
gcgtgagcca 
tatctgcagc 
ccttagtttc 
atgtttaaca 
tggagatgac 
ggaggcactc 



taaatacttg 
gtcggccaaa 
taggggctgg 
tgttggcgag 
gcgcgcacgg 
ngtgcagctt 
cctggctggc 
ggtccggaga 
cgctgggggc 
ggcctgtggc 
gttcagaagt 
ggggtccccg 
aaacgaggaa 
gtgagttttt 
gtxccctttg 
cccggaaagg 
agcggcagcc 
ccgcgtgccc 
agctcctgga 
acaagrttca 
ggcatggatg 
ctactgggtg 
ttgatggttc 
tacccacctg 
tatatatatt 
aatatatata 
ctctgagact 
ggctggagtg 
txtcctgcct 
atttttgtat 
ttgacctcgt 
ccgcgcccgg 
tgtggtctgg 
ctcatctgta 
tcttacgtat 
ccatgtagac 
attttattta 



gagcgaggaa 
gctctcggag 
acgtcgagga 
gagtttcctg 
agcgacgaca 
ccctcggccg 
gagcgggcgc 
gggcgcggcg 
tcggtctatg 
cgctgcacat 
cgggtgagtg 
cctctccgct 
agtttccccc 
cttgaaaaag 
tgcgagcagg 
gaagtttgag 
tgagctgcca 
acccagctct 
gacggccacg 
agaaaataat 
gatagggagt 
ctagaggctg 
tatgcaatta 
atcgctgtct 
atatatatat 
tgcatgcagg 
tcaaaagtta 
cagtggcgca 
tagcctcccg 
ttttagtaga 
gatccgccca 
ccaaaagtta 
gtaaccgagt 
aaatggccag 
cgtgcccctc 
tttggatgaa 
tccagggaca 



seq_prot . 
ctcctgagtg 
gggctggtct 
gagggagaag 
tttcccccgc 
cccccgcgcg 
ccgggggcct 
cacatctggc 
cggaggcgca 
acgagcagcg 
cgtcctgtgg 
gtccccagcc 
gcgcttgaca 
gcgacactca 
agaaggaaag 
aaaggcgttg 
aagttggtta 
aagcagccag 
ttctgagcca 
ctttccctca 
cgatttccaa 
ggagatgctc 
tagattttgc 
tcgtttgttg 
ctgaaggttt 
aaatatatat 
aatgggggtt 
gatttttttt 
atctcggctc 
agtagctggg 
gacggggttt 
ccttggcctc 
gactttttgc 
atttgggcaa 
cttgaatxtg 
ccctcctttc 
atgtagagga 
tgggca 



txt 

gtgtgggagg 
aggaaacatg 
gctctcgggc 
agcgctgagt 
tgcacccgct 
ccccgcgcct 
ccgcacatct 
gccaggggtx 
gggtctgcca 
acgcgtatcg 
cgggctcggc 
gtcgggcccg 
cgcagcccga 
ttcagttgca 
tgttggccgc 
tctgaaggcg 
cgcacctggg 
cccgttcaaa 
ggccggcttc 
aagaaagrta 
aggtgaaacc 
ctggaacaag 
gaaaccgaag 
ttaaggaaaa 
ataattatat 
cttaactatt 
ttttggagac 
actgcaactt 
attacaggcg 
caccatattg 
ccaaagtgct 
agctatgatc 
gtcatagagc 
ggtaatttta 
ccctggctct 
gttgacagtc 



gcggtgaggg 
attggcagct 
ggagagaggt 
tgaagttgag 
cgggacagga 
cgccggcctc 
gcgctgccgg 
cgggaaggcg 
tgggtcgggg 
ccagcacgat 
ggggcgccgg 
gcaacccggc 
ctcccgtagc 
aggggcgcgg 
gttcgaggcg 
gccggggagc 
ccactccgcc 
aggccagctc 
ttggcccgga 
gcnggctcca 
gagaatccct 
atagtcctgt 
ggttaaaatc 
aataaaaata 
atataaatat 
tgttatggaa 
ggagtctcgc 
tgcctctggg 
cgtgccacca 
gccagggtgg 
gggattacag 
caaataacag 
tttttctggg 
catrcancttc 
tctggtgact 
cacatcaccc 



<210> 14 
<211> 4001 
<212> DNA 

<213> Homo sapiens 



<400> 14 

acgcccagct 
ggtctggaac 
agccacagcg 
tggagatxaa 
ctgggacgat 
ctcatcttcc 
aggaaaaaga 
aaactgcgga 
gttccgcctc 
gacgcctgcg 
ggcctagccc 
atcattgcac 
gggaagatga 
ttccctccct 
gggcacacca 
tttgcaaagg 
ccctctgcca 
catcgggtga 
agaaatgtaa 
tcctcctcgg 
ccttcaattg 
gaaggacaaa 
aaatgatcat 



aatttttgta 
tcctatcctc 
cccagcctga 
aataaaggtg 
gaaaagggtc 
tgtgcacttg 
tgacgtaagt 
atgaaacaac 
ctaccaggaa 
ggcccgcccc 
aaagctagat 
tctttcaaaa 
gatatactca 
gtctcacgcc 
ttcaaagaag 
ccataactaa 
agtaagagga 
atagtgagct 
atgtggagcc 
cctggcgggc 
gattrttctcc 
ggtccggaag 
ttcggaatgg 



ttttttagta 
aagtaatccg 
ttccattcta 
gggtcgggaa 
tgtgacttgg 
ctgtgtgcac 
acggcacaaa 
cgcgagctaa 
gaaccggctc 
ttccgcttca 
aacaggtaga 
tttgattctc 
ttcttgaaaa 
atggtagcgt 
gggagggatt 
gtaatccaga 
accggcctaa 
gctccggcaa 
aaacaataac 
aaacccctgg 
atctagcgga 
ttgtgggacc 
agccccagtt 



gagacgggga 
cccgcctcgg 
tatgaagttc 
tcgactggga 
taggcatcac 
tggcgctgtg 
gtggccggta 
gagatggggc 
gggccggagg 
cgacgttcag 
trtgttttrtcc 
agacgtaccc 
tacctccggg 
ccgcctaggt 
gaggtttgca 
aaaagaaatg 
aggacatttt 



agggctgccg 
tttagcactt 
gccgggggct 
ttagcagctt 
ttcactagga 
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ttcaccacgt 
cctcccaaag 
tccaacaggc 
agagacgtga 
ggagcggtta 
tgtaaatgcc 
cgcggcaggt 
agcgggagaa 
gctgcacgga 
cctgcgtctg 
gacaaattat 
attctttttt 
cttgccttct 
tgcaggcgac 
tcaaaacaaa 
caggcggaga 
ctctctctct 
gaaatgctgc 
ggcctctcag 
ctcacttcca 
gcctggaaag 
gggctccccg 

tgccatgggc 



tggccacgct 
tgcaggcgtg 
aaaatggtta 
tgaaacgttt 
ggggccaaaa 
acctcgattt 
gcatgggaag 
atgaattcga 
ggaccacacg 
gaactggaat 
caaacgaccc 
ttnttcctcc 
gcacacttct 
ccgcggggtg 
tacccctgcc 
atagcagcct 
cctcccctct 
tgcaagaggc 
attgcgacgg 
cgactgacag 
atcgctccag 
gatcaccccc 
tctaaaatat 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4296 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



acagctatga 
tcttttattt 
taggccggct 
cggatggtcc 
atgatccctc 
cccaccccca 
ctcagagagg 
gtcgtggaaa 
gctccctgga 
gcacatggga 
ggacagtcgc 
ggagttgggg 
cattttgggg 
tttggaggaa 
tgctctcagc 
cgtaatatca 
ctccttcaca 



aaacgtaagc 
gagtgcctat 
agggagacag 
cctctgcagg 

ggggagcgta 

cgaggagccc 

gcgggttccg 

ggaccgaaat 

gctttcaaaa 

ccagaagagg 

tcgaagccat 

ggagactcaa 

tccttctccc 

tcccgaaagg 

ccaggagcta 

ttcccctagt 

tcctgcagaa 

gtctggaaaa 

gtaagtggtg 

cctctggtcc 

tattgtcccc 

tctcagtggt 

gtggtaagga 

gtccagtttt 

gactgagctt 

ggcttaccta 

gccagctttc 



gttctcaatg 
cagggattcc 
caaggttccc 
taacatcccc 
tagccaagct 
gccccagcga 
actcatcaag 
gcacgctccc 
cggtggctct 
ggggaaaacc 
catgacaagt 
aggagggtgc 
tggaaaattc 
tcctgcattg 
tggatccttt 
caccctaacg 
acccttatca 
cttatgagga 
atcagtgctg 
gtctgaagcc 
cggcgggggg 
ggcgcttctc 
tagggcttga 
tgggtcgccc 
cccgcgacag 
gaattggcgt 
agggaagcac 
cctggctgag 
ccgttagctc 
cacgcttccc 
ggggatccgc 
cgggaggggg 
gagttgagtc 
gaagcaaacg 
gccatggaag 
atggcagtgg 
tgagacccac 
catcttcctt 
tcctggtcag 
taacaacggg 
ctaggagagg 
acattttatc 



tttcgagatc 
ctacgcccag 
caaagctcca 
atcatctaca 
tccatcccac 
ttttcccgag 
ttcagtcagg 
agcccgaacg 
agacttttga 
ccaatcccat 



ccccagaggc 
taactctccc 



aagggcaagt 
agagtcagga 
tgcaagccag 
tgtcaaatat 
cttctcatca 
ttttctccca 
ctctgataag 
tatagaattt 
ggtagcaagt 
tggagaatga 
gcggtgcagg 
gccaagatag 
tgggaacggg 
gcgaaatctg 
caagaggccc 
ccgctgttcg 
aggtgggttt 
gctaatctcc 
cggactgctg 
cgatgaataa 
cccggttgtc 
acccagtttc 
ctcgaatcga 
ggagcctttc 
atacctctgg 
agtgtggagt 
ttctccaggc 
tcagcgcttg 
agttcagcac 
gcagggagct 
gatgccccag 
tagcagtttg 
aggctcagcc 
ctgtttctag 



seq_prot. 
caaaagtctc 
caccgggtgg 
ctcctctgcc 
cccaggtctc 
tcacccccaa 
ctgaaaatac 
agcttaccca 
caaagtgtcc 
gaagctcaaa 
caacccctgc 
aatccgcctg 
ttctcgccga 
agctgtgagg 
gaagggtgga 
gttaaaatgt 
gatactttat 
atttatttgt 
ttctactatc 



aagagctcga 

gatgcagaag 

tgctttaaga 

aagcgttcag 

aaaccttcta 

atccgggatg 

gtagcgaccc 

cctctcctcc 

ctttagctct 

gctgtgcttc 

tccttcagac 

aaatgcggac 

cccagatcct 

tctctccact 

ggagcaagaa 

tgaagaagaa 

ggaacccgct 

ctggggagat 

ctctgccccc 

gaactacagc 

ccaacctgag 

gaccaccccc 

agctgggaga 

tgtcttctac 

ctagccctac 

ggctcctgac 



txt 

agacctcaat 
atgtgcaaag 
taggcgttca 
ccaacaatgc 
actcgctaag 
acggagccga 
atccagggaa 
ccggagccca 
acttttagcg 
gaggctcctg 
ccggaggaag 
cctggtgccg 
gcagaattgg 
aggaagaaag 
cattttttag 
atcacaccat 
tcattgcttt 
ttattcattg 
taataaatat 
aggtgcaaga 
attaccgcgg 
actacaactc 
acctttcacg 
cggcggccca 
gcggtgctaa 
cgggagtctt 
gcgtcggatc 
ctctgcagta 
caggacccag 
tctgaactga 
gtggctggct 
cccctggaaa 
ctttctgaaa 
gcgaacgggc 
gcagaggacg 
gaagtgtgag 
tttgcttggc 
ttctcccagc 
aacttcttga 
gagattttca 
agcggcacgt 
ttccctcaag 
acttgaaaag 

g 



gctttgtgca 
aagtacgctt 
actttgagtt 
aactcctatg 
tccccactgc 
gagcccgtga 
gcgtgtcacc 
gcagctacct 
ccagtcttga 
gcacaaagct 
caaaggaaat 
tagatactaa 
tggaaatcat 
cttttgcgtt 
gaaggctttc 
cttatttaat 
cagtacatgg 
ttgtattcct 
tttttgaatg 
cctgctgcgc 
gactcggtag 
ccagcagcca 
tcccggctcc 
atcggaaggt 
ggaacacagt 
ctgcctactc 
cctgagaact 
tgaagacttt 
ctccagccca 
tgccaccgcc 
cagctgtgtc 
cagagtttgg 
atacaagcct 
cagagttggg 
agggagacac 
gtcgatctgt 
gacccaggtt 
tgcctcgatt 
aaggctgtaa 
ctgtaaaaca 
agccctttta 
gattcagggg 
cataggtctg 



<210> 15 
<211> 4607 
<212> DNA 

<213> Homo Sapiens 



<400> 15 

tgccacccat 

gggatagaat 

gaggagaagt 

atcaatttgc 

tgtgtgcagt 

ctgtctacgg 

tgtccggccc 

tgcctgggcc 

ttcctgggtc 

tgcccgggcc 

cacgcggccg 

ggcccaggtc 

tggagagctt 

cgcccgaaaa 

ccgaggccgg 

cgcggcattg 



ggaacgaaga 

actagtattt 

agagatgaca 

cttttgtgac 

cggcactgcc 

tatcttcctg 

ccatatctgc 

ccgcacctgc 

ccgcacctgt 

ccgeacctgt 

cccccggccc 

cccgccgctc 

gaatacctcg 

cccgcgggac 

aagtcagaag 

tggggcttgt 



cctcaaatcc 
aaatcattta 
aagtagccac 
aaagtaacaa 
gatacccacc 
ggtcccgcac 
ttggggtacc 
ttggggcacc 
ctggggcact 
ccgggcaccg 
tgcccgccgg 
accaggtact 
gagaagttca 
ccctgggcgc 
geggaagtga 
agttcttgtg 



cagccatgag 
ttcggcatgt 
accacttacc 
caaagagccg 
tttccaaggg 
ctgcccgggc 
tgcctgggcc 
tccccgggcc 
tgctcagaca 
cacctgcagg 
ggacgctggc 
tgccgtccgg 
tggccgccgc 
gcagcaggct 
actgcagcct 



gacaactact 
ggtagaggag 
agtttacagg 
acatctccta 
cacctcccca 
acctgccggg 
ccgcacctgc 
ctccacctgt 
ctgcacctgc 
atccccaagc 
accgaggatg 
ggagaacttg 
ctgccgcggg 
gcaacagccg 
atcagcgccg 



Seit£ 18 



tccttacctg 
aagagaatta 
caacagaatc 
taccctcacc 
gaccccccac 
gtcctgcacc 

ttggggtacc 

ctagggtatc 
cccgggtacc 
tgcctccacc 
tcctgcccgt 
cagagtaagc 
cgccaccctg 
acgccggcct 
ccggcttccg 
cgcccacgtt 



1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4001 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
S&0— 



seq_prot.txt 

tcttccgacc agggatttcc gacccgagaa ccttacctca aaggccggga ggcctttgag 1020 

cacctccagc tagggctgct gataaaaatg tagaaagcac agtaaaattt gaatttcaga 1080 

ttcacaacaa atctagttat aagtatgttc ccaaatattg cacgggacat gctaatacgg 1140 

aaaaattact cgctagtctg aaattcaaat ttaattgagc gacctgtgtg tctgcgtgtg 1200 

tgtacacatg catatatata tatttatatt tatatgtaaa tgtatgttta catgtaaata 1260 

tatgtttacc tacaaatata tctttaataa gtaatacggt gtctgtcgca catatattat 1320 

atcgtgtatg taatgtataa gtatttattt cgtttgcttg gggttttgtt tgcttttgct 1380 

gagtccgacc cctctacctg ccgcctggcc cttgcctcac gctccagtgc cactgagatc 1440 

aaggagagaa cgaatttgcc gctgactggg cagagcgagc gcgtggatcg cggccaccgc 1500 

ccgttcatca cccgcgcgca tctgggctgg caccgggcga agaatcgtgc gggtctggga 1560 

cctgggggcc cagagggagc gagctcctgc gcgggcgctc ggtccgcagg tttcgcaggc 1620 

tcaggggcgt gcctcgttct cacccccact ccggaccccg gtcctcttcc ctagacagcg 1680 

gccccctcca cccctggctc ccgcaggccg ctagtagtcc gcgccaggcc ccgccggcgc 1740 

ctctagggcc ccccagatcg cgcagaccct gacatccccg cctggccctg ggttctggga 1800 

gctgagagcc ggccagggtc ctgctcgtac ctccgggcgc ccagcctcgg gtctgctccc 1860 

cgcggacgcc ccaacctccc cggccgaatg gatggtggtg cgcgcgcgtc ctactccggc 1920 

ggtgccggcc ttttctgttg ccaaaactag acccaaacct ctgcatggga ttcgtctttg 1980 

ggtccccacc ccgtgcgccc agcaaacagt gggtgagcca tgaagatgtg cgagtcagcc 2040 

ggaccctccc cgtcaggcgc ggacccgctg cggccagaga acccagtctg cgccagcccg 2100 

gctcgctcgc gaagccacgg gcttcactga cgcgactttc caagacgtgg gggtcaccat 2160 

gggcagagga catcggttcg gagccagatc acgggcccca taagcatcag accataagca 2220 

gcgccgccac tgagagccgc tcggaactcg cccagcatgt cgggtcccct agccagggcc 2280 

tggtgtacgt ggtcgagggc cctggaagcc ccgatggcct aggaggagca ggcgggcggg 2340 

gcggcgggtg tcgctggccg gtagagagct tcggcctgac ctagcgcagg tctggtgcgc 2400 

gcagagaaca actccaagcg caccgacgcc cgcgagctcc ttccaaacac cgaacgggat 2460 

ccagagcccg agcccacagg cggcggccgg gggagggagc agggtgctgg ccgccgcccg 2520 

ggagtgttcg cgtcctgggt gacccctgga aggacgtggg gcccaaactc cggctggggt 2580 

tgggagagca gcccccagag gctctccgcg ggatcctctg ccgggcggga ccgtggctcc 2640 

acaggagaag tgggtggcaa gccctgcttg gcggaaagca gccgttcccc tcctcctggg 2700 

cctggggcgg cgcccctcac ccctgttccc cgcccctcac ccctgttccc cgccggccac 2760 

atcccctgcc ccttggattc caagcgcccc gcgcgccgag gagcccagcg ctagtggcgg 2820 

cggccaggag agacccgggt gtcaggaaag atgggccgtc tgggggacag cagggagtcc 2880 

gggggaaacg caggcgtcgg gcacagagtc ggcaccggcg tccccagctc tgccgaagat 2940 

cgcggtcggg tctggcccgc gggaggggcc ctggcgccgg acctgcttcg gccctgcgtg 3000 

ggcggcctcg ccgggctctg caggagcgac gcgcgccaaa aggcggcggg aaggaggcgg 3060 

ggcagagcgc gcccgggacc ccgacttgga cgcggccagc tggagaggcg gagcgccggg 3120 

aggagacctt ggccccgccg cgactcggtg gcccgcgctg ccttcccgcg cgccgggcta 3180 

aaaaggcgct aacgcccgcg gccgcctact ccccgcggcg cctcccctcc ccgcgcccat 3240 

ataacccgcc taggggccgg gcagcccgcc ctgcctcccc gcccgcgcac ccgcccggag 3300 

gctcgcgcgc ccgcgaaggg gacgcagcga aaccggggcc cgcgccaggc cagccgggac 3360 

ggacgccgat gcccggggct gcgacggctg caggtaggag gcccagggcc ggggggcggt 3420 

tcggctccgc gggcgggggc tggagcgcag cgctgggcag gcacctgggc tcgcagctcc 3480 

gaagctggga ggtgagggga gagcgatcgg ggacgagctg ggacaaggcg acacaggggc 3540 

tccctcggag ttggatcggc ccctgggact tggcgctcgc gagaggctgg agcggccaga 3600 

gtctagcctg cgaggagacg cgggtcctgc cctcagcgcc ggccgccttt ggcgccaaag 3660 

acagccccgc aggggttccg ggagggccct cctcctgctg tcccctctcc accccgggct 3720 

ccgagggccg ttgggagggt aaccccggga agaggccggg gtgcggggcg cgggtgcagg 3780 

tggaaatcgc cagcaagctc ctccccgccc gcgcgctccc tccgacctgc agggctgtgc 3840 

caancccgag gcctcagctt ccctgaggag ccagggccag gcccccctct ggacagggag 3900 

aaggatctgg gcgggggcct tgacccatgg agtitggttac taagcggttt cgatggtttc 3960 

ccgagggaca gctccctgtg gctctgagtt tgtctgtcga gggctcctgg cctgtctccg 4020 

gagcggtccc aggtagagaa agcccgtgaa gaaatggccc gggccggcct ggagggagac 4080 

acctcacgcc cccttagctc ctgggccgcc tcctcctgca gcccctgcct ttcccggggc 4140 

ttggacttgg ggagcgatga ttacctttgc tcagcttgta ttttggcctg gacgctagga 4200 

gataagccca tgtagtatgc acacgtctgc tacataaaca ggggacagat agacgatctt 4260 

caaccagcaa gggtgcaggg aaaagcaatg caccccaaac ttctgaccag aggticatttg 4320 

cttccaaaga tgctgccatc tgtttattca ctgtctggac atttggaaat ggctcaggct 4380 

cattaacaca atgctttggt ttttgttgtt ttgtttttrtg ttgctgtcat tgctgtttat 4440 

ttgttcagcc ttagctctgg gggaggagta aacaaagcgc gtggcctctg gcacttactg 4500 

agcgctgagc cacccctctt tggatttatt cggggaaaga ttaaaaagca tttcattaag 4560 

aacaggacac ggtgtttgaa atgttgccat atatgaatgt atgcatt 4607 

<210> 16 

<211> 4453 . ' - 

<212> DNA : «:.'■:; " <> i. . 

<213> Homo sapiens 1 

<400> 16 ' 1 ' 'i- • , * , .." ■.>■ -..»:•:- 
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seq_prot.txt 

ttggetctga agcctatagc atcgctgact cagtctgtcc cctggaaggc tggcagctca 60 

gcaagcacag aagtctctcc agaagacagt gggtcacctg cctcccaaaa gctgaaaggc 120 

taacttgtac tttccccagc aggcagctgg caccctgagc cctcggctgg ggcagagcaa 180 

aggagccttc ctccttccta ccttcctggc actctccctg ccttccttct gtcactctca 240 

ggtggaccca gacccaaggt ccagatttgc aaggcaggaa aatgctgcag gcctaggctg 300 

ggaaagggcc caaagccgct agtggattgc tgggactcag cctcctcctt cccactaaga 360 

gagcgagtcc tactgggttc aaaatgaccc caagccctgg ttcctgacac taggggaaag 420 

agatgggggt gacagaatca cagaatccct gctatgttcc tccaagtgtg cccagagatg 480 

cgtgtgtgtg tgtgtgtgta tacacaaatg tctgcttatc ctcaggcagg aagggtggat 540 

gcagtcattt acacatggtc tgtttttctg gaggacaatt ttatttgata aacaattgtt 600 

tctatctgaa tagaataaac aaggctctat gatgaagtaa aacactaaat acacatgcat 660 

taaaaaatgc ataattatct ttttggaatg ggctatacag agatgtgctt tttaaaatgt 720 

taagagtgta aaaggacaaa cagtgaaaaa taaatcttcc tcttattttg tcctccagtc 780 

tcccaattcc tctactcaga ggtgagaaca gaacttccac accctccaga acctccacag 840 

ttagaactgt ctacatgttt ccattgtctt tacttttatt cttgcctgca caaataaatg 900 

aattgctcat tatggaaact tcccaaaaga cccgrtaaca cttcaatagg aagcaccaac 960 

agtttatgcc ctaggacttt gtrtcccacaa tcctgtaaca tcatatcacg acacctaacc 1020 

caatccttat caagccctgt caaaaacgga ctttaaacca agctgcaaat tttcagtaat 1080 

ctggccttgc ctttccccct ctgatagcac catcaaacaa acccccttac tgccgaaagc 1140 

aataagcccg gctttgttcc atccactggt tgtgtrtggtg atatctgggg actgccactg 1200 

aacagacgca cagagggagc ccctacaggc aggggttttt ctgtctgtgc ttctgggaga 1260 

gtatgtctcg tacatttgtc gcgttgatga agacttcaca gctccatcag ctgcgggcaa 1320 

gggggtictga ggcagtctta ggcaagttgg ggcccagcgg ggagaagttg cagaagaact 1380 

gattagagga ccccaggagg cttcagagct gggcgaggta gagagtctcc tgtgcgcctt 1440 

ctctcctctc ngcaattcgg ggactccttg cactggggca ggcccccggc caggtgcatg 1500 

ggaggaagca cggagaattt acaagcctct cgattcctca gtccagacgc tgttgggtcc 1560 

cctccgctgg agatcgcgct tcccccaaat ctttgtgagc gttgcggaag cacgcggggt 1620 

ccgggtcgct gagcgctgca agacagggga gggagccggg cgggagaggg aggggcggcg 1680 

ccggggcggg ccctgatata gagcaggcgc cgcgggtcgc agcacagtgc ggagaccgca 1740 

gccccggagc ccgggccagg gtccacctgt ccccgcagcg ccggctcgcg ccctcctgcc 1800 

gcagccaccg gtgagtgccg cggtcctgag atccccgggc cggatgcgcg gcggccccag 1860 

ctcccgagcg tctgcctccc ccgccctggg ctgcccgggc tccctgggct ccccggcggc 1920 

tgcacggagt caaggcgccc cgtcccgggc gtcccccgcg ggtgccgatc caggctgccc 1980 

ggagtccgga gcccagagag gagagagaca gctggggagc ctggtcaccg cgggcatctc 2040 

ccctgcgctg cagtcgcccg cctggcctgc cttcccgttc ctccgcctct tgccctgact 2100 

tctccttcct ttgcagagcc gccgtctagc gccccgacct cgccaccatg agagccctgc 2160 

tggcgcgcct gcttctctgc gtcctggtcg tgagcgactc caaagtgagt gcgctcttgc 2220 

tttgactgat gctgcccaag gacctctgat cagcaccagg ggagaggagg ggctgctcag 2280 

ggagctgggg trctccggat tccatccaca gcagggccag actctcccca ggaaatggga 2340 

cagggtggca gcggaggctt gagaaccacg ggggttggca ctggctggca agggaggaag 2400 

aggccgccgg gactgcccca gcctgcgggc atctggraga tgaagcttgc ttgggtcaat 2460 

ccatttctcc tggctggaaa cccatggtct tccatttgag aactagatac gaacagggtg 2520 

aggcgagagg gagagggaag agtgggtttt gggattgggg ccagtttacc ctcaccctgg 2580 

agtccctgga gcatgggacc rttgatgaag cctcctcccg aatctcttcc agggcagcaa 2640 

tgaacttcat caagttccat gtgagtatcc acccctacaa cagttggctg cacagacaag 2700 

ttgggaaggc ttcaggggac atcccctccc tgccctctgc tgcagggctg cgccacccct 2760 

taccacttcc actccccctc gcttacccca cctttgttct ctccagcgaa ctgtgactgt 2820 

ctaaatggag gaacatgtgt gtccaacaag tacttctcca acattcactg gtgcaactgc 2880 

ccaaagaaat tcggagggca gcactgtgaa ataggtatgg ggatctccac tgcaactggg 2940 

agagaaattt ggggacaggg agggatgggt gggaggcaag agcaggcagg agttaggagc 3000 

tggaggtagg gtgggtgaca tcttcatccc tatgtgacaa gcataaacac acacacacgc 3060 

tcacgaaaca gtggccacac aaatgtgagg tggggttgga aggagaccct gtccagtctt 3120 

ctggcaggtc tgaaacgaca tctttaaaat gtccgttggc agccgggcat ggtggctcac 3180 

gcttgtaatc ccagcatttt gagaggtcaa ggtgagtgga tcatttgagg tcaggagttc 3240 

aagaccagcc tggacaacat ggtgtaaccc tgcctctact aaaaatgcaa aaatcagcct 3300 

ggcatggtag tggatgcctg tagtcccagc tacttgggag gctgaggcag gagaattgct 3360 

tgaacctggg aggcagagat ctcagtgagc tgagatcaca ccactgcact ccaactgggc 3420 

gacagagcaa gactccatct caaaaaaaaa aaataaaagt tagttggaat gttcttctct 3480 

ttctcatart ctctcatxct cctgtcccct tgtagataag tcaaaaacct gctatgaggg 3540 

gaatggtcac ttttaccgag gaaaggccag cactgacacc atgggccggc cctgcctgcc 3600 

ctggaactct gccactgtcc ttcagcaaac gtaccatgcc cacagatctg atgctcttca 3660 

gctgggcctg gggaaacata attactgcag gtgaggtggg ggcaacaagg accaaaagcc 3720 

ctccctacag cttcccagaa accttgttac catccccttc tcccagaggg ctggccatag 3780 

cacaagagaa gtgcggcctc tggttgagtc ttccctgagg ggaggaggca gggaaggccc 3840 

tctgggttgg aatgacatcc cctatctttc tgtgttgcca ggaacccaga caaccggagg 3900 

cgaccctggt gctatgtgca ggtgggccta aagcrgcttg tccaagagtg catggtgcat 3960 

gactgcgcag atggtgagca tcactgacct gctgatgaca gtggggtgga aggggacaaa 4020 
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cttacatgtc 
atgctttctc 
ctcctctcct 
taagattatt 
caggaggcac 
ctgggtgatc 
ttctgcccca 
tccctcatct 



cccttattcc 
ctacctgcct 
ccagaagaat 

gggggagaat 

cgggggggct 
agcgccacac 
ccccaagcac 
gcc 



atcacaggag 
ccctaagaca 
taaaatttca 
tcaccaccat 
ctgtcaccta 
actgcttcat 
atccctttct 



seq_prot. 
gactgaggag 
tccctctgtt 
gtgtggccaa 
cgagaaccag 
cgtgtgtgga 
gtacggccct 
ccttcccagc 



txt 
gtggggggtg 

tgtcctccag 
aagactctga 
ccctggtttg 
ggcagcctca 
gggtttctcc 
aaagtgttcc 



cccgagaggg 
gaaaaaagcc 
ggccccgctt 
cggccatcta 
tcagcccttg 
tcttcgactc 
gcctcarttc 



<210> 17 
<211> 6001 
<212> DNA 

<213> Homo Sapiens 



<400> 17 

aaatttacag 
aattcaaaga 
ctactttgaa 
gcaaggtcac 
ttttccacaa 
tttaaagctt 
ttttrtttatt: 
attttggaaa 
ttgttctttt 
gaaacaaagc 
gctcaaatga 
tgacaaccta 
tctggtaaat 
cctaaaacaa 
tcaacatttc 
aaagtgatta 
atgaatcaag 
ccaaaaaaga 
ttgtaaagga 
taactaggaa 
agttacatac 
aactacagca 
gaacagacca 
aaattacacc 
ctaggagtgc 
aagcctgcca 
tgacccactc 
ccgccctcgg 
gagacccgca 
aaagagaggg 
aatgagagag 
agagagagag 
ttcaacggga 
agtcaccatt 
atcctagtaa 
gattggtgct 
tcctcgcagc 
tgcccatccc 
atagaggtgg 
ctagggtagg 
acgggggttc 
gaggtaggga 
agcagcagcc 
cacagcacca 
gatcgcgggg 
cggcggtggc 
gtttcagcaa 
gctgcgccgg 
caggaggaga 
gagtcaaggc 
ggcgtttggg 
gctctgattg 



gtgttggtgt 
gaacactcca 
gctttaaaat 
tgtctactga 
ttcttttcta 
gcgttaatac 
tttatttttt 
agagctagtg 
gcttcaacga 
ctcaataata 
agattgatca 
tctgcaaaac 
atgtaatttc 
tgcaagggct 
caaaaattgt 
ggcaatagca 
aagttataga 
gaacaatcta 
taaattagaa 
ctatacaggc 
acttcagctt 
aaattatatt 
cacctagggg 
acaattctgc 
agcaactccc 
agcgttcgcc 
cacagaagcc 
ctcgcctaca 
aagtgttctt 
gcatatctgg 
agagagagat 
agagaaattc 
gaccagtgcg 
tgatttattg 
gacttttttc 
ctttcaaaac 
tccaatgtaa 
agaaccaccc 
gtggtattag 
ctggtctgtg 
tggacttacc 
aagcagcgag 
agcaaaagcc 
tgactagttc 
gccgggcagc 
agtggcaccc 
catccaggga 
agacgcggga 
cgggagtcca 
gccccgcagc 
aagcagcagg 
gtggagcagg 



cacagaggaa 
gtcattaaaa 
acttgtccta 
taacctcaaa 
gtcaattctc 
tgacagtggt 
gctcatnagg 
atgatctctg 
ctgaaacaac 
aagataaaca 
cctctaagtt 
cagtttattc 
ttgattgcaa 
aggaatgaag 
ttttgcaggc 
aagtgaaact 
aacatgttga 
ttgacccaaa 
tgatgcataa 
tttcaccctg 
atatacatct 
cacagaagat 
acaataccca 
ccagtatact 
atcctctggc 
gtcagaatct 
tgtccagaaa 
tggagcctga 
tttctcaact 
actgcaaaac 
agatagatag 
tattgaaacc 
acctcatggc 
ctttctcgct 
tttaatgatg 
agtgaaccca 
gcgcaagcat 
ctcacccccg 
gggtctaggg 
ctggatacgc 
agagcaattc 
cttcaccggg 
ctcgcaaagt 
gtgcaactct 
gggctactga 
ggcggggaag 
ctgggctcag 
agctgctgcc 
ggggcagacg 
ccggcagccg 
ttttcagccc 
atgcaggtcc 



gagatcattt 
catcttacag 
aattttaaat 
atctagttaa 
ctttagtatc 
gaatgaaagc 
tggacaatat 
aatacccttt 
cccttatttg 
ggcacagtgt 
aacaggggtg 
tctttagttt 
atgtcaactt 
caaaccagtc 
ncataattat 
tattctttca 
gttttattaa 
tctaataggg 
taattttcct 
agttacagtt 
ttgaaagaca 
gaaaaatnac 
ggcatgttaa 
acaggctgtc 
cctatttaat 
gaaggaaccc 
tggaggagtc 
ccagcctxag 
cccctgtacl: 
caatgtcttt 
agagagagag 
cagctccuct 
acctttgcca 
cattctttct 
acaaagcttc 
gaaaaccatc 
gcaaaggttt 
ggcctgcaac 
cagtaggagg 
gtgttcttct 
cagccggtgg 
cgggctacga 
gtccagctgc 
gcagcagcaa 
gcatcccgcg 
cagcagccaa 
ccccggagcg 
ataaggaggg 
agtggagccc 
cctctcgagc 
gcccggggtc 
cgggagggag 



tctctgtcat 
atttttaaca 
ccaacaacta 
gtgatcaata 
cttttctgat 
ttaacatttg 
ttatgaccac 
taccatttcc 
aaatgtatcc 
tttctgtgat 
gaagcggggt 
atgcagtccc 
cacatttaag 
tgtgttggac 
taccataata 
aaaacaacac 
aatgccaaat 
ttgcatatct 
tttggcattt 
ggtcattccc 
ctttattcag 
atacacactt 
cggagtttaa 
aaaccgaaat 
taggaagctt 
gagcgagcaa 
agcgcccact 
tcatgcccac 
accttgaagc 
tgccgcctag 
agagagagag 
agaatctgtg 
ggaatcagcg 
cataaagtita 
tgtttcagtg 
ccgtttaata 
cctgctacac 
agttcccctt 
tgaggggctg 
gcggagttaa 
gcgtttggca 
tgagtagcat 
tgcactgccg 
acggcttccg 
gacggcggca 
acccgcgcat 
agagggtcgt 
agctctggga 
gaggaggcag 
tctcjccgccc 
acgtgggaag 
gggtcgacga 



cataacctca 
aaaaaaagca 
tagctgtact 
ttcgtttcca 
agtgctattt 
cttcctgttt 
aaaactccac 
ccatctttaa 
agacaaagag 
ggtctgtttg 
gccaagttct 
ctttaaaata 
ttagtrattt 
tacaaagcca 
aagcatctaa 
acatgtacgc 
ttagaaactg 
caacttgtct: 
acatxagtaa 
tcctctctaa 
agccagattt 
gctaaaacta 
aatgccaagg 
gctatgccag 
cagcagagcg 
gaagagtgcc 
gaagtcggtt 
tccggcctgg 
ttagggaagc 
gagagaaggg 
agagagagag 
tgacctggtc 
attcccctgc 
tttcttcctc 
tttcccctag 
tttctcaaaa 
ctgcactttc 
gtttctctgg 
aggaggcgcg 
agggtcgggg 
gtcacttaag 
gacgggcagc 
cggggactcc 
aggaacacag 
gcagaggcgg 
gatctcgaga 
ccgctgagaa 
agccggagga 
ggtggaggga 
gcatccctct 
aggcagncgg 
ggaggtgcaa 



4080 
4140 
4200 
4260 
4320 
4380 
4440 
4453 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
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ggatgcaagg aggaggcggc cgcggaagcc acagatgggc tcgctcgcca ggcgctggcc 3180 
cgagtggggc taggcgggga tggctcaaat gagaagctcg ggcttcaggg tgggctaccc 3240 
gcacactcat ataccattcg cctcactctc cgctccagga cgccccctac cgaaggcgqq 3300 
9^ggacta gcgcccctct tccgcgcgtg accccgggcc gcgagtgcgg gccgcggctg 3360 
ggtggcgtct ctccgagctg gagatggtgg gggcggaggt gtcagaggag cagcagcagc 3420 
agggcagaga ggggcgagtc ggcgcgggag agggcgtcct gctggcgacc ggcgctccag 3480 
cgtgcgggag cgcgccgcct aggctgtagg gggatgcagg ctgggaatgt cgcggcqqaq 3540 
agg £5: a9gga cgtttctcta gggatttaca ggaaagaggg tgagaggcga tggtgttaga 3600 
accgctcttg ccgacctgga agcaacagca gcatctccca caagagcgtg caaccccaag 3660 
gctgctcgcc gaggcagctc agccatcccg gcaggcgctc tccttccttc tctcttctcc 3720 

cctctctcct cccaggcccc ccgcagctcc gacccagccc aagcgttcgc aggtttgaat 3780 

"" ct $ ctc atcacccgct cctctccagc ccgtagccta ttagtgtgtc cacctgggag 3840 

gtgcggtcag atgtgtttgg aaggtcagat tggtcgggac aagtggtctg agagaaagag 3900 

aaagg " cct "gcatacgc cgcgggtggg ttgccgggag catcggccgg gcagcggcgt 3960 

^«22S^f2 9 9agagcgggc tccatttgtt ggcccaggca gtgaccctgc gttccttact 4020 

cgggtctttg ccggatggcc ggtgacctgg ggcgacgaga gaaggtctaa ctcggcaqqa 4080 

gtctctggct ctgcgcgttt ctttcattct ctccagcggg iagggcaaac ggcltagcgg 4140 

I a £"|" tt: ccgtctgctg cattcttcag gcagttagac acactcttta gcctaatggl 4200 

a "" a ?!5H ccagtaacgg gaccaagagc tttcggggac aagggtggag aggaacatct 4260 

tt ^" c " t § accggggtca ctattgcagt ctcagtgttt tggatgcccc atagggaaga 4320 

attcct9ttt ttgtttttgt ttatcttctc 4380 

tccgctctcc ttctctattc ctctctgtta tccctcctcc ttcttctccc cccqctttca 4440 

aaagcctccg gatcctccct ttttctattt aaattctcct tttgtgcccc tctttctqta 4500 

tcccctgaat ttaggagagc atttgataac atttaacagg caattagtgt ccattcccaa 4560 

tcacttaaaa gaggcattca tatactttga aaacgggact atctatcctt tgcagacacc 4620 

agcagaaaaa caaattgtac ccgagtaatc cttttaagta ctttaacctc caacctcctc 4680 

ccacttcctt gctttttaac ttctcctttg agagatgtga tcgtgcagca cctcagtgcc 4740 

tcaacgaaat cttttttttt ttcctgtgtg aaatccatcc ctttatctta catctlcgcc 4800 

tccgtccgag actgtccctc tcccctccca cctccaaaga tttctgaatc tcagtgtctc 4860 

tcactcctgg caattaagca gcagatccca gcattctagt cggtggcatc tcgctcctca 4920 

ccgacgaaga ctccattaaa acagatcaat tagaccagac gttggaggca tcagaaaatc 4980 

ffllllllll "gagcagct aaattcttta aggaaacaga Itacccatta gatagagctg 5040 

aSn a ^ a ^ a "f^S aaCa a gQ aa "aga aatttctttc gctacaggct ttcagcigag 5100 

aaggcaacat aaatatagat caagatttaa caactctaca gcagagaatg agaacatgtc 5160 

attttccata gcaaggctgg tgtggtaact aatcaggctt atgaaaataa gtcatgcttg 5220 

aaa " aaa 99 caaagtcctt aaaagtgttt atgcagtaat tatgataatg aaacaggacc 5280 

tgctaggatt tcagagtttg gctatgtaag tagaatttta gagaacctct tagcagagga 5340 

^^f"^ ^aattttct gctaagtaaa tttttggcat ictttctaat aatatatgft 5400 

fH^ Ct ^ ga c 9 ttt:t g c ca aaagtaagtt aaaactccaa aggagttaat tactggttgt 5460 

aactggttaa caaatgcggt tgcttccaca gaggtccttt aaattattaa acagtttgaa 5520 

gcaaagcctt ttcaatggga atgctgcaat tttgttgcac ttactctgta cttagtqtta 5580 

| a 9^9ccacc a agaaacaaa ttctgaaact ggcaagcacc accaagtggc agaagaacat 5640 

^ a ^H 5 tga gcagagaatt gtattactga atratgtaaat aaaaatatat acattattta 5700 

2 a SH 9 £S ac ta ggta c caa agaagtagac aagactgcat tagcaattgg attagtgctt 5760 

ccagcaaggc aaaatcagtt tacttattag aattaaattt aagtctitga 5820 

SSEHEC* tca * at 9 att gctagtaata tgacacaatc ataccatgta 5880 

Srfr aaaa ^ JEESSH aaata « a ta ccatatttac tttcaatctt cttgagctag 5940 

aacactttat ttgtggcata tacactctag aattgatgca gaggagcaga gtccagttgt 6000 

6001 

<210> 18 

<211> 7001 

<212> DNA 

<213> Homo Sapiens 

<400> 18 

aa Ji§ caatgg aaaaagagag attgtaaagc tagaaggctt aggaattgcc tcttgattag 60 
gtgtggaagg caagggaaaa tcagccctcg aagaagacag tgagatttta atctgggtgg 120 
«=i? a ^H ac a9t 9 a HS tg 99 caca 9aca cggggaagtt gagaggaaca ccatgtttga 180 
2 aa ^? 9 9ac tca tatttga acaagcctgc aatgcccagc agaccgctgg aaaagtgggg 240 
ctggagacac attcaacgga ggagccagat caatctttac ccttcttcac ctgagagaqc 300 
cagtaagtca cggctggaac gtgtgtgtcc agcaggagag ggtagggagg gaaqccaaaa 360 
llillnil* cc cgagtgaa gtttttgcca aaggcagaag aggaaagtcg gcg?5ca« 420 
fSi«-53* c 5 cacccat 9 c tcaccaagcc cagggacaag gctcaccaag atgagtttgg 480 
aagagaatgc tggagagaaa gtggttaaga aaactgcctt tactgaactt cttgggctai 540 
SJS.IS!* aa g tc 3 ct 9 a acaatcaaag cctgtgagga gacagctaac cttcttattc 600 
ttcctatgtc aatagtgaac aattgcagat cccctttcct ttccttctcc tttcccctgt 660 
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tcctctctcc tccctccctg aatactcttg cttttttctg ggactggtct agagcatggg 720 

tggccattgt tgacctacag gaggcaccac tgtcaccaac aaagggtaac agtctttctt 780 

ttcaatattt atttatatcc agtatttatt ttcaatactg actatggaga gagctctcct 840 

gtgctcaaac actgcaatac tgggggtctt tcaaagcaca aaaacatata tttgcatgat 900 

ggcatcatta acatttttat ggctttctat ttcttttttg tactggtctc aagagccact 960 

cataaatctc tcagtaactg catagtgtcc cagggccaga gaccggccac tcctggcatt 1020 

gtgattagag tcatttaata tccaaggtgg tgactaatgt ctggcaacaa agcctccatt 1080 

gggtgtcatg tgtcctggga ccctgagcgt gggcactcta ggagcacctc agtattgcgt 1140 

gttagtacta tggccgagag aatagttgag aaagtggtca agaggtggat ccatgngaac 1200 

gccactggga aatgagagac ctcgttccca atcacggtca gtgcaactcg aaagcctaaa 1260 

atcagtttaa aacaaaggta tctaccttta tcttatgttc atatcctagg cttttaataa 1320 

tacgtatttt tcacatgttt acagaaagca gtcaactgag ctattcatgg aaaggtttgt 1380 

gggtttggtt aacgaagtgg aggagtatta catttcagct ggaaacacat ccctagaatg 1440 

ccaaaacatt tattccaaag tctggtttcc tggtgcaatc ggaggcatgg caatgcctct 1500 

gttcagagac tgggggctag ggccagtaag gcatttgatc cacatgtatc ccagaaggct 1560 

tttattgtta aattatattc tttcggaaaa accacccatg tcctattttg taaacttgat 1620 

atccatacac ttttgactgg cattctattt tagccgtaag actatgattc acagcaagcc 1680 

tgtttttcct cttgcttggg gtggcagcag aaagcatagg gtactrtcca gcctccaagg 1740 

gtaggggcaa aggggctggg gtttctcctc cccagtacag ctttctctgg ctgtgccaca 1800 

ctgctccctg tgagcagaca gcaagtctcc cctcactccc cactgccatt catccagcgc 1860 

tgtgcagtag cccagctgcg tgtctgccgg gaggggctgc caagtgccct gcctactggc 1920 

tgcttcccga atccctgcca ttccacgcac aaacacatcc acacactctc tctgcctagt 1980 

tcacacactg agccactcgc acatgcgagc acattccttc cttccttctc actctctcgg 2040 

cccttgactt ctacaagccc atggaacatt tctggaaaga cgttcttgat ccagcagggt 2100 

aggcttgttt tgatttctct ctctgtagct ttagcatttt gagaaagcaa cttacctttc 2160 

tggctagtgt ctgtatccta gcagggagat gaggattgct gttctccatg ggggtatgtg 2220 

tgtgtctcct ttttctttca ggacttgtag gattctttgt gccatttgca tataatttgg 2280 

caggttcaca ttttttaaga gccctatgaa gtgctttttg catgtgtttt aaaaaggcat 2340 

ttgaaaattg aaagtgtgat ttatggaaat taaatcatct gtaaaaaatt gctttggaaa 2400 

gtaatgattg ctggccataa agggaaatat ctgcgatgca cctaatgtgt ttttaaccct 2460 

ttatttgctg acaatctata gtcattaatig ctaaactcga ttttggcttc agctacattt 2520 

gcatattgtc caacaatggt ctatttttgt aagaattaga taaaatgtat acttgatata 2580 

aaatagtxaa aaatgtaact cttagtaaca gtaagcttgg catttagata gaccatgaac 2640 

acttcgtcag atactctgtt gggtgtttgg gatagcaatr aaaacaaagt attgatagtt 2700 

gtatcagagt ctattaggct gcagcaaagg aagtttattc aaaagtataa actatccaag 2760 

attatagacg catgatatac ttcacctatt ttttgtctcc ttaatatgta tatatatata 2820 

tatatatata tatatacaca tatatgtgtg tgtgtatgtg cgtgtgcatg tttaactttt 2880 

aattcagtta aaaacttttt tctatttgtt tttcatctgg atatttgatt ctgcatatcc 2940 

tagcccaagt gaaccgagaa gatcgagttg taggactaaa ggatagacat gcagaaatgc 3000 

attttaaaaa tctgttagct ggaccagacc gacaatgtaa cataattgcc aaagctttgg 3060 

ttcgtgacct gaggttatgt ttggtatgaa aaggtcacat tttatattca gttttctgaa 3120 

gttttggttg cataaccaac ctgtggaagg catgaacacc catgtgcgcc ctaaccaaag 3180 

gtttttctga atcatccttc acatgagaat tcctaatggg accaagtaca gtactgtggt 3240 

ccaacataaa cacacaagtc aggctgagag aatctcagaa ggttgtggaa gggtctatct 3300 

actttgggag cattttgcag aggaagaaac tgaggtcctg gcaggttgca ttctcctgat 3360 

ggcaaaatgc agctcttcct atatgtatac cctgaatctc cgcccccttc ccctcagatg 3420 

ccccctgtca gttcccccag ctgctaaata tagctgtctg tggctggctg cgtatgcaac 3480 

cgcacacccc attctatctg ccctatctcg gttacagtgt agtcctcccc agggtcatcc 3540 

tatgtacaca ctacgtattt ctagccaacg aggaggggga atcaaacaga aagagagaca 3600 

aacagagata tatcggagtc tggcacgggg cacataaggc agcacattag agaaagccgg 3660 

cccctggatc cgtctttcgc gtttatttta agcccagtct tccctgggcc acctttagca 3720 

gatcctcgtg cgcccccgcc ccctggccgt gaaactcagc ctctatccag cagcgacgac 3780 

aagtaaagta aagttcaggg aagctgctct ttgggatcgc tccaaatcga gttgtgcctg 3840 

gagtgatgtt taagccaatg tcagggcaag gcaacagtcc ctggccgtcc tccagcacct 3900 

ttgtaatgca tatgagctcg ggagaccagt acttaaagtt ggaggcccgg gagcccagga 3960 

gctggcggag ggcgttcgtc ctgggactgc acttgctccc gtcgggtcgc ccggcttcac 4020 

cggacccgca ggctcccggg gcagggccgg ggccagagct cgcgtgtcgg cgggacatgc 4080 

gctgcgtcgc ctctaacctc gggctgtgct ctttttccag gtggcccgcc ggtttctgag 4140 

ccttctgccc tgcggggaca cggtctgcac cctgcccgcg gccacggacc atgaccatga 4200 

ccctccacac caaagcatct gggatggccc tactgcatea gatccaaggg aacgagctgg 4260 

agcccctgaa ccgtccgcag ctcaagatcc ccctggagcg gcccctgggc gaggtgtacc 4320 

tggacagcag caagcccgcc gtgtacaact accccgaggg cgccgcctac gagttcaacg 4380 

ccgcggccgc cgccaacgcg caggtctacg gtcagaccgg cctcccctac ggccccgggt 4440 

ctgaggctgc ggcgttcggc tccaacggcc tggggggttt ccccccactc aacagcgtgt 4500 

ctccgagccc gctgatsgcta ctgcacccgc cgccgcagct gtcgcctttc ctgcagcccc 4560 

acggccagca ggtgccGtac tacctggaga acgagcccag cggctacacg gtgcgcgagg 4620 

ccggcccgcc ggcattctac: aggtacecgc gcccgcgecg cccgtcgggg tggccgccgc 4680 

gcccggcagg agggagggag ggagggaggg agaagggaga gcctagggag ctgcgggagc 4740 
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cgcgggacgc 
ggttctgcgt 
aaacgtactc 
ctaacaaaac 
tccaggcaaa 
aaagttgggg 
rtttgtttaa 
agctatgatg 
cccacgtcct 
ctggctgagt 
taggtgtccc 
ctgccagccc 
tcagccaggg 
gagaccactt 
acaagaattg 
tttgtttttt 
gtgaacacct 
agggcgccac 
ctggtgtccc 
tttccaggga 
actgaagggc 
gagcatccca 
aacttttaaa 
agagaaaaag 
gattaagtct 
gatgtgaatt 
tcagttcttt 
cttggtagat 
gggatcgcat 
tgccctatct 
agagacaagg 
ttcagtttta 
agaattctgc 
gacagatatt 
aaaggaaagg 
agttttgggg 
tcagtttagt 
aaatatcaga 



gcgacccgag 
gcagcccgcg 
tccacccact 
acacgttgga 
taaacacggg 
gtgtttggag 
tgtgctcccc 
atgactttct 

ggggctttag 

ccgcgctgga 
gcgagtttaa 
agtgacaagc 
tctgcgcctc 
tgtgacttca 
ttctttctcc 
gttttttgtt 
tcctaggtgg 
atggctgggt 
tatatctttc 
tatttatagc 
agattctggg 
cagccacgca 
ggttggcata 
gatttcttct 
aattccacac 
ccataacctt 
tattgcaaat 
tatttactgc 
cttattcaaa 
tacactttct 
aacttaaaaa 
gaatgatttt 
ctgttattat 
atttagtatt 
attgattggg 
ctgaaagcag 
gagtctagaa 
gtgcatggta 



ggtgcgcgca 
ctgcgttcag 
taccgtccgt 
aaaccagaca 
gcgctttgag 
ttagcagagc 
aaatttcctt 
gggaagcgat 
agagcgattg 
gcggattgct 
aacaagccat 
ctgtcccacc 
gcagccactg 
atggcgaagg 
ctctcctctc 
ttcctgatgg 
ccacaggaca 
cctgaattag 
tgccttgtaa 
agaaggaagg 
ggtattcagt 
gatccaagct 
tttcaaatta 
gcccaccttt 
aacatttaaa 
atgatggact 
taatgctgct 
cttgctaata 
attgttcatc 
gcattacatg 
ttaaaaaaaa 
tcctgtgttt 
ccagcaagga 
tttcccctct 
ctcttttagg 
tgttttcaaa 
tattttttaa 
tttcgtaagg 



seq_prot. 
gggagcccgg 
agtcaagttc 
gcgagaggca 
aagcagcagt 
tcacttggga 
tcagcagagt 
tcatctagac 
tcctgtcacc 
ggagttgaat 
ggcatgtgac 
atggaagcac 
cggggagaat 
trggaaggagc 
ttgtgtgtcc 
cctcccattt 
gcctgcagag 
ggtgtacccc 
gcatttccca 
gccgtggacc 
ggactaaagt 
gtgcatcttc 
gtctttactc 
atttztacttg 
ttzttttttaa 
actgcttgat 
ccagaaacca 
gaatttcaat 
atrtacaaagt 
atxccagtga 
atattcaaac 
aacttgcaca 
tatttcccgg 
aaagaagcat 
cagttrtttct 
aaactttaag 
ctgcttgtca 
aaggactaaa 
ataagttttg 



txt 
ggcgcgcggc 
tctcgccggg 
gacccgaaag 
tatttgtggg 
aggtctcgct 
tttatttatc 
tatttgattg 
cgctttcccc 

gggtctgatt 

ttctgacagc 
aagtgcttaa 
gccccggagt 
gcggccggtc 
tcattttaat 
tctcttgccc 
ggattaggtg 
ggactgggtt 
actgtacact 
agtttttgtt 
gcagtttggc 
agccgccttg 
aaaagacaaa 
ttttaattta 
atggaagaac 
gtgaaggaag 
ttttcttccc 
gggcactaat 
gaacctggtc 
taagtggtat 
actcttagaa 
aatgggactc 
attatctttc 
ctatgcaagt 
gcttaaatga 
tttcttaagt 
tgacccagag 
atggaaagga 
t 



ccagcccggg 4800 

cagctgaaaa 4860 

cccgggcttc 4920 

ggaaaacacc 4980 

cttggcattn 5040 

cttttaatgt 5100 

gaaatatgtx 5160 

tcctccccac 5220 

tcggagttag 5280 

cggaaatttg 5340 

aaataatctc 5400 

ggcgtgcggg 5460 

caggacacag 5520 

ttttttccct 5580 

agtttctcct 5640 

ggcgcttctg 5700 

tggaagcttc 5760 

ggtatccgga 5820 

cagtattctg 5880 

cccagaggat 5940 

gagaaattta 6000 

caatgaacaa 6060 

gggttaaaac 6120 

aaagtacagc 6180 

gcactggtat 6240 

tatttaattt 6300 

gagactgctc 6360 

aaatacagag 6420 

cagtgtaata 6480 

taataaaaaa 6540 

tgtgtggaaa 6600 

ctcttttgtt 6660 

tcttcatatg 6720 

ctctgggtat: 6780 

agttctcaaa 6840 

ggtcatgaac 6900 

atataataga 6960 

7001 



<210> 19 

<211> 3501 

<212> DNA 

<213> Homo Sapiens 

<400> 19 



caactctgaa 

tttcgggtag 

cttccattta 

atttgagtcc 

aatttaggag 

tcaaccattc 

gtaaccaatc 

ttgtctataa 

ctggaagctg 

ctgctgaagt 

aataaccctg 

tattttctcc 

ttttggctta 

ctggatgecg 

gttgggagaa 

tatttggttg 

ttaaagcctt 

cacctacagg 

ggagcttgtt 

cttccccggc 

tcacagaagc 

gccccacctt 



aaccatctgt 

ttgaagcaaa 

tgtttgttaa 

ttgtggatga 

tatgtgcctt 

atacactgct 

cagttgtttc 

atatgttctg 

ccccattcgc 

tttcttttaa 

gaagcagaaa 

atggcaatta 

gggtaaagct 

ttttgtggga 

gggtatcagg 

tggacgtcag 

cagaagcagc 

gtccctctca 

ggtttggtga 

actgcatagc 

caccgcgctg 

ctctgcctgc 



agacattctg 

attaagtcca 

agcaacctaa 

actgtaacct 

taacaatagc 

gagtgttcaa 

tctgccttac 

accatgaggc 

aaatcattca 

caggtttaga 

aggctggggg 

caatccagca 

ttttaaaaca 

acattttaaa 

cgggtatctc 

agccgtcatg 

gtgccccatt 

gcactgccca 

ccaagataca 

aaaaggaaag 

gcggggagga 

gaagggccct 



caggctccat 
atgataagca 
gtatgatctg 
agcttaatag 
tgagtcttgg 
actgtgttca 
ctccaatttc 
atccctggag 
ttactcaatt 
aaaaataatg 
ctccaataag 
cattatatat 
ggcactgcca 
gaggaatgtc 
aaaaccattc 
gtaagaagga 
gcccactagt 
ggcctcccga 
ctccagggaa 
ggccgctggg 

gggggaccga 

tgtccggcgg 
SeTte~24 



ctcaggaaca 
aatataacca 
agaaggactc 
gcagacaaga 
ccaatcccag 
aagaaggcaa 
tgtatgtcac 
tctctgaatc 
aaactgcttt 
gcaaaaatga 
tgtaaatagt 
atattttttt 
accagtgtta 
caaaaggaaa 
ttagggctat 
agcaaagcct 
gcgccgtgaa 
gtgctccagc 
tatgccatgc 
tgtctgtggg 
tgcggtccat 
gaggagagag 



atggctattt 

ttatcaaaat 

tgtattctat 

ttgaaaacct 

tggccatact 

aagccaacct 

ttcccttttt 

cgctgtgatt 

aaatttaatt 

atgaaaatcc 

cccatcccta 

gcttctcgca 

tcaagaaggt 

agggggatgg 

aggtttaatt 

tttgtaataa 

gtctggtgtt 

acagtagctt 

agtggagtct 

tcctgggcag 

gtcccgggca 

gcgcgcccca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



seq_prot.txt 

cccgggctcc tctacacctg ccgccgcctg ggccgattcc gcgggcctcg cccggcgctt 1380 

cagccgattc ccgcccagct ccgggctcat gggcgcggtc agcagggcgg gccagggcgg 1440 

cggggcgcga cactgggagg aagtgcgggc cgcctgcccg ggcgcgttaa ggaagttgcc 1500 

caaaatgagg aagagccgcg ggcccggcgg ctgaggccac cccggcggcg gctggagagc 1560 

gaggaggagc gggtggcccc gcgctgcgcc cgccctcgcc tcacctggcg caggtaggtg 1620 

tggccgcgtc ccctacccgg ccgggacttt ctggtaagga gaggaggtta cggggaacga 1680 

cgcgctgctt tcatgccctt tcttgttcta ccrtcatcgg ccgaggtaaa agtgctgaaa 1740 

ccatgtgaat aaaatacagg tgggttccgc cagcttcgct cctgaaccta cccgcgctcg 1800 

ggatccagaa gctgcgccgg gagagagggg etcaggcctg ggcggagggg acggaggtca 1860 

gaccgtgcgg aaagtgaccc gggcacccca gggcgcccag gcccccaggg agcgcggaaa 1920 

gtgcggtcgc ggcccggccc tcgggagacg cgggattggg atcaggcaca gcgcgaggaa 1980 

gtcgatcttg gagctagaac attttccttt ggccatttac acgaatccac tggaaaatgc 2040 

cgcagtgttr atcaaagrta ctcaaagtag aaatgtccag acgtcttatg agcttagaca 2100 

aatcttttac tacaaaaaga aacagcagtt gcattcaaac aacaaccctt ctgaaccact 2160 

actaaaattt agcataatta ctcttgtgga tacattttca ttgtcaagta atttacttag 2220 

ccaatgaact tggagagcaa gaaagtttta tgtagtaaaa tgtaaatttg agttaagagt 2280 

taagggtgtt tctttttgtc tgtttgtctg tttttggcaa tgtggctcca aaaccttaag 2340 

cccacctaaa aactatataa atgcaatcca ttcttttgtt ggaatgttca aggactagaa 2400 

agacaattgg agaagtgaga gtttgaatct ttttttacgt tggaaacagt gttgcaaaat 2460 

attttttgag ttttgcctga cttagcaaag attcagtcga actcaagtag agttcaaagc 2520 

attgcagcgt gtagtaaaaa aaaaaaagag ttgaagatgt tgtgccatat ctgattcttg 2580 

gtattaaaaa taaaaaaagg aaccatttaa nccctaagag ttcctggaag aaatggaatt 2640 

gattctacca tctgacttct ttgtcttagt agggacaaat tcattgcctt cacacaggca 2700 

gcattttaat taaatgcaac taaatacatt gatgtcctcc ttctccccag ugaaagtccg 2760 

agcagtgcat agatagaatt atattttctc aaaaaggctt aaatacatgt aaattattaa 2820 

gtgttttaag tgagaaatct ttgtcagttg aaattatttt: atcaaatcat ttttgttctg 2880 

aacggcactt ttgttttaac gtatttagaa aactcttgca ttaaaacaga aattgatttg 2940 

tattatcccc tgtatttgaa gtgcacttta aagtgtattt gaatgagaga ttataatcaa 3000 

attacttgat ttgtgtctta tcctcttact cttctacctt ggatttgaaa aggttggatt 3060 

tgaacactag gaaaaaagag atttttctac tgagggattg tgggaagatt tttttttaag 3120 

cctctgtatt tgaagtgaaa tataaaatag gattttataa tagtcttaat aactagaaag 3180 

ttttaagcta ggataaaaat ggggtgtata ctttaggtct ttatctttcc tgagagatga 3240 

aattggtaga aacgttattt tattgctaga tttcacttta gaaatgaatc accctgatct 3300 

ttgttaggcc cctatttaat attctaaatg caggtctaat ctctctgctt ttttttaatg 3360 

gaagacttct tagtagaact tcacttattt agcacattgt ttgcccatgg ttataatcca 3420 

atatgcttgt gccaagacgt attttggaat tcaagctttt tcagaatttg gaattaaaat 3480 

ttagaatcta aatttcagaa c 3501 

<210> 20 
<211> 4216 
<212> DNA 

<213> Homo Sapiens 
<400> 20 



aggtaacaat aattatgatg taccaggctc cgggccaaag tcatcacctg cattaggtca 60 

ccaaatcctc aggccaaccc attacagtca tgagtcactt aataacagac acattcaaca 120 

aattccttgt taggcaattt catcattgtg ggaacatctc agagtggact tacacaaacc 180 

tagatggtct agcctactac acacctaagc catatggtct agcctactgc tcctaggcta 240 

caaacctcta cagcatatga ctgcaccaaa tattgaaggc agctgcaaca tgagggtaag 300 

tattagtgtt nctaaagata gaagatggcc aggcgcggtg gctcacgcct gtaatcctag 360 

cactttggga ggcccaggcg ggtggattac rtgaggtcag gagttcaaga ccagcctggc 420 

gaacatggtg aaaccccatc tgtactaaaa anacaaaaat: tagccagtgt cgcggcactt 480 

gcctgtagtc ccagctactc aggaggctga ggcagaagaa tcacttgaac ctgggaggta 540 

gaggttggag tgagcctaga tcaggccact gcactccagc ctgggcgaca gagggagact 600 

ccatctcaaa ataaataaat aaataaataa ataaaacata gaagatgtac agtaaaaaca 660 

cggtaartgt ttttgtttgt ttgrtttgag acagggtctt gttctgtcat gcggactgga 720 

gtgcagtggc accatcaggc tcactgcagc ctcgacctcc ttggctcaag tgctcctccc 780 

acctcagcct cctgagtatc tgggactaca ggtccacgcc accatgcctg gctaatttgt 840 

tctgaatttt agtagagatg gggtctcact gtgttgccca ggcttgttcc agccttctgg 900 

cttcaagtaa tcctcccaca tcagcqtccc aaagtgctaa gattacagat gtcagccact 960 

gcacccagcc agtaatataa ttt^atggga ccaccttcat: atttgctgtc ccttgctgac 1020 

ttacacatct ttatgcaatg catgactgtt accatcatta tcatctctat tttccagatg 1080 

gggaaactga ggcacaaaga atctaacttg cacaagttca tctgcttagt gatggaacaa 1140 

agatgtg^at tcaggcagtc tggcttQaaa gtccacacgc ctaacaacca caccagatta 1200 

ctagattgct tttttctttt; ttcttttttt ttttttgaga tggagtctca ctctgtcacc 1260 

caggctggag tacagtggtg agatctcggc tcactgcaac ctctgccttc tgggttcaag 1320 

caattctcct gcctcagcct ccccagcagc tigcgattaca ggcgcccgcc accacaccca 1380 
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gctaattttt gtatttttag tagagatggg gtttcaccat gttggccagg ctggtctcaa 1440 

actcctgacc tctggtgatc ctcccacctc ggtetcccaa agtgctggga ttacaggcgt 1500 

gagccaccac gcccagccca gactgcttta tttttgtatt tgtatttatt catttactta 1560 

ttttgagaca gggttttgct ctgtagccca ggctgaagtg cagtggtgca atccagctca 1620 

ccacagcctc tactcaccgg ggttcaaagg atcctcctgc ttcagcctct ggagtagctg 1680 

gggccacagg catgcaccac catgcccagc taatttttaa atattttttg gtagaagtag 1740 

ggtctcacta tgttgcccag actggtctca aactzcctagc ctcaagggac ccttctzgcct 1800 

tggcctccca aagtgctgag attacaggca tgagccatgc acccagcccc tttttaaaat 1860 

ttttttgaga gacaagactt tgatctgttg cctaggctgg agtgcagtgg tgagatcata 1920 

gctcactgca gcctcaactc ctgggctcaa gcaccagact ccttttatca cattctatct 1980 

cacacgcgtg tggttccaat cctgcctctg ccacttctca gttgtatgcc ccaacccaac 2040 

ctgtctggct ctgtcctcct taacagaagg acggccctgg ccacgggcca cagccagcaa 2100 

cgcttaagca ccagggccgg cgagtgccct gccgtggcac ggctccagcg tcgcgctctc 2160 

gaattcattt gctttcctta acgagagaag gttccagatg agggctgaac cctcttcgcc 2220 

ccgcccacgg cccctgaacg ctgggggagg agtgcatggg gaggggcggc cctcaaacgg 2280 

gtcattgcca ttaatagaga cctcaaacac cgcctgctaa aaatacccga ctggaggagc 2340 

ataaaagcgc agccgagccc agcgccccgc acttttctga gcagacgtcc agagcagagt 2400 

cagccagcat gaccgagcgc cgcgtcccct tctcgctcct gcggggcccc agctgggacc 2460 

ccttccgcga ctggtacccg catagccgcc tcttcgacca ggccttcggg ctgccccggc 2520 

tgccggagga gtggtcgcag tggttaggcg gcagcagctg gccaggctac gtgcgccccc 2580 

tgccccccgc cgccatcgag agccccgcag tggccgcgcc cgcctacagc cgcgcgctca 2640 

gccggcaact cagcagcggg gtctcggaga tccggcacac tgcggaccgc tggcgcgtgt 2700 

ccctggatgt caaccacttc gccccggacg agctgacggt caagaccaag gatggcgtgg 2760 

tggagatcac cggtgagccc ccctgctcct gcaggggaga ggaggaggct agcagggcgg 2820 

gcagggccgg gggcgtgcgg ttgaaacggg ggtcccgggg gcctggggag ttaaacgttg 2880 

gcccagcacc gggaaaaaca ggactcctga ttcccttgct caggaattgg gagtgcgggt 2940 

cgcttctaag ggcgctttct gctctgtaat cccagcgctt tgggaggccg agacgggagg 3000 

atcgcttgag gccaggagtt caagactagc ctgggcaaca tagcgagacg cgcccccccg 3060 

ccccgacccc gcgccattac aaaaaaaaag caaacaaaaa tttttttaaa gatcatcgat 3120 

gaagagagaa aatgcgcttt tctacagagt ccccttccca cccacagccc catccccaga 3180 

taagcgggga gttccctggc gcggtgccag tttctagccg ctgagtgggc gtgtgcgcgg 3240 

ctccaagtgc gcctgcgtac tgctcactcc ccagctccgc gccctgctcc gttcctccca 3300 

aaactctgaa tcgaagaact ttccggaagt ttctgagagc ccagaccggc gggcacgccc 3360 

ccatccccaa ccccctctgt taatccctac cagcctgcag tcctggctgc rtccaagcag 3420 

gaggtggggc ctctggccta gcggggccga aaggcagtcc cctcccccgc agtctgattt 3480 

ccctcttccc cccaaaggca agcacgagga gcggcaggac gagcatggct acatctcccg 3540 

gtgcttcacg cggaaataca cgtgagtcct ggcgccaggt cggggtgggt gggtggcgtg 3600 

ggggtggggt cagggaagag ggcacaggga cccacccggt gtgtaatgta acgcttgcct 3660 

ttcctctctg cacgtccagg ctgccccccg gtgtggaccc cacccaagtt tcctcctccc 3720 

tgtcccctga gggcacactg accgtggagg cccccatgcc caagctagcc acgcagtcca 3780 

acgagatcac catcccagtc accttcgagt cgcgggccca gcttgggggc ccagaagctg 3840 

caaaatccga tgagactgcc gccaagtaaa gccttagccc ggatgcccac ccctgctgcc 3900 

gccactggct gtgcctcccc cgccacctgt gtgttctttt gatacattta tcttctgttt 3960 

ttctcaaata aagttcaaag caaccacctg tcactggccc aggccctggt gtttgtggaa 4020 

ggaagcctca ggcacctgcc atttgctggc tttcaggagt catctttgct caggcccgtg 4080 

ctgggccatg tgggtacact ggtgtaggtt gctggacaca ggctgactca catccataaa 4140 

gacagaggtc ttagggccgg gcgcagtggc tcatacctac aatcccagca ctttgggggg 4200 

ttgaagcagg aggagt * ^ " 4216 

<210> 21 

<211> 11001 

<212> DNA 

<213> Homo Sapiens 

<400> 21 

ccaagtcaga tgttccccaa ttacctgtgg acaggtcagg catattctga gtctaatttc 60 

actccacagg cctaatacct tggagccaga aagcttccag gtaaaaagtc tgaagggggc 120 

ctcctcatgt cattagatgg actcctgcat ctccagaaga ttttccacac caggaaagat 180 

caaagcacca aggcaattct tcctggcttc ttgggacaac cctaggcttt ggcatgagtq 240 

gtctggaagc ctttgcttta gttacaatgc ctatacactc ctggaactgt tttgcagggc 300 

ttgtcttcca gcacaattcc tcctccaagc cttactgtag ctacagccca tcagtcctgt 360 

ctagtgacaa ccaagaaact aagaactatg tactcacgtt cacctcccca gagtcatttt 420 

ccttcaggac aaagctcagg gccttgtcac tgggccctgc caggagccgc agccgcaggg 480 

gctgctcatc atccaacagc ttccgcaagt acactgtgaa ggggaagtaa tgatcagaqa 540 

cagggccagc tgctcagccc ctgcatgctc aggtgcatgc gtatataccc tcacataggg (500 

cagggtgggg tgggaagccc accttggccg tgacgctcag cgcgctcaaa gagtgcaaac 660 

t_tgcgggg5t_ catccaccac caagaacttt. cgcagcaggg . cctcaatgac ttGacgtg G c- — 
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cttgtgcgtg acagcacatg caggtgcttg acagcatcct tgggcaggta aaaggaagtg 780 

cggcgcctga cacttgtgcc ccgtcctggg ccccgccggg catcctgcaa ggagggtggc 840 

ttcttgctgg agggcacaga gacagggcgc accagcttca gctgaacctt gatgaagcct 900 

gtgtaagaac cgtccttgtt ctaaagaaat agagaaacca aaccttgata ataggttcca 960 

ggtgagatgt cagtctactt ggggctaggc tgggtatgca caaattactg cttgcgccca 1020 

cccaagataa cctcagttgt gaccctctga gtatcaggca catagctggg tacctgctcc 1080 

tccccacgcc cccttcctga gcagtcaact caccaagctc atgaagaggt tgctgttgat 1140 

ctgggcattg tactccttga tcttctgctc aatctcagct tgagaaaggt caggtgtctc 1200 

ccactccaca ggctcgtcct gcaagatggg ccagcatgga cacagggccc ttgaggaacc 1260 

cagggcttct ctgaaaaatg gcctctgggg cagtctttgg aaactgactg cctttggccc 1320 

cctgtccctg atgtacatat acatagctgg tgcccaccct gaacccacca ctgctcctgg 1380 

ttttgcatgc tctgggtgga taagggaaag acagaatcat ttggcttctc tctgctgcct 1440 

gcctagggcc tcagcactga atgtagcctt aaggatacca cagaagcagg ggcaactgaa 1500 

ggcacatggc caggggccag gaacagctga gggactctga agagggactx tcatttaaag 1560 

taaaatcagg ctgggtgtgg tggctcacat ctgcaatccc agcattttgg gaggctaagg 1620 

taggaggatc acttgatcct caggagtttg agaccagctt gggcaacata gcaagacctc 1680 

atctctacta aaaaaagaaa aaaaaaaatt agccaggtgt ggtggtgtgc ctgtagtccc 1740 

aactgttcag gaggctgagg tgggaggatc gtttgagccc gggagattgc agctacagta 1800 

agctattatc gtgtcactgc actccagcct ggggaactga gtgagaccct gcttcaaaac 1860 

acaaaaaaca aaaacaggct gggcacgttg gctcacgcct gtaartctag cactttggga 1920 

ggccgaggcg ggtggatcac ctgaggtcag gagtttgaga ccagcctgac caacatggag 1980 

aaaccccgtc tctactaaaa atacaaaatt agccaggcgt ggtggcacat gcctgtaatc 2040 

ccagctactt aggaggctga ggcaggagaa ttgctcaaac tcgggaggtg gaggttgcag 2100 

tgaactgaga tcgtgccatc gcactccagc ctgggcaaca agagcgaaac tcggtctcaa 2160 

aaaaaaaaaa aatcagtaaa atcacacctc aattgcacat tctgatcaca gcaccctagt 2220 

tgagttggag tgagggtttg tcctggagaa ggcagcccat ttttctcctc tgccccggca 2280 

cggggccatg acccactgca gggtgagagg agtggagagt ggtgcacatc agtagtccag. 2340 

ccaccagtgg acagagtagt acttggagcc agttctccat gtctcacaca tagtgagaaa 2400 

aatcactgtg acatgatgtt taaccttgac ccaagctgca taaaaggcag ctttaggcca 2460 

ggctccaatc tgccagaggt acacaggcag cttcctggtg ggtttctgca cctgcctgtg 2520 

ctgtctggag atttggccca aagatttttt ttttttttga gacgaagcct cactctgtcg 2580 

cccaggctgt agtgcagtgg ctggatcttg gctcactgca agttctgcct cctgggttca 2640 

agcgattctc ctgcctcaga ctcccgagta gctgggacta caggcgcgtg ccacaacaac 2700 

acccggctaa tttttgtatt tttagtagag atgggatttc accacattgg ccaggttggt 2760 

cttgaactcc tgacctcaag tgatccgcct gccttcacct cccaaagtgc tgggattaca 2820 

ggcgtgaacc atcgtacccg acccagagat ttttaactcg accactcact ccccacctca 2880 

tctagggact ggattcttgc cggaagggtg gagtgtggga cagggcagcc agggctctga 2940 

accgactttc ttctcccaga ctcccttggc cccactgcat cagccttact tcctgttgac 3000 

gtcagatagg ccctagttag aatgcgagtg tcacagacac agctaagctc agcgctgacc 3060 

aatactttgt cccagaagaa ttcccacaag gtttcctgta gaatgatctt gtgcctagcc 3120 

caggagagcc agggttctxc ctgactccgc cctggagtcc ccttaagcac ttaaaccatc 3180 

tgatggggac aaatggagag gacagatgag ggagcagggt ggagcgtttt agcagaatgc 3240 

tccttaccca gaacccgctg ctattctgca gccagcaagg atgtggggct aagaactaag 3300 

gccagggcct tacaggaaaa aggtaaaggg ggaggggtgg gaatttaagc tcattttxtt 3360 

ccccaagtat ccaaaggtct cctggatgga gaagagcact ggagtaaaaa ccccagtaca 3420 

aaccttactg gggacagtgg gcaaccttgt cgggttagta aaaacaaatg gtgtgggccc 3480 

tggaaaatga gggctggagg ctgtgaataa agcagtggat gtgtttgttc agtacaccaa 3540 

cgggaagaag tacccagatg ggaggagtac taggggcagg agaaatgcca gacagactct 3600 

agtgccaggg caagaaggaa gatcattttg uttgcagaac agggagggca cagggatggt 3660 

gctaacttgt tcttgtgatg gctctgagct cctacctaac aatgagaaag cttgctcctt 3720 

cttcccttcc tggatgaccc aggagccctg ggctgggatg cagtgacctc atttccagcc 3780 

ccttcccttc tggtgatgaa cctccctatc ttcactcaga aaacagactt ggattagagg 3840 

cactgcacag cccttccagg attctaaagg aggaagagtt: tctttttctg tttccaaagc 3900 

tgcctgctgg aagaggattt caacagccat cccagtcgga tgcacagcag gaccatggaa 3960 

tttcccttct gcaccatagg gacccaccct ccactctacc actgtccata aaaactgatg 4020 

gttttttttt ugagacagag tctcgctctg ttttccaggc tggagtgcag tggtgcgatc 4080 

ttggctcatt gcaatctctg cctcctgggt tcaagcaatt ctctgcttca gcctcccaag 4140 

tagctgggat tacaggtgcc tgccaccaca actggctaat tttttgtatt tttagtcgag 4200 

acggggtttc accattttgg ccaggctggt cttgaactcc tgacctcatg atccacccac 4260 

ctcggctttc caaagtgctg ggattaaagg tgtgagccac tgcacctggc ctaaaactga 4320 

tgtttttttc ttttttttta acatataact tgggacttct cagcctccta ttctttcttt 4380 

tttttttttt ttttttttga gacagagtct tgctctctca tccaggctgg aatgcagtgg 4440 

cccagtctcg actcactgca acctctgtct tctgggttca agtgatactc ctgcctcagc 4500 

ctccccagta gctgggatta caggcacaca ccaccatggc cagataattt ttttgtattt 4560 

tcagtacaga cggggttttg ctatgttggc ctggcaggtc tcgaactctt ggcctcaagt 4620 

gatctgcctg ccttggcctc ccaaaatgct gagattacag gcatgagtca ccaagcccag 4680 

ccttctttct' rttttttgag acagagcctc accctgtcac ccaggttgga gtgcagtggc 4740 

acgatcttgg ctcactgcaa cctttgcctc ccggttgaag tgattcagtc tcccaagtag 4800 
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ctgggactac 
ggtttcacca 
tggcctccca 
ttttgagaca 
actgcaacct 
ggattacagg 
ttttcccacc 
ggggtctaca 
agcaccaccc 
cactttcttt 
agtccctgtt 
atggcatctg 
acagtcctgg 
gctttccccc 
tttcctatct 
ccacattcca 
catacaggag 
tacttctcaa 



agtcacacac 
tgttggccag 
aagcattggc 
gagtctcact 
ctgcctccca 
tgtgcacctc 
tttccacagt 
ctcaactcaa 
attttgtaag 
acttaaataa 
tctttttcct 



gcacacagtt 
gttagtaacc 
ttcctgtggg 
attcccaaat 



gagggagcag 
tttagticata 
ccgtgttcaa 
tggaatcggg 
ccacgtagcc 
ggcggagctc 
tcccggcgcg 
tccgagtcct 
gaaggcgcct 
agccgccaac 
cccgggccag 
cttgtctcat 
acctcatcgc 
gccagcgtcc 
cgggaagtgc 
ggagtacttg 
ctctgtgccg 
ctcgagaatg 
gcgctcacca 
caacagtcca 
gagaggcctg 
ggaagcccca 
ggttacctca 
cggaaggcca 
cttccccttt 
tcacccaggc 
tcaagcgatt 
gcccagctaa 
ctcaaactcc 
ggcttgagcc 
tacttgaagg 
ctactcccct 
ccctctaccc 
ctcagcgatc 
cagcatcatt 
cctaccccct 
acgatgaagt 
agatccacgg 
cactaggatg 
cagcctcact 
cagttactgg 
taaacatgcc 
cgctgcaatg 
actctgtcgc 
ctgtgttcaa 
cactacgcct 
caggctggtc 
gggattacag 
actttgtgct 



gcagctggtg 
tgcagatgga 
ttcctggcga 
gagcattcga 
gctggataat 
cctgtctgtg 
gtcttacgca 
aggagggtgg 
cagcgaccgc 
gcctgcgagc 
cttggctgca 
ccgcctcgcc 
caccgccccg 
agccgcgccg 
tggggcagga 
tccggagctc 
gggcaagcgc 
gcgtgcgcgg 
cggagccggc 
ccgggaaatc 
cctgcactgt 
cgttcgtgtc 
ggcagacgag 
gcggtgaggc 
gggagagggc 
cactgctacg 
caggcagaga 
cctcattggc 
tggagtgcaa 
cttctgcctc 
tttttgtatt 
tgatctcagg 
accgcacccg 
aaaaactaaa 
cagaccaaga 
actccagcca 
tgtgtactac 
tgaagcttgc 
gcagctcctg 
ctgggctcag 
ctgaaacact 
tctaaaagac 
gctacacttg 
agacaaaagc 
caaatcatcc 
tcccagcctt 
caaggctgga 
gcaattctcc 
ggctcatttt 
ttgaactcct 
gcatgagcca 
ccccccagict 
agacctggga 



caccatgccc 
gctgacctcg 
attagaggtg 
ctgtcactca 
ggttcaagtg 
cgtggctagc 
ggaaaatggc 
cccttgcacc 
gaattttgta 
ggatttccgc 
ggtgcctagg 
ctccgataat 
tggatctctt 
taaagtgctc 
ctttctccag 
acaggccaga 
ggagcaggat 
ggactcttcc 
acgcggaaat 
gcccggctca 
acagaaacca 
cggttccgcg 
ggctgcccac 
ttcccctccc 
tagcgaggtt 
gtagccactg 
catagccgta 
gtcgcgtgcg 
caaccgttaa 
acgccggggc 
cactcacaga 
acaagagtgg 
agcctgggtc 
aatccaggct 
ggcaattaga 
ggcctgccca 
ccgctccacc 
cgcgcggcag 
ggaggagctc 
cgctgctcgc 
cggactctaa 
ggcctgctca 
aattaaaaaa 
tggcgcgatt 
agcctcctga 
tttagtagag 
tgatccaccc 
gcccttcact 
cacctttaac 
cactggtctc 
ggcttgcctc 
caaatggaat 
tccctttgac 
ctcagctcct 
tcctgatcac 
gacctaaacc 
gtttcaagtg 
cccagcttcc 
ccaaacttat 
tgcttcttat 
gttctttttt 
gggcagtggc 
cacctcagcc 
tttttatttt 
gacctcaggt 
ccgcgctcgg 
gaatttgtga 
cactatccat 



seq_prot. 
ggctaatttt 
aattcctgat 
tgagccaccg 
ggctggagtg 
atccttctgc 
cctccttttc 
tcaggactga 
caagctcrgg 
aaattttatc 
cccatttctg 
cctggttctc 
ggggatacct 
tttccacctc 
ctgaatgctt 
atccacccta 



acagggaaga 
ctctctctgc 
ctctxcacat 
cgaggtgcta 
ggtctaccac 
agggggcccc 
ggcaggtccc 
cgacccagga 
gcccgccggc 
cgcgcggtga 
ctcgtcgtgc 
cccgcccgtc 
tgcgtgtacg 
gactgaaacg 
ggggacacgc 
ccccacctac 
cctctggccg 
agcctgggcc 
cccctcccag 
acgctccttg 
tcctcgccct 
gcgggttccc 
cggtagtggc 
caggtcgggg 
caggctccgc 
tgttggccac 
acagttggat 
acaacaacaa 
tcggctcacc 
gtacctagga 
acggggtttc 
gcctcggcct 
gggaacgtat 
ctacgtctgc 
tatacactct 
ttcctxcagg 
ccagtgttcc 
tcccaggggc 
ttgcagattc 
tggcctggtc 
ctcagactag 
aacatggcca 
tctttgcagfe 
ctttgatttc 
cttcatggct 
tttttttttt 
acgatctcgg 
tcccgagtag 
ttagtagaca 
gatccacctg 
ccccttgttc 
tgtcctcttg 



txt 
tgtatgttta 
tgcaaatgat 
tacttggctt 
cagtggcacg 
ctcaccctcc 
aattggttag 
ctgacatgaa 
gctaagattt 
tgaagcatca 
ccaggcatac 
tgagcctggt 
aggaggttcc 
tttaggcatt 
atggtccaaa 
gaggaaggga 
ggtgagggct 
ctctcaagtt 
cctcccctag 
gtccaaactg 
aagccataca 
ggcacccagc 
cggccaggac 
cgcggcaacg 
accccctggc 
agtactgctc 
tgctccaggt 
ccccagtcct 
cgtgtcagtg 
tagatcgccg 
acgcttcgcc 
cacagggaac 
gaggcgaggg 
cgggtccgct 
cccccgcgca 
cgcgcggcac 
tcccatacgc 
agcccaggtc 
aggtgaactt 
aaatgtcccg 
aggcccgacc 
ctgggcgtct 
ctctatcgcc 
aaaactgcgt 
gcaacctccg 
ttacaggcgc 
accatgttag 
tccaaagtgc 
atggaataca 
cctgtggttg 
aatccttcgc 
aaactgcccg 
attctccatt 
tacactctcc 
tgactcaact 
tgtctacatt 
atcctccgtg 
aaatttaatt 



aaaaatggcc 
tcccttgtct 
actttatttc 
tttttttgag 
ctcactgcaa 
ctgggattac 
tgaggtttca 
cctccgcctt 
attctttgca 
taccggatga 
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gtagagatag 
ccacctgcct 
ccttttctat 
atcttggctc 
caagtagctg 
tgtcttgtgg 
gacaagccca 
tggcgtgctg 
ctcacaactc 
tgagcttcac 
ggtcacacca 
gagacacctt 
ttataatcta 
acaagacttc 
acagaatctt 
cagctggctc 
ttcctaaaca 
tctccccaag 
ctcggtcggc 
gctgcttttt 
atctaggcgg 
ccgcggggag 
gaccggggag 
tcccacctgg 
gagctccgag 
catttcgaaa 
gcgcgtccgt 
tgcgcgtgcg 
ggatctagct 
cccaggaatg 

gggggcgggt 

cgggaaggtg 
tgcagcgggt 
gaattagcct 
ccaggcagcc 
cctcggcccc 
ccggggcccg 
gcaatctgca 
gagattgaag 
tatctcagtg 
ggaaaccggc 
tagcacagaa 
cttgcttttg 
cctcctgggt 
ccgccaccat 
ccaggctggt 
tgggactaca 
tctgcccatt 
tcacctgtct 
cttcactxtc 
ggacagggtc 
ctcaccccct 
cagttttcct 
tccatatctc 
catctgcccc 
ccagtacctt 
cccttttctt 
actaggctcc 
ctacctctga 
tctttgagaa 
acagagtctc 
cttccgcctc 
aggcacccgc 
ctatgttggc 
ccgaagtgct 
t.1:ctgt:caca 

gaigggtctcc 

cajgagcagtg _ 



4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
,8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8700 
8820 
-8880 



atgctcaacc 
tagggtccac 
ggactgtaaa 
gcttgctggc 
ttagcatctg 
cctgcttgca 
taatttccct 
ctgggaagat 
ttttaaagat 
atctgtggcc 
agcgatgggg 
ccgcgacttg 
gcgcggggcc 
tcgccacaga 
cggccaggga 
ttggcacgct 
agctcaatga 
gggcgctagc 
cccaggatcc 
gtgcagggac 
cttgcccttc 
cggcgaccgc 
tagaggaaga 
acacttggcc 
gagggagatc 
ccggcagagg 
gcatgggtgg 
caggacacaa 
tgggtattga 
ttctaggata 
aagcctctgc 
ccagcctgta 
ggtgggggca 

ccacaggcct 
ctggagctgg 
ccagatttct 



cagactccat 
tcccagcagt 
agcacccctg 
gattcccagg 
attcactggg 
gggggcccac 
tcaccctaaa 
ggactccgag 
aagaaactta 
cagatacgag 
cgaaagtaac 
acccgcggcg 
actactcacg 
ggtcgcacca 
ccagctgccg 
ccagccgggt 
gctcaggctc 
aagcgcgggc 
agactgggcg 
tgtggggccc 
cttccctcct 
aaagcaccac 
gggtccccac 
tacccactgg 
ggtcagcagc 
ctggaccgtg 
tcggccctca 
gtctttxcat 
ggatggcaat 
ccctcacctg 
actgcagtaa 
ggtctcagcc 
ccccacccac 
taccttcgcg 
tgcttggcaa 
gggatctagg 



gttacaataa 
tgattcaaca 
gtgattccag 
ggtgcagaag 
tcagaaaagg 
caaaggttct 
ggttctggag 
gacagtaggc 
aaatcagcgt 
tggagtgcga 
ggacctagtc 
actgcgctgc 
cgcgcactgc 
cgtgtgcgtg 
tgtggggttg 
gcggcccttc 
ccccgacatg 
cgggcggggc 
gcgggaagga 
gaaggcgggg 
tcgtcccctc 
gcggagatac 
atccggccct 
gtggggcagg 
tttaccgccc 
atcttgaggt 
gcaactgcag 
gcttttccca 
gcatgtgggg 
cagcaatacc 
gcacagcggg 
cacctggggg 
atccatgcct 
cctgcagccg 
ttgcctgcca 
a 



seq_prot . 
tttggggagt 
ggtctgcggt 
ctggtgtcta 
gactgctggg 
gtgtttgcta 
aaatttttcc 
ggggtcatga 
acaaaccctt 
atttttacat 
caagggataa 
ctcgggagct 
cccttggctg 
aggcctttgc 
gcgggccccg 
cacgcggtgc 
ccagcgcgcc 
gcccggttgg 
cacagggcgg 
gctgaggaga 
ctgggcgcgc 
ctcacacccc 
ccgtgtttct 
ggccctcctg 
aagtctcgag 
gctctgctct 
tcaggggtgc 
ccctcatttg 
gtgcttgact: 
gatgtgggag 
actcattctg 
gccgctctgg 
aaagtcagga 
cctgcatccc 
caggtcctgc 
cttgcctctg 



txt 
ttttaaaatt 
gggatccagg 
cccaggggag 
tgtgtggctg 
aataaagact 
aggctccctc 
gtgtttgaga 
tctcaagaag 
ataagcagcc 
accattttcg 
gtccccgccg 
ccccttccgc 
gcacgacgcc 
cgggctggaa 
cccgcgcgat 
cagcgggtgc 
gcccgtgctt 
gccccgactt 
gccgcgcaat 
tctcgcagag 
accccggacg 
ggaggccagc 
gtccggtttg 
ccttcacttg 
ccactgcgga 
attctgggtg 
gctctgtcac 
tggcactccc 
tagggcttag 
gcatcgtgag 
agccactgcc 
aggtctgact 
ctccaccctc 
tctgaggggc 
ttttctcgct 



tactgatgcc 
ctagcgggga 
agcaaccttt 
cgtgcatatt 
caacaaaact 
ccataggtgg 
agaggcaagc 
ggccaaggca 
acctctgctc 
cgcactcttc 
accccctctg 
tctcgtaggc 
ccagangaag 
gcggtggcca 
gcgcagcgcg 
cagctcccgc 
cgctggcttt 
cagcgcctcc 
ggaaacctgg 
ccccccccgc 
gccacaacga 
tttactgtgc 
ctgaagcaac 
gggtgaggag 
gactggggct 
gattcccttg 
cctgggctgc 
tgcaggcagg 
aggtccaagg 
cagcgcttag 
tctagcacat 
ggccctggaa 
cctgccattt 
tgaacacatg 
ccagccgctc 



<210> 22 
<211> 4448 
<212> DNA 

<213> Homo Sapiens 



<400> 22 

cttatctctt 
tctcttttat 
atgtaggatg 
gtctcagtct 
aagaatgcac 
atccatagag 
actacatgta 
tgaataactg 
tttttataat 
agataaaact 
tgctgagagg 
aagtttccag 
tttatgttac 
tgttaagctc 
cacttacaac 
cacagaaaag 
ttttaattct 
aaataatcaa 
taccaaattt 
gaatttttgt 
gtcacttgct 
tgctcttttt 
ttragattcat 
cctctggagt: 



ttatgtgttt: 
tttactgtct 
tgaactccta 
caaactcaat 
aactgaacaa 
gtgatcaaga 
ctcatcaaac 
aacaaaaact 
ttatacataa 
cactgaattt 
gttgggaagg 
atggttttct 
aacatttgtt 
tctgaaagaa 
gggcagactt 
t:tttacatca 
taataggata 
gtaagcagag 
ctcgaggcaa 
rtgtttgttt 
gtctgggcaa 
ccttagtcaa 
gagcaatgcc 
cctcagttcc 



ctcttgctca 
ccgaggctca 
aatcagcatt 
atattaaaag 
aaccctgacc 
tagaaatgtg 
catgccagtt 
tacaatttaa 
tctataatta 
ctatttccat 
aagaatggga 
tagtgtcatc 
tgatgcctta 
acaaaattgc 
tgcaggggcc 
gctgaaaaga 
aaggaaacat 
ttgcaagtat 
ggcagacata 
tatattgttt 
tgttartttt 
catgctgtgc 
tctgtcecca 
ctgcaaagtg 



gccttgcuca 
tctcacttct 
caagatgtat 
ttgtatcctt 
atcatcctat 



cttggtgtct 
tggcctttaa 
atttggatac 
tatgtccacc 
attttctctc 



tgccatcttc tattcattcc 
ctacctcctt aataactgta 
ctctcttaac acttatctac 



tttttatata 
ccttgccaat 
gaaaggcaga 
tctrgcaacct 
tttaaattgc 
cttgtcatiag 
cacacctttc 
aaaatgcata 
gcacattata 
tggctaaaat 
catctggacc 
tgttattttt 
tgccataacc 
ttccagcccc 
acaggctaag 
aggaagctag 



attggcacag 
gatacacaca 
agttgacagt 
catcaaataa 
tttactggac 
ctagacctca 
aaggtggttt 
actcattttg 
aatcaatgct 
gaattttgga 
tgaatatctg 
aaaagtaggt 
acaaattgaa 
aaacatcttt 
acattaggca 
actaagtaat 



ttgatattct 
ttatttattt 
tcatttcaat 
accagcccca 
tttctgagct 
actgtcatca 
gcttaacacc 
ctcaatttta 
tgataattcc 
cacaatgtaa 
taaaaaaaaa 
gggtccaaaa 
attttccatc 
tgatcctata 
gggaaaatga 
gtaattticag 
ctgtgttaac 
tatttttiagc 
cattattagg 
gccaaattag 
gaaaattgac 
gagaagttgt 
ggtccctgca 
cgttaggctc 



8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11001 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
,1260 
1320 
,1380 
1440 
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seq_prot.txt 

ccttccagac cgaccaatct gatggtatta gatgcaattg ctcctgaatt agggcatgaa 1500 
atgaattcag ctttggtgca ccaatgtgat gactctgctt catcaaagcc tgagcacgcg 1560 
ataggcctag caccatctca cacagagaca aagggcaacc ctctgcttcc aaaggaatga 1620 
cacaacctgt ttctgaagtg attcacatca tctttacttt tgaacaaccc aatgctcaga 1680 
aaacaatcta agaatcttcg ctgactttag ggatgtaaga tacggtttct tgacagtatt 1740 
tgggattgtg gaaaaaagca attgaggaaa gggcatctcc acaacgcaat attgaattta 1800 
gtgcccaagg tccatcacag gaatccctaa cgatccctac aatctctctc tctctct^tt i860 
ttttctctct ttaccctgaa aataaactga gaagttagta ttgggataac tattcccctt 1920 
gacccaaata aaaagtcctg ggcaaacaca ggtacaaatt gccaaatgga aaaagttctt 1980 
ctccatcttc agctagaggg aggctgggga tcccagctct ttagaagccg gcccgtggac 2040 

gcccagagaa tcccttcgga gaccaggtca gggtcactga gcttgcccag cagggcgccg 2100 

cctcggacgc cgccccgcct ctcaccttgc cagcgccgcg ctcgggccgc gaaggtgcgt 2160 

gc S§S g Si cg gtgattggcg gcggcccgga gctgcccggc tgccattggc tgcccggccc 2220 

cctttgttcc cgggtccggg ccgcaggccc gctgcggcgg actgggcggc ggaagtrcga 2280 

cggcgccggg cgagtggctg ttgagcggcg ccgcgggagt tccgcaggtt tcccgtgttc 2340 

gcagcggagc cggaggccag ctgaacccgg ccgtgggatc ccggatagga ggaggagggg 2400 

acccatagga cgcgttaaca tggacctgga aaacaaagtg aagaaggtag gggggcgctc 2460 

IJSSSSSS^. 9 c 99ctgctt cacctgcgcg ggtcgcgcgg cgcgcggcgg ccggaggtgc 2520 

cgaggtgggt ggggctcgcg ggccccccgg tgtgagcccg gctctgggtc tgcggtgccc 2580 

cgggcccagg gacctggccc cctgggtaga ggaggtgctc ggcggccccg ccagctcccc 2640 

acactcggga gcgacagaat tggaagcgcg agcgagggcg ggcgcgggac tcttctctcc 2700 

agtctcacgg aatcccagtg gttgtaggtg ttggaaactt tacttaagat gtttcagctc 2760 

tgcctgtgcc tcctcaaaag gaaaggacga gcttagggcg agtgcgggcg agaccgtgac 2820 

a "^f^ tgg 5 tcaggaagtt gaatttcatt aagcctttgt ggtttggggc tctgctgtgc 2880 

£"9 a " g " ctgatctcct cccttccggc tgggctgtct ggggcgctct aaaatgagtg 2940 

ttgatttaat gcactgcctt cgcacccgtg ctggtgataa cttctaatgt gatttttttt 3000 

tcggtcattc attacttgtc taccgaccga ggcagtgcct ccctcttagg gaattatctg 3060 

^ a ^S^ cg ca «ctcctt gtttggcagg tagtcccaaa gggcggttgt caccagcgtc 3120 

tcagcgcagg gagacatttt cagcttggct gctgcctcgc aggcactgga accgacggcc 3180 

" a f ctgag ^ cactcccggg ttatgcggag cctggtgtcg cctgggagaa aagggtgggg 3240 

g^tcagaag gaatgctgat gtcacattcg agtgaattgc cctgtgcgai 3300 

ag "ctcctc 9 t1: S ca 99gg atttgctcat gttttacttt gtggtaaatt taagctagct 3360 

2i5f53H35 aaatagagaa gttttgcaga gagaggaagg gaaaaataca gaaaacagtt 3420 

tttgtttttc tcatactcat ggttcctaat attttatctt ctaaacacaa agtttttgac 3480 

^i gag ^ atta attctatttt atgcccagaa ctggattttt gaaaatggat acaaaatatt 3540 

iH~ ag .2 a S 5S5f gagttt tcggttagga agctattgac caataaaaac tcgaaatgag 3600 

aaaacagttt ttttcattcc ttgcgactaa aattctcagg cactcaaaaa ggtatccaag 3660 

aa £ agaaaaa 5 a H aa H gt taatggtagc gtgattgcat tctattctct tcccatttgg 3720 

tttctttctt tttttctttt aaataactgt aatatttttg gttaatcttg cttgtcatqq 3780 

gggaaaag ^ g gggaaagcat tttgcccgct tagctttctg ttgatgaata ctactctcat 3840 

H ag H£ ca jgtaggtaag agctacttaa tttagtactt gtctaaaact tggggggtct 3900 

+&fZESE tttaagtatt tatttgaaga caactttgca aaagtaaaca tattttgggc 3960 

tgccgaggat gaacaggtga agcacaggat ttttagggta atgacactat tctgtatgat 4020 

actataaaat ggctgataca tgtaattata cattaatcaa aatccataga atgtagaaca 4080 

!:S aagag }: aa accctaatgt aaactatgta tgttggggat aatgatgtgt caatgtgttt 4140 

catcacttgt aacaaatgta ctgcactgat acgggttgtt ggtagtccgg gaggctgctq 4200 

gagacaggaa ggcatggggg aggtacatgg gaacttcgaa ttttctgctc aattttgctg 4260 

^ gaag ^* aaa aaa H5 ttta aaaataaagt ctgtttaata gggggagaaa aatcaaagac 4320 

?«HH2 aaa £ tt1: ttgaga caaggagtaa ttctgaggga agaagttcaa acttaaaaat 4380 

§3 3 £ g gcaaaggaa aaatcc agac gttgtttata aacaatttcc ttagagtttg 4440 

aaaargtg - 4448 

<210> 23 

<211> 4408 

<212> DNA 

<213> Homo Sapiens 

<400> 23 

agcacacaaa aagcactaca cggctaacgg aggacgagga accatggcaa 60 

agcaggcagg caagccctaa gaaataaaac aatttgctaa aaaataattt ctgatgacta 120 

55 g 5 aa S a Si gaaagtgcag gaaaaataca gttcgaataa tcccagatcc tttcacattt 180 

cccccctttt catacacttt gttaccccat aacaaaatct ttaatggaaa gtttaaaaat 240 

aaa f agcaca ggaacatgtg ttttaaatga actaaattgt gaaattagcc agtaaattaa 300 

H5HH 9 i a Qtaattattt aaggaaatta aaatactgct cagttcagtt ctgtatttta 360 

»»f^?3 g c gj tcttta caaccaatta atataagtgc tttaggaaca tttgaagaca 420 

aa ^ 9 !r" a acttaaggaa caaagcacct aaataattta agtgtaattt tgctgagtta 480 

aagtaaaaca ttccacaaat gaagtggcta tttaattttt tagggaaagt ttggttattg 540 



seq_prot.txt 

aaatgttgta tgctcatgtt acatcaacaa aaatcttcaa tttattttgc ttatgtgctt 600 

tgttttcttg atattattgg tatttgaatt ttagatggat ttctgccaaa atgatatttt 660 

gtgtgataaa agcatcttta gttttgattg atagactaaa acaaatgcaa ggaaatttct 720 

ttaaatcaga ttaatttttc ataaaaatat tttagaatgt atgaattctg atatttacat 780 

ttataatggt aaaagttttt tccgtttagt ttagtaagac aatactcaca caaaagagta 840 

aaaaaaaatc acaccacctt atgatagttt gatttctaaa ttgcttaaga aagtaaagtg 900 

gttaaactgg aaaagaggaa catatttcgg aggtttagaa tcgaaaattt ttttcttaat 960 

ctccagctgg aaaataattc tctgcatcca tttaaagtgt atctcctgaa gtgccagatt 1020 

ggagttgact ggtgatcaat ttaaaggagt tacaatccaa agaaatggtg agagcttggc 1080 

atccaggcct ggctcccagg taattcgctt gggcctgaga ggtcactaac tgccagttaa 1140 

gatggaatct ttttcttttc ttttttttcc caatggataa caatgggaag ggggctaatc 1200 

ttccagtagc tgaaactttg tacccagccc tttatcttga gaatgctaat ccttggcccg 1260 

aggatttgtt cctgcagtgt tggcaccgag atttaaggga agatacctcg ttttaaatgc 1320 

cagccacggt ctggcttccc tctcgacttc agcaccctgt agattgttag tgtctgtggc 1380 

gggggacgaa aggaacaggg ctttgcaagg tctgtttgcc gactgcgtta ccttgggcga 1440 

aacttagccc caaaagccac aaatcaccta cggtgaagat tctccgaagt ggaacaaatt 1500 

tccagactcg cattatctca catccctgcg ggatagatgg cctccactta ccggctaccg 1560 

ggagagagct gctgtctccg cgtcccactg cttcccgggg cgatttccag cgagccgagc 1620 

ctccggctgc acggcaagcg cccgaaagcc gggcctgaga ggactgcagg gctcctgagg 1680 

gtgccaagtt ccgaaggagt ccacgggtgc actggggcct ccgaaatcta gccgccactg 1740 

gcagtttctt tctgctcctc tccagctrttc tcgctcggtc tcgcactctc tctcctctcc 1800 

ctccctctca tccctctctc ttccctctgc tcctactccg tgtggggagt gacgtgacgt 1860 

cagcagagat tccaccaaac tccactgcac agtggcgcgc gggcggccgg ccgagcccgg 1920 

ctgcgcggct ggcgatccag gagcgagcac agcgcccggg cgagcgccgg ggggagcgag 1980 

caggggcgac gagaaacgag gcaggggagg gaagcagatg ccagcgggcc gaagagtcgg 2040 

gagccggagc cgggagagcg aaaggagagg ggacctggcg gggcacttag gagccaaccg 2100 

aggagcagga gcacggactc ccactgtgga aaggaggacc agaagggagg atgggatgga 2160 

agagaagaaa aagcaatctg cgccaacccg gcagccctaa taaatcaaag ggggagcgcc 2220 

agggcagcgg ggagacagaa acgtactttt ggggagcaaa tcaggacggg ctgggaggaa 2280 

gcgacaggga aagtggccca agagacggaa caaaggacaa tgttcatggg gttgtttggg 2340 

acgaggcgtg tggagtgtgg gtgtgagcgt gcgtgtgtga ccttctttca ggcctgcaga 2400 

gttgaggaaa gaggtcacag caaagaggga ctrgcggaggg aggaaagtga gagaccggta 2460 

gagggcggga gtggaggtgg gcgcggtggg gatgggagag gatgagtgaa gagaaatcta 2520 

gaagaatgga gtgagctagt gggagagggt gggagggcca cagccgggag cgaacgagct 2580 

aggcttgtca gctggggaag gccgggacgc tgggcccagc ttagctggga caccgcgccc 2640 

gaggtcaagg cgggtggacc aggcatgctg agagtgtcgg cgcacaggtg ggcacggcca 2700 

cgcactgacc cagtgttcac gaagggtttg cactggacaa ggctcagacg ctcatagagt 2760 

ctagaatttc ctctgctgta cctacattca acaagttcac cctgggtcac ggatatctca 2820 

ttttttaaaa tgacgaggtt aaggttcctg gcgaggatgg tattaaattg cacgggatag 2880 

aagtgggggt gggggagaga gtttccctca agtccacatt tgctcctgca aagcaaagag 2940 

tatgtgaaat tacagggcat attctcactc gaaaagtgtg ccttacttct gaaccctgat 3000 

tttctgattt cttgacttga gcaaagatgt gtattttggt agtgagcaga atattttggc 3060 

tctgtcctgc ctctgagtgg aaggactata aatataattc gcctggagga ccaggtgtga 3120 

aggcttctgc caggcatatg ggacaatgtt ttttcaatct caagggcatc ctgttaatgt 3180 

atgtttttgg aaagtgccgg aacacagcca ttgctcctgg attcggattt tcccaccaat 3240 

attaattcct gcttgagagc aaaactcagg cccgctatta aaaagacatc tctttggtcc 3300 

ctaattgaga ataaagttcc ctctaaaagt t:gt:attgctc ttcctaaatc aatataccaa 3360 

tactcgcaat tttagaaata tatagtgact cgggagaatg tgcataaaat agatacgttt 3420 

aaaaaagctt ggcgcttaaa actaacccta gtcactatat aggtgctggg ctttccctac 3480 

ttttgggggc tgtctggaac atgttatgtg ttttcttgaa ttactccgtg ttttgaattc 3540 

atttgagtta gcagtaaaaa caggcaaaca aacttgctca atttgttttg agtgctaaat 3600 

cccttcactt tgaaatagct aacagtcgac agatggactc attttatgga aagggttagc 3660 

ctcttcagcc acgaagaaaa ctgattagag atctacattt taagccattt ctaacctcca 3720 

cgtaacatcc gtgaaaactc aaactttctc tctttaccca gtggaaactc aaagcagtgt 3780 

tatttaaggg gagagaaatg agggggaaaa tgcccacgtg ctgtttaatt gtatttcctc 3840 

tctgactctg agaatttcta tttctggttt ttgaaatctc gccgaggcaa gaaaatcaaa 3900 

tttcttcaac aagtcccaca actgaactct agttacagga caccggaaag tgcagtccga 3960 

gaaagacatc ttcacctctg cccatcgacg atttttgcag cctccccatt cctctgagta 4020 

atgggctaat aatcctccct ttttttcttt tcattttgta gagattaaga ggcgctcgta 4080 

gcagaacggc cttgccttca gctggtggcg aggataggca atctcatgga aaagttggaa 4140 

gagaatgaga aaaccaaaga cagaaagatt cagagatccg cggagagaca cagggagagg 4200 

gaagggagtt gcgctgaaaa gacgcaaaga tacgcgcgtg caactccctc ccctttcagg 4260 

tttcagaggt ttgcaaacca gggctgagag gaaggggctc gggaagctca cgttcctctc 4320 

gccccccttc tgtctggagt ctcgcccgcc agaggctggt taaccccagt cccggccgcc 4380 

gcagacactg cgctgagctt ttgggtcc 4408 

<210> 24 ' " - - i 

<21l> 4435 : ; " - , ■ . ,f ■ 
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<212> dna 

<213> Homo sapiens 

<400> 24 

gttcccttcc tggccgccac ggtccacgtc accaactcac agccacctcc ccocctacca 60 

tggttccctg ttcccggtcc atgaaccctc gggctgccca gattggggct gggggnggct 120 

gaattggggg cctaaagcca ttcacatgaa actgtatgca cccccgccag tctggggaag 180 

ggagctgatc ccctggagac ccagaaccta gaagagagat ccccagcaag cagtgggggt 240 

ggaacctgtg cccctagtgc ccccaaaccc ttcctgagtc ctgcgacgtc ccccaacccc 300 

cgcccgcaat ctctctcctt cgccagccct ggtgggcctc tcaccggcgc acgagcggga 360 

cttccagatt ttgagcagta gcaagatgat ggcgaggagg tgggagaggt ctcccaggaa 420 

tcggaagaga ttcatggctg gggaaccctg gcagggctga gcgggaggga ggcaggctgg 480 

cgggggggtg ccccccgagg ctgctggtct gaacgggtag ctgggctggg gggacggaag 540 

agggacccta ggtgcgctcc gctccgggga ggggactttg ggagggggag caaaggctgg 600 

agctggcggc ggagctggag ccgggaagag ggaggagagc gagaggggga ggagtccggg 660 

aggagaggct ccgcccccga gggcggggcc tggatccccg gcgcccccta tcgcccacct 720 

cctgctttgc cgtggcggcg ctaaacgcac ccccaaagct gccccatagg aacttgggag 780 

ggtcgcaggg ttcggaaagt gccaaatcca gcaccggttc gcccaggcag tctgtgtctc 840 

tggaagagac gcagtccagt acaccagcct cagcccttgc agggatgtag agactgccct 900 

ctgagcttga aaaatcctgc agggacccgg aggctcagct gctagctttt gcagctttca 960 

ggcactctaa ctccaatccc ccagaagaca agaaagatac cccacctact tcctccccta 1020 

gacccaggac tcaaggcccc agcccgttcg tcaaacccag aagtctgggt ccccagcccc 1080 

tcctccctca gacccaggag tccagacccc cagcctctxc tccctcagac ccagaagtcc 1140 

agacccccag cccctcctcc ctcagatcca ggagtccagg cccagctcct cctcgctcag 1200 

acccaggagt ccaggccccc agcctcctcc ctcagaccca ggagtccagg cccccagcct 1260 

cctccctcag acccaggact ccaggccccc agaccctcct cccccagatc caggagtcca 1320 

ggcccccagc ccttcctccc tcagacccag gagtccagac ccccagcacc tcctccctca 1380 

gacccaggag tccagatccc cagcccctcc tccctcagac ccaggggtcc agggccccag 1440 

cccctcctcc ctcagaccca ggggtccagg gccccagccc ctcctccctc agacccaggg 1500 

gtccagggcc ccagcccctc ctccctcaga cccaggagtt cagggcccca gcccttttcc 1560 

ctcctaggac gctgttcctt ggaacttagg gtcccacccc caccatctta tggatcaaac 1620 

atcctaacct taagaatcta gatctacagt ttctcccttt acgacccaca gatttaggcc 1680 

ctgattctct tctttttcag gaatgtgcac ctcaccctgt tctcccagac cttgaggatg 1740 

aaggaaacag gagcctcacc caggaggctc aaggccaaaa ctctgaccca aactacctca 1800 

ggagcccctg gccctggctt cccccctgct ccagagtttc tgccctgccc acacacacac 1860 

accctcttcc accctcagag gccccggtgt cctgccccac gctctacccc agagccccac 1920 

gggtggcttt ataaaagtgc cgggcccagc cctctagcag gaggggaatg ctgggcatct 1980 

gggtgtggga cccccgggga acagcctgtg gtctggactc ctgcatctat gaggggacag 2040 

acgtggcttc ccttccggat gatggggtac ccacagatga tggaggccag ggtccctcaa 2100 

taaaagaagg ggtgcaggcg tgttgatttc ttcagagggc tggaaggacg gggtgcccaa 2160 

gggtgacatc cacgagtctt gggtccctga gggtggcttg tacgggggag agtcgggatg 2220 

actgagtcct taaaagagac tccgacttgg aggcggtccc caaattcctg ggtcccagta 2280 

gagaagggga ctcctgggtc tgagggagga ggggctgggg ggtgggactc ctgggaccag 2340 

ggtcgagacc tggttttcag gcctggcctt ctggggcaat aaaagccaca gcttgggtct 2400 

tagtgtggcg aacactgaag tcagggaaag gcctcctgtt tccagagcct caaggcaggg 2460 

cgggggcaga gggcagcaac ccccagccct ggagtctagc tctgaagctg gtgtctccat 2520 

accgggttct gagtccctgc ctgcctgtcc ccagcctgac ttcttcctcc ctttttattt 2580 

tcagcccctc actcccttgt cccaggagga aggcagaggc tggtagctag gggtgggggg 2640 

cggccccctc cccaagcctg gcaggagaag gggtccccag ggaggccagg agggggggct 2700 

gtgggtctcc cggcagtggc agacggggac tgaatgttaa tcgcatcccg agtgagtgtg 2760 

tgtgtgcgag aacacagcga gtgtgtgagt ccctcccgct ccagctcctc caagccgcgg 2820 

ccgccgccgc caccctcgcc cgcagcctcc cgcagcctcc ctcggccacc ggtgcctgct 2880 

gggggtgttg cctgggtagg tcggcccggc ccccaggggt ctctcgagcg tctgccatct 2940 

gcccggtgag gatctgtgtg tccgggtgtc tggggctggc tggtggaggg ggggtgtgtc 3000 

tgtaagcgct gcggcggcgg agggagggag gggtctgtct gtctgtaccg accctgagct 3060 

gcctgcctgg gtgtcgtggg gctcccgctc cttccccccg gcccccccaa acccagatgg 3120 

atggtgtgta cctgggttct gtgcctcctg cctgcgtccg gccaggcgtc tgggcgtccc 3180 

cggctgcctg tgtcctcctg tctgtccaaa cagcccctat cagcagtggc agcctggccc 3240 

ccattagacc ccccactctg tgtgtgtgtg tctgtgtgtg tgtccctcct caagctctgg 3300 

gggtgttgag ggggaatccc agggaagtga gatcgtgcgt gtgtgcgtga gtgtatgtgt 3360 

gtttctgcct gtgtttgaga gtgggggagt caaggggggg tctagaggtg gccaagcgag 3420 

gaaggggcaa gcagttcccc aagcaggcaa tctcccgctc ctcacacgca cacaccagcc 3480 

actagcttca gaggtgaccc agacagacag atagacacag acgctggaag ggggggtggg 3540 

ggggctgagg gcacaaagcg ggggtgcgag tgagccaggg agaggcggga ctggacacat 3600 

ggaaaggggg gaggagccgg ggctgaagcg gcagaggggg gcaccccggg tgggcggagg 3660 

ggggatcccc acggggtcgg ggcggcaaga ggacaccccg acagcctctg caatgtccgg 3720 

ggcccaactt ccagagcaac atgtgtagcc acgtcctcgc ctagtccagg tggccgcaac 3780 

* ' 9 ' ' " ' :t 1 Seize 32 ... i . * ' 



cttgggggag 
acagacagga 
ctgccccccg 
gttccggtcc 
ggggacaggg 
agttggcaag 
cacctagctc 
ccctttttcc 
ctccctctcc 
ctcaccttct 
ccctctctcc 



agacagggca 
ggtccgggct 
acatcggctc 
tggccccccc 
gatgcaggga 
aagcctcgtg 
ccctgtccct 
tgagtcccta 
ccatccccac 
ccctgtaccc 
tctcgctctc 



ggacaggacc 
gccgctgctg 
tctgagccct 
gccatccccc 
tggtggtggg 
gagggagggg 
cagggactct 
cacctcagac 
atacctgatg 
ccctccctgc 
ccctcccatg 



seq_prot . 
aaggaagagg 
ccgccaccac 
cctcggaatc 
caacagaaca 
aatgtggacc 
gtttgaacgg 
ccticcaccct 



txt 
aaggagagac 
cactgccgcc 
ttggggtcgc 
gggtcatgaa 
cccaaatcta 
tgggcagggg 
ctctctctcc 



ctttcacgcc cccatttctc 
ccccatatcc ctgtgcctcc 
atcagtcccc ccatcgcgag 
tcagagctat gagtctgcta 



ggagccaggg 
gccccggggc 
tggacgccgg 
aaggtaaggc 
ggacagagga 
tcttgacccc 
ctctctgagc 
tgacacttgg 
atctctctat 
ccggtcctct 
ttaat 



<210> 25 

<211> 6001 

<212> DNA 

<213> Homo Sapiens 



<400> 25 

gggcacagtg 
tgcccagggg 
tgacttgtgt 
aggggatgtc 
cctagtggga 
cagtggaaac 
ttctcccaac 
aggatgaagg 
tataataact 
tttcligggct 
gaacgcaggg 
tccagagcag 
acaaaagcca 
atattcctgg 
aggttaaatt 
tcaaaggaac 
ggtctccact 
ccctgcgcct 
gtatcgcgtt 
gcaacctgag 
atcagcgact 
cctgcgtcct 
tacaaagaac 

ggggtcctct 

tagattagtg 
tggagaggtg 
agaggagctg 
ggcgaaggca 
gcagccgagc 
tgggggttga 
accaatgggg 
ggccaatggg 
tctcaccgca 
accgccctgg 
ccgccaagct 

gggagcacgc 
gccttgagtc 
gcgggcttcg 
cccctggccc 
cccgccggtg 

ggctggggcg 
cggcgagccc 
acgggaatga 
tgcaaactta 
gccagcactt 
tcccctcctt 
ctttttttaa 
ggctgcttaa 
ctcccgggcc 



tgaggaagaa 
cgggaggaag 
ccctgaaata 
ttagcagatg 
tacagaagta 
aactcagcca 
ccactacatg 
ctgactccct 
gagttattta 
gagatttatt 
gccttigttca 
tgcctctgct 
aagtcctgtc 
cgcatcaatt 
atggtagaga 
ttctttcttg 
ggagttcttt 
ggcctggcta 
cttgttaata 
agctgtcgca 
ttaaaggact 
tggccacctc 
cgtatccacc 
ctctccactg 
actggtggtt 
ggtggatggg 
aggtcctctg 
ggttggcagg 
cggctcggct 
aacagtaaac 
atgtgagctc 
gcgccgactc 
gcggccagga 
ccgccgcacc 
gccgcactca 
acgcgggtgg 
ccctaggcgc 
ggaagaatgc 
tgactgctgg 
ccacgcacct 
gggcgaaaaa 
gggcagtgtg 
accgatgaca 
aaggtccgcg 
aggaacactg 
gtcccccata 
cctgcttgtc 
cccaattccc 
acaatcagat 



acatggaaaa 
acaaacagag 
ccatcttgga 
agacaatagc 
gtaggttgct 
tatcttttgg 
ggactgtgta 
tagtctccag 
cgtaaaataa 
ctgtcttgaa 
ttttgccttt 
ctcagcccca 
cagttacaag 
ttactcagac 
atgcaaaatt 
cagggtaggg 
tgagaaggac 
ggggccacct 
tctcctcacg 
ggtttccagc 
tgcctggcgg 
cgttctttaa 
cacccccggc 
gggtatctga 
ctggaacaaa 
ggctacaaat 
gggttggagg 
ggcagcctct 
ggctggcgca 
gcgaggagga 
cttcgcgcga 
ggcgcaggaa 
acgccagccg 
atggacgccc 
gtaagtcccc 
gtttgcatcc 
tttgcatcag 
agttggtgag 
tgcgaggcag 
gcagacgccc 
gaaaaagctc 
gagcggatgc 
ggttttgcat 
tgtgcgtgcc 
cttttcagaa 
gaacctaccc 
ccctaaaggc 
trgttcatttc 
cttgctggtg 



gacagatttt ctctagactg 
gtcagtgggt cctgaggctg 
aactgcagga cccccgggag 
caccgccacc 
catcaaccca 



gaacagagga 
tagccctggt 
tagataagct 
tggatatgtg 
tgtcgcggtt: 
ttctggaaac 
taccgcccac 
ccttctctaa 
agggaaaata 
cacagtttaa 
ggagggtatt 
acttctgtgg 
tcttctggtt 
tttctaaacl: 
tcactcgttt 
gcaticcacgc 
tcctgcagga 
ccttcttcat 
cgcccgtggg 
tacatcacag 
ctgctcggca 
actggaaccg 
tcccgccgcc 
atctcgcgcg 
gcaactgctt 
accaattagc 
caaggcgggg 
ttcacgcgtt: 
ccaggcaggt 
gcgagcgggc 
ctgcgtgtgg 
cgtgcacagc 
gaagctcggc 
tgcacgactc 
gggctgtgcc 
tgatctctgc 
atgaatggac 
atgcagtgga 
cccctacgcc 
atgtaaattt 
tttcctaaag 
gttcgagtat 
gttctggagt 



ccaccccaaa 
agcagccaca 
ccaaatggat 
gttaggaact 
gctaggggca 
atgcttcttg 
ttcttaatga 
tttgtcctxc 
attctcatta 
acggggcagt 
tttttacatt 
aaatgatgaa 
ctgtttcaga 
aaaagtttga 
tgggctgcag 
cacagcttgt 
cccaaaggag 
ttcccaggct 
actcaggacc 
cctgcgcttc 
gggtgcttag 
cccaaactgg 
actgcccttt 
gccagattgc 
cacaatcctc 
ctccttgcat 
cgactcggct 
gcagggcctg 
gtucggggcc 
cggtcctcct 
ggtcaacttt 
gccgggagtg 
cagtcggatt 
gggatcagca 
gaggcgtgcc 
agctggccgg 
atctcctggg 
cttcgcctcg 
atagtgtgaa 
tcacgttagc 
ttacaccagt 
gaaaaccttc 
cctttctccc 
tctaactctg 
aaaccgagtg 
tctgccttcc 



aaaaggagat 
actgtatgtg 
gaatggctgc 
ttcctgtgcc 
tcagcttggg 
gtgctgtccc 
aactccagga 
gctactaana 
catgcctgag 
agttgctgag 
aattcccaga 
ggaaaagcaa 
tggactgtat 
aaaagaaact 
attccagact 
accccatgcg 
gcagctctgg 
cgccctatgg 
cagcgcgggc 
gaaatggaga 
atcgctcctc 
cacgtgcaaa 
caaccttggg 
aaaagtttgg 
gggctggtgg 
cagccaagag 
gggctcaagg 
gggcgggcgc 
tgatcaaaaa 
cagaagcgcg 
cgacagcacg 
ggctgcagac 
tggctgactc 
gggcctggtc 
aggttcaggc 
cccgctccct 
gctccggcaa 
cgtgcagctg 
ggcctgctgt 
ccggtccggg 
cgcagctgtg 
tgagtgatga 
tgaaagggat 
gtttgcttta 
agtccagctt 
cacccccctc 
gtagactttg 
attgaattct 
caggtcccag 



3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4435 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
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cccggctcct cccctccccc gccaaaccca ccgtccccag ccccacccgc agtgaagaag 3000 

gcaaagtctc cgatgtgcct tgagcccatt gtcaggcggc tgccgccgcc gttagatttt 3060 

tatttttcta accaggatag agctgataat atgttggagc agcatgaggc atagccaagt 3120 

attttacaat tatcaattgt tgagcagagt agaaatctcc ctgggacaga gcctcctctg 3180 

tgttgtggta agaacagaga atccaatttt aaaggggaaa ggacttctta cttttctagg 3240 

ggcagcgctc acagtagctg agaggacagg gcttattttt tctcagtggt tacagttcat 3300 

tttaggcgag attccctgct ccagctgtgg agatgtttcc tgtagcctcc tcctgcaccc 3360 

cccatgtttt ggagtgttcc caacgtttgt tccctatgta tttcgttatt aatttattac 3420 

tataattgta atggcaattg tcatcagtaa tacaattatt tgttattaat ttttctggga 3480 

ggatttttgc ccttggactg catgtaacct ggggggcagg agggtgaggg ggcaggcaga 3540 

tgttgctttt tatgtatttc cttgatttag ttgaattgta aatattagag aagctttcaa 3600 

acttcttttg actgtaacct acagtgaaaa acatgtttac aaaatgacaa agtatacaca 3660 

ttcaattgta acaaaaataa atgccttatg aaatgatgtg tagcctaatt acctgtgaca 3720 

tacactattt tgtattctat cctgctctag tctgttctgt ttcattaaaa aaattagtcg 3780 

tgagctgggt gtggtggcgc acgcctgtaa tccaagcaac tcaggaggct aaggtgggag 3840 

gatctcttga gcccaggagt ttgaggctgt agtgaactgt gattgtccca ctgcartcca 3900 

ggctgggtga tagagcaaga ccctgtctct taacaaaaaa aaaaaaaaaa aaaaaaaaaa 3960 

aattggttat gatccacaaa agcaatgcat tcttcaactg gaaaaaaaaa aacctactgg 4020 

attagggagg tgtgattttt cagtaggagt ctttaaaact gtttgtaatt attttggtta 4080 

taaactttta tagaactatt ttcaatgaat gctgtctata agaaagatac agtctcagga 4140 

gcaagttcaa agattattca cagatgaatg tttctcttta ctaaaagaaa agattcactc 4200 

tattttattt gatcagctgg tgctaacaag catccaacat ttcagaaaac aataacacat 4260 

tcttagaccc agtcgtcaag gcagtctttt ttttaatcta attctttttt tttttttttt 4320 

ttgagagaag tcaagctctg tcacgaggct ggagtgcagt cgcttggtct cggctcactg 4380 

aaacctccgc ctcccgggtt ccagcgattc ttctgcctcc gcctcctgag taactgggat 4440 

tacaggcacc caccaccaag cccagctaat ttttgtattt ttagtagaga cggggtttca 4500 

ccatgttggc caagatggtc tcaatctctt gacctcatga tctgcccgcc ttggcctccc 4560 

aaagtgctgg gattacaggc gtgagccacc acgcccggcc ttgtctaatt ctttattgtt 4620 

aaaatactgt tttttgagac aagttgcaat ccagtttagg ttatagctgt gttttaatga 4680 

tgccctcctt agcaagcatc ttaaaaaaaa aggtgcctcc tatctgtaga ccttctctgg 4740 

ctggttgctt ttgcacttat tttgtgttca irttttaaagc ttttcttttt ctctttattt 4800 

tttattaagg gaatcctcac atacacaaag ataaaagtag tgtaatgaac ctccaaggac 4860 

ccatctctga gactrcaacaa ttatcaacat tgtgccagtc ttrgttgcagc accctgggac 4920 

gaatcttatt cacagcaggc tcttccttct tcctgcccac cccacgcttc ccagtctagc 4980 

accttagacc aaaagagatc ggaaaattga gttacctgta taatctgggc ataatttttt 5040 

tatcgttgtc cctttaaccc agaactttat gcattcttta ctttagcttt aggagagtgg 5100 

tgaatgagat ttgtgaaggg gacattttct agggaatcgg taaattcggg cctttggaaa 5160 

aagtgtatca acatcagtca cattgcctaa tttcatgctt accttgtgtg taagttgcat 5220 

aatcgctggc tttgtggaca tgggaaggaa ggagactggc tgtgggtggg gatggaagcc 5280 

tggggacctc actgtagacc cttccttgtc ccctcgtcag gtttgtatgt tcagagggaa 5340 

agcagtgcag aacatattta aataacccta ttttccttta tttttttcta ggtgttgtta 5400 

gagatracaaa aggaattatt agactacaaa ggagttggca rtagtgttct tggtaagatr 5460 

tacttttgaa ttctgtgaat gtccatgttt caaaggaagc tttttttttt tttttttttt 5520 

ttagaggcag ggtcctgctc tgtcacccag gctggagtac agtggcgtga tcatagttca 5580 

ctgcagcctc caactcctgg gctcaagtga ccctcctgcc tcagcatcct gagtagctag 5640 

gactataggt acacatcacc acacctggct aatttttttt aaatgtttaa aacatttttt 5700 

aaagatccca tgagatggga tctcactttg ttgcccaggc tgatctgaaa ctcgtggcct 5760 

gaagccatcc tcccacattg gcctcccaaa gttttgggat tacaggcttg agccactgag 5820 

ccctggccca aagggaacta atttaaatgc atcaaatata gaattacagg gcctataggg 5880 

tcttcctata ggcctttata gatacatttt taaactttaa tagcttgggt tgcatgtctg 5940 

atagaaacca ttttgaaaat agcataaatg ctggtgagag gttctgacat ctcagcttac 6000 

c 6001 
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ctttattttc tttacacctg tgtcatatat taaaaaacta cacattagag aatttaaaag 60 

atttgccaaa gttgctcaga aagtgatctg atgctaagat ggtgtccttc cacttgtact 120 

acccctatag cctagaagta taccattaat ccatgtcctc ctttataact tggagtcaca 180 

catggatcat gctgtttcag ttatcttgtc taatttgatt tctaacattg ttattgattc 240 

tacccctaag tgccccacct taaaccacag agtgttaaag atgctggatc atttctacaa 300 

tgttatctct aagcctggtg aaccaagaaa ttgctagact actttgtgca acattacaat 360 

ggcactgctg taacattata tgccaaatcc agaatgtgca aagatcagaa gttgatagca 420 

aaqtactxta ggattcacaa cctgtaaaaa tgaagtttgg tgctatgaac catgctgttc 480 
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tcaaaagaaa tgtaagttca ctcctgtaaa ttttctcttt gagaaagtag attactttga 540 

tgtgctttta agcaagacgt acattacctt ctaagttagc catttctgtt acccagctct 600 

tactttgagt tttcatgtaa atttgggtct cacattaaaa agaaatgtac actgcaactg 660 

aaaatctgct ctggaacata attaattgct tttctaacat ttttctcaca cactaccctt 720 

ttcagcagga aatggtaagg aatatgactc acrtaaaatt tcaatgactc ttactgcaaa 780 

aaaaataata aacttaagaa aaccaaagtt gaaattaaca tgaaaaacca cgaacctttt 840 

ataaatgatg accactcttg tcttttttaa ataattcaag aatatagcaa tgtattacat 900 

gcaatttggt tgaccaatat tatgaataat tgaattggta ttaagtcaag taattattta 960 

aaagatattt tcagaaactt taaaagattc cttttctaaa gcaaaatatc tgacttttga 1020 

ctgggaaaaa gattgaagag cttaagaaac aaaatatcta cagagatgaa agggagcttc 1080 

tgcattccta agggacccat atagtgtcag cccatagttc ttcacacgtg cccatagcct 1140 

attcttcaac gctaataagg agggacaaaa aaggtctttt attttctagt gtctatttcc 1200 

caatcagcca ccaattaatg ctaatgctaa gataaccaag gttagctgca aaaaagtttc 1260 

cagaaactca ttggaactaa tcaattagac ttgggcaagt gacttatcat tgcaaattca 1320 

tgaataatag ctagagaaat actaccaact ttgatttttg tagcatttta ttaaagaaga 1380 

aaaatgaaaa caacgacatt caagaaattc catgctacta ttcaagtctg catgagaaca 1440 

tctattcaaa agataatcag aagctatagt cattcagctg aacaagtgaa ccactttatc 1500 

atgggccagc ttgaagatat agctatacat cacttaaaaa tggataataa aacaatatca 1560 

taatgtctac ttgccctctg tctggctcct ggaagccact caaaagaacg tctttccaat 1620 

gacatacagt aattcttgag gaaaattttt agatcctgac aggcactcaa gcatctcagc 1680 

aggagacatt gggtttctgg cttctgttag ataacatctt accctatgat cacagccaaa 1740 

ttttacagac ttccttctaa ttcaatcata agngctgaaa agagaatgag taaaagcctg 1800 

tctcagttct tattaaggta tatgcagaga ctttggaatc ttcctcatat tctcaaatat 1860 

tttactcctc aattaaagtc ccccaaaatt atttcattca tattaccttt ctctgggctt 1920 

tttgcaggat gtgttcaggg cggttgactg tggatccgaa gagggatttt agcttcgtct 1980 

tatgagttct cagtaactgc agttaatgaa gtcctcacct gcttttatac cagcagagtg 2040 

tttccacctg catctaagcc ctgggatcaa caggaaattc catccaatga ttgaagggta 2100 

cttggtgtaa ttaccatgac gtcaattttt accatgacgt caatttgatc ttatcttgtt 2160 

tcaaccctta ggtcattttt gtttggagcc attttcaggt gatggtgacg acaaggtcag 2220 

cccatgaaaa gagagtacgc tgtcactttg tacgtaattt tattgagcca tacacataac 2280 

atcctgaaag ctgacacttt cctataaaca agagacattt aattagctaa gtacctgata 2340 

tttacctaaa caagagctat agatttccga aaatagatat agagcatttc caggcttata 2400 

ccactattta attaggaaac agaaaaataa aacacacacc tatttagact tttctaaaat 2460 

taatttgcat ctgaatttaa taaagatgca tgagaaaggt atgaggcaat tctaagaagg 2520 

actgaatggt tctcaatata cgtgcctggt ttttggtttt tttttttttc tcatttcact 2580 

tttgtaaagc tcagtttctg cctgaacaga aaatatgtat gtcatggctt gccaggtgtg 2640 

ggacacttag ctccaaggtt cccaacccag tgtcacccta aaacctgcac agctcttcag 2700 

actactgtgg cttctttttt gcacacagag attatgaccc ttcaaaagtt cctatcttta 2760 

tcjcactccgc tcagtgccac ttccgtaagg aactggctta gatacagact accaaggacc 2820 

tcttcaacaa agagtttaga agatctatgt gatgcaaaaa gcaattcttg tttaacataa 2880 

agaaacttgc caaagaacgc aaagttgttt tgtgatttat tttctgtgat aaacaaattt 2940 

gtttaggtrtc ttacaccttt tcagatttcc gttgtttcct tccagtatgc atcccaacca 3000 

ttcacctttt cttagtgatg ctcctttttg ggggacgagt gggggagacg gaatctcgct 3060 

ctgtctccca agctggagtg cagtggcaca atctcactgc agtctacacc tcccgggttc 3120 

aagcacttct cctgcctcag cctccgagta gctgggacta caggtgtgcg ccataatgcc 3180 

tggctaattt ttgtattttt ggtagagaca ggagtttcac catgttggcc aggctggttt 3240 

aacccctgac ctcaagtgat ccacccacct cggcttccaa aagtgctggg atcacaggca 3300 

tgagccaccg tgccgggcct tcctagtgat gattgatgcc accattattc aagaatttct 3360 

acatttaaat atcaaccaac tggcagtata ttttxaatgt agcagactta ttttgaaaat 3420 

ctttccatC't aggcatttca ctcttctttg ttctattctt tctcttttat ttgtgccttc 3480 

tttacatcct tttatttctt ttttatctga atgattacat agaaacaatg tttctaggat 3540 

gctcttatzta aattattttt gtttggctta tgatttcata gtggatttta tttttatttc 3600 

acatttctat ataagaatta tgaatattag ggaattctta tgaaaatgtt tattgtgttt 3660 

ttaaagttta catactgttt tttaattttt ttagtgatgt ggctacatraa ataatacaaa 3720 

agtcagtgcc tttaacacct gacttatctt ccccatttcc tcaattattc tactcagaag 3780 

caatgtttcc agtttcttac ttgtgtttct gcagatactc tatgtatata caaacaaatt 3840 

atatttaagt ctccatccct gttgaaattc tccttaaaat aacaaacaga atacaaaaat 3900 

tagaggaacc cctgcctttg tgcttagaga atctctgttg gtiatggtata aattatactt 3960 

ctctgtactt tctacaacaa acatatatta cttttataat t z 4001 
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agactccacg gaaggggaca gggagccggg ctceccacag gcacctgctg agaaaggcag ? 60 
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gaaggcctcc ggcttcacaa agtggccctg ggcatccagg aagtgttcgg ggtggaagcg 120 

gaagggcttc ttccagacgg cctcatcctt cagcaccgat gacaggttgg tgatgagngt 180 

cgttccctgg gcaggagatg cagggtgaga gtggggactg gactctagga tgctgggacc 240 

cctgccacca aacacacggg ggacacacac tgcctggcac acagctggac tctgtcaact 300 

agtcctgcgc ccgagaagct ccacagtacc ctctccgacc ccacagcagg gcgcagtcac 360 

acctctcaga ggcacccaca ctgccccctc tccctgcagg cgctgggtcc tccaacartc 420 

tggcaggtcc tgatttgtct tccccactag actggggctc tggatggaca ggccagccct 480 

gcctatactc tggacccccc atccaagcgg ggacagtcag tgtggtggca ttgaggacta 540 

ggtggccagg gttcctagag tgggcccacc tggcagtagc catgctgggg ctatcaccag 600 

gggctggtgc tgagctgggg tgaggagggc gccaggccta ccttagggat gcggaagccc 660 

tgtacttcga tgtcacggga tgtcatatgg gtcacactca gggggatgat gtccccaaag 720 

cgctgcacct cgtgaatcac ggcagtggtg cagggcatgt gagcctggtc acccatctct 780 

ggtcgccgca cctgccctat cacgncgtcg atctcctgtt ggacacggac tggacagaca 840 

tgcgtcccca caatgggtca gcacccaggg gacactctcc ttcctcctgt gttggaggaa 900 

gttaggctta caggagcctg gccacgcctg tgctggaagc cccgggtgtc ccagctaagc 960 

ccaggggccc ccagctgtac ccttcctccc tcagtccctg ccttgggccc cagctgggct 1020 

cacgctgcac atccaggtgt aggatcatga gcaggaggcc ccaggccagc gtggtcgagg 1080 

tggtcaccat cccggcaagg aacaggttac ccaccactat gcgcaggttc tcatcattga 1140 

agctgctctc agggctcccc ttggcctgag cagggccgag aggatactca ggggatagaa 1200 

cggggtagcc cccaaatgac ctccaattct gcacctgtca gcccagatgc ggctcgccgg 1260 

gtgatgcact ggtccaacct tttgcccagc ctcccctcat tcctcctggg acgttcaacc 1320 

caccaccctt gccccccacc gtggcagcca ctctcacctt ctccttcttt gccaggaagg 1380 

cctcagtcag gtctcggggt ggctgggctg ggtcccaggt catcctgtgc tcagttagca 1440 

gctcatccag ctgggtcagg aaagcctttt ggaagcgtag gaccttgcca gccagcgctg 1500 

ggatgtgcgg gaggacgggg acagcattca gcacctacac cagacagaac ggggtctcaa 1560 

tccctcctgt gctctgcgtt cacctggacc agtctcaggc cccagccatc tccaggaaga 1620 

cccagggcct gcctgtcctt accactgacc tcaccaagtc cctccccaag tgccagcctc 1680 

caccctctct ctccttgccc agaggagaaa cctaaaatcg aaatctccaa cgtggacggg 1740 

ggtacagagt ccttggcctc tcctggtgcc ccctgacccg ggcacacctc tcccacgacc 1800 

atgtctgaga tgncccctcc tcctccaggc ccttcttaca gtggggtctc ctggaatgtc 1860 

ctttcccaaa cccatctatg caaatcctgc ccttcggagg ccccagtcca gccccggcac 1920 

ctctcaggag ctcgccctgc agagactcct cggtctctcg ctccgcacct cgcgcaggaa 1980 

gcccgactcc tccttcagtc cctcctgagc naggtccagc agcctgagga agcgagggtic 2040 

gtcgtactcg aagcggcgcc cgcaggtgag ggaggcgatc acgttgctca cggctttgtc 2100 

caagaggccg ttggggcgaa aggggcgtcc tgggggtggg agatgcgggt aaggggttgc 2160 

cttctccgtc ccccgccttc ccagttcccg ctttgtgccc ttctgcccat cacccaccgg 2220 

cttggtcggc gaaggcggca caaaggcagg cggcctcctc ggtcacccac tgctccagcg 2280 

acttcttgcc caggcccaag ttgcgcaagg tggacacgga gaagcgcctc tgctcgcgcc 2340 

acgcgggccc atagcgcgac aggatcaccc ctgggggcgg gacgggcacg tgggcgttgc 2400 

catgaaggcc ttggccccac cctccgccac ccactccaac cctggcgctc cacaaggtct 2460 

cccgcagtcc ctagcccggt ccagctgggc acagggccca ctctttgctc acccacattg 2520 

ctcccctgcc tggggcgggg tttggcccca cctcgtctct gcccaccctg accacctttc 2580 

cactcaagga agatcccgcc cgtcccgccc acactgagcc cgcagcatag gcgcggtccc 2640 

cgccaccgcc acttcgacgc atcagcctcg cccaccgggc ttctggcggg tctgggcagr 2700 

agccccgccc cctcccagcc cacagactcg cacctccccc gtgcaggtgg tttcctggcc 2760 

cactgtcctc agcccactcg ctggccttta tctctgtttc acgtccagga ccccacgccc 2820 

tgtcggcgct gcttgggcta cggtcactgn ccacccgggg cccacggaaa cgcggtctct 2880 

gtcccccacc gccgcttgcc ttgggaacgc ggcccgaagc ccaggacctg gtagatgggc 2940 

gcaggcgggc ggtcggccgt gtcctcgccg cgggtcacca tcgcctcgcg cacggccgcc 3000 

agcccattga gcacgaccac cggcgtxcag gccagctgca ggctgaacac gtccccgaag 3060 

cggcgccgca actgcagagg gagggtcagg gcctcttgtc aagccaggat caccccagac 3120 

tacaggtcct agtcctattt gaaccttgga cgacccccgg ggctaccagg agtgagcagg 3180 

irggaaggagg agacccagcc tcctgatcct ggggcggggg tgggggtcac accttctgtg 3240 

atggaggaac tcagtttgga tgcgtcaccc aggtatgacc ttgcaagagt caccaaaatt 3300 

gccgagaggc cccagttagc atcccattcc cagatgatgg tccatgccgg tgagcagtga 3360 

ggcccgagga cccacagtgc aaaaggtttg aaccgggtca ctgcaccccc ttcatcctcg 3420 

arttcgtgat ttaaacggca ctcaggacta actcatcttc cattcccaag gcctttcctt 3480 

ctggtgtcag cagaagggac tttgtactcc ataacatatg ttgcccaatg ggcttgcatg 3540 

cccactgcca agtccagctc cacctccagg cccttgccct actctrcctt ggcctttgga 3600 

aaatccagtc cttcatgcca tgtataaatg tccttcccca ggacgtcccc caaacctgot 3660 

tccccttctc agcctggctt ctgatccagc cngtggttta acccaccacc catgtttgct 3720 

ggtggtgggg catcctcagg acctctgccg ccctccagga cctcctccct cacctggtcg 3780 

aagcagtatg gtgtgttctg gaagtccaca tgcagcaagg ttgcccagcc cgggcagtgg 3840 

caggggacct ggcgggtagc gtgcagccca gcgttggtgc cggtgcatca ggtccaccag 3900 

gagcaggaag atggccacta tcatggccag gggcaccagt gcttctagcc ccatggctgc 3960 

ptcactacca actgggctcc tctggacaca cctggcaccc ccaccccacc aggcacagag 4020 

gaccaggcag gacactctcg gcacaccgag cgcgtgaccc ttcccttata aagggagctg 4080 

a£ga;tggcct tcgccctctg ctgtgagtga acctgctgtg ttg^ctgtgc tgccagtggc 4140 
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agagtcaggc 
aggcccttac 
cactccaagc 
gccatccacc 
catgaccttg 



cagggcaggt 
ccagggtagg 
tccccaagcc 
tcccagctag 
ccctttccct 



atgggctgct 
gtggtagaaa 
cacacaggct 
gctcatacac 
acccccacct 



seq_prot.txt 
ccagaggtcc ttgccgctgc ttcctgctcc 
ggcctggtcg gagaagtcac cccctctccc 
tctgggataa ccagggtctc agtggacccg 
cctaatgtag tcacaacccc tcctccagaa 
gcccactc 



4200 
4260 
4320 
4380 
4418 
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ctgtctcaaa 
aaactgtgat 
actactgata 
cggctcacat 
aggagtgagt 
gttccattta 
gaataagaag 
atcttgcata 
ctgtaacatt 
ggccattttc 
aagtcattac 
tctttaaaca 
caacctttct 
ccatattcaa 
ccctaaactc 



aaatatttta 
atatttiataa 



aaaaaacttg 

tctcactatg 

gccttccaaa 

taaacatctg 

taatggagta 

tcaccgacac 

ttctggtgaa 

gacacacttc 

taaataactt 

gtctggggct 

attggtagaa 

caaatatttt 

gttaagtaac 

gtttctctaa 

tatggggctc 

agggagaagg 

tagacaagtg 

ccaggccagc 

gctggtgtgg 

gaagcctccc 

tcatgtccga 

gctcctgcca 

cttcccgcat 

gtcccgagac 

cttaaaacct 

gaaaggcaaa 

tatgaagttc 

cgcaagcgct 

agcatgcttt 

*gggcgt&gt 
agagagtctg 

ccgaacttta 

agaagccacc 

cccctggcaa 

tttagtaccc 

ttgtgcagag 

cacacggttt 

atccttttcc 

acttgtaaaa 

gtattcgqtt 



cgtacaacac 
ctgtaatrtcc 
ctacagtgaa 
catgaagttc 
gtggcaggga 
ttgagggagg 
taatatccat 
tcacataatc 
caattcggtt 
cattctcctt 
tccctacaaa 
aaatctagct 
ttccttcccg 
cccttcttat 
ttgccccagg 
gtgctgtgat 
cctcttgggc 
tcctttacac 
ttcccacact 
cagttaacac 
atttaatgtg 
ggccaaggtc 
ctaaaatccc 
tctgcgtgaa 
acaatcggga 
ctgttaaatt 
acgcctttat 
tggcagatga 
ccggacctct 
accccggggg 
ccccgccagg 
agaggccccg 
ctggctccaa 
gctggccagg 
acttctgcgt 
ccgctgccaa 
tcagcgcaag 
gcacacttaa 
accagctcta 
cctctttccc 
ccacggacta 
aggaggggcg 
ctgggtaggg 
tgctgcggag 
actgatcgta 
acctgctgtt: 
cctgggagtt 
agggctgttg 
gaaatacaca 
ctcagaactg 
tggcctctgc 
tctggacctt 
cagttcaatt 



aaaaaataaa 
aatgaaatcc 
ggactaacct 
aacattttca 
ctgtgactgt 
aaaaaaaggc 
ttgacagagg 
tgtggttata 
acactggaga 
tgtaaaagaa 
tcagtgagac 
ctccaccctt 
ttaccaccta 
gctaccaagc 
ttcccaccgc 
ctgagattag 
ctgatctcaa 
ttacaggcgt 
aagctgcagt 
ctcatctggg 
txtctagagg 
ttatccagaa 
tactaacgcc 
acacaattaa 
tcatgtttac 
atggaatact 
aagtgtaacc 
tcccaaccca 



agccccgcgc 
caaacgacaa 
gactcgggtc 
tcaccttgac 
tgagcaggta 
ggaggtgcaa 
cgccgcgact 
cgccactcgt 
gaagccacta 
cgcgaccgcc 
gacccggtgc 
gccgcattgg 
agagagcacg 
ccattcagat 
agcccggcgg 
ggctccaggc 
gcggggctac 
ccaccgtctg 
gtctcattta 
gctcagtctg 
gaagcaaata 
ggagatttgc 
tgcagagtga 
gtttttattc 
tctttgcgat 
gagcctgaaa 
taaatatgtc 



taaaagaaca tctactacca 
cacagaagaa taagaggata 
caaaaatatt ttttgagcag 
gaggcaaaag caggacgatt 
gccattgcac tccagcctgg 
aaaactatct atttatgatg 
gtacaaggga acttcctggg 
cagtiacatgg gtttgtccaa 
atacatttca tagaagcaga 
gtgcaacact gctgactatt 
cacactggtt tcccttctca 
ccctcatgtt tctatactct 
tgggctgaag acacccaagt 
atgacatatc acaaactcaa 
aactccccca ctaacccccg 
aatcatctgt ttttgttttt 
actcctgggc tcaagcgacc 
gagccacggt accagacctg 
gtcccacaga aaaaggaaat 
cccctcacta ctgcagtcta 
attaggaggg aatatactat 
ccaaccacct gcccaaactg 
gccctggatg aaactgagct 
agcggaaaac gccatgattt 
ccccaacaca tgctggagag 
tttcagacaa cccaaacaac 
tcgctgcatg caacttccct 
gatcatagcc ccccaacccc 
tctctaatct cgagctcaag 
ggaggccgaa agccagatgt 
acactgggga cccccacaaa 
ccctrgcagtg agccagacca 
cacaaaaaga agcctcagtg 
ggagaacaga gacgaactcg 
gcctgtcgcc gaggctaggg 
acctgcgggc tataggcctc 
aacttgtagt acatgacgcc 
ccagagaagg accccgcctc 
tgggactagg gctgcccgaa 
ggaaccaaaa gccagggttc 
caaagtcgct gtggcggagt 
t:tgggcagtg ccacgacaac 
actaagcccg gcagtgtcgg 
ggggcctggg ggggagaggg 
atcgaagccg gttgggaatt 
gtcgtcgggt tgaaggcgtc 
tttccttccg acctagagca 
cccaggagta tagacttgtc 
agaaaatttg gaagcacctg 
cccaataata taaaagtagc 
tctaaagctc aggtaccagg 
tcacctgagt tttaaggtca 
aaatgaggta gggggagggt 
gatctggaac taccactttc 
tgtt^gcrct gtgctgtgtg 
Seite 37 



ccaacagaat 
aaaggaatta 
ccagacacgg 
actrgagctc 
gtcagagctg 
ataaaagtca 
atgatgtaac 
actzcatcaaa 
tgatagaaat 
ccttccttac 
tccggactat 
aactataatc 
ttattaaact 
catttataaa 
aaaaaaaaaa 
agagacaggg 
ctcggcctcg 
aaatctttct 
gaagcaacag 
actgcgtatc 
ctcagacagc 
ctagtcactt 
cacaataaat 
gaaaaggcac 
tcaaactcca 
gagcatctgt 
ccctggacgt 
ctgcaccatg 
tgaggctgcc 
cttcgaattg 
ccatcccttt 
aacttccacg 
accaccgtac 
accgcgggca 
ccgccacttc 
cgaagcccat 
cagagtccgg 
cccggctgtg 
gatcgcctaa 
tcaggaccac 
ctgtaggaaa 
tcaagaaatg 
agtcaggtgg 
cggtgattgg 
ccaccatccg 
tcttgcttta 
cctcttatcc 
ggccaagctt 
ctgaaggctg 
ctcctgggct 
atgtzaaatgr 
ttgattattc 
gatcagaaac 
agttctggta 
caaggtactg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 



seq_prot.txt 

tgtaaaagga gtatactagt ttcttactct ccagggatgg actcaggctt ttcttagctt 3360 

catcatctaa ccataggttt acacaagcat attctagatg cacttatgaa atcatgttgc 3420 

aaattgatgg atgatactaa cctatatata gcattgccct aaactcattt aatcttcaca 3480 

acaatcctat aaggtaaatg ctattatcat ctccatttta gaatacagga aaatgacgcc 3540 

tggagaagct aagtaacttg ctcaggtcag ggataagtgg cagagctagg catttggctc 3600 

aagacttcct actgttaagc actatgttac aaataaatag atatgaagcc attctgaaaa 3660 

gaggaggatc aaggctgagc attccagggt gagggtgtac accagcatcc tggttgcagg 3720 

crtacgttca agagccccaa gtattcagca ccgccaatat gttgtaaagg tttcagcgaa 3780 

ttcatgtaag ccagcccaaa gttgccactt ttacaaagag acactgtgat ataatggaaa 3840 

gaacatgcaa tttgggttta tgtccatatt tctccaagga taagtggttt gggactagtg 3900 

atttcctttt ttt-ttttttt ttttttaaat tctttttgaa acagggtctc actctgtccc 3960 

aggctggagt acagtggtgt gatctcggct cattgcagcc ttgacttccc aggctgaggt 4020 

gartctccta cctcagtctc ccgggtagct ggaaccacag atgcgcgcca ccacgcctgg 4080 

ctaatttttt gtatttttag tagagatggg gtttcaccat gtttcccagg ctagtctcaa 4140 

actcctgggc tcaagtgatc tgccaaactc aagactccca aaatgctggg attacaggca 4200 

tgagccaccg tgccaggccc accccatgtt ttgatactga atagattcag gaagattttt 4260 

ctacgacaaa aagttcgact acctcaaaag aaaatgatcc attgacatta gccttcttga 4320 

gtcccaggcg ctagcaaata ttgagtgaaa gaaaaagcct ttcatagagt ttagattgaa 4380 

aagagaaatt aaatgatt 4398 

<210> 29 

<211> 4471 

<212> DNA 

<213> Homo sapiens 

<400> 29 

aattcataga ggtacattta cagaggctaa attttactgt atgtaaatra tactgctgta 60 

gccrtaatttt tttaaagtgc tattgcatat aaaacaatgt gctagacact gaattatgaa 120 

atctaacgtt caagcataca attctcaagg tttctagatg tattatcaaa aacttatcat 180 

taaacacttt gagttcagcc ttcagtattt tatattrtat ccatatttat attctataaa 240 

acgactcaaa gcatcaccta aaattatcac atgaattttg aattattatg tgttcaacat 300 

acacttatag aaaatttcct tcttattctt aaaagtgaac atatcaatat ctattaaatg 360 

ttaattatat ggttatgagt tgaatttact actaaattct accttttctg aattataaat 420 

acattaagaa atgtaaactg tgattagatt tattcacatc cctcagaaca ctacgaaatg 480 

tgtcctaaac atgttaattg atttcagaaa cttctatgtc ttgagcaaga cttaagtgtt 540 

aacaagcatc tgatggcttt taaaataaga atgtttcaga aattcatcta aaaaaatgct 600 

tgaacgtaaa atgcaatcct acggtgtata ctctaatgcc tgttgatgaa cagatcctta 660 

tagtttcctt cttctagaac tcatatcaaa atacattttc ttcagaaaaa ataggagtca 720 

aagatgttaa tcatgttctt ttcatagtca cgttacatrt gttttacaac actctcagaa 780 

aagcctttcc tttcacatgt tcacttttta aattacaaac atgtttcccc tcactagaca 840 

ccgagttacg aaacttagag caaaagtaaa actttctaat ggaatgccca tgtaaatact 900 

agaaatataa caaaataagg caaagtctgt cctccaaaat gttgcctacc acaccaaaac 960 

acaatcctgc tttttaaaca gcgagtaaaa ttaaatcaac cttatcttcc taaatagaag 1020 

tccactcaaa gttttttcac aaaaatgaca ttcacttgac ttaatattgt ccttccaata 1080 

agaataacac gcagggtgtt agagggcagt tgrtggactc ctcttcagca aataaaggag 1140 

accagttaga gcttagcttg gtctgcccag gaaaggagga acgcagcttg ctgacatggc 1200 

taaggaatgc caattaatac attttaaaac ccctatcatc ttgcctgaga cgtgtacatc 1260 

rtcccaggcc ctcaaagttc aatcagaaag tgtattcaac tccattgtca tttcacccat 1320 

agtggggaag tccatcgagg aaaaccatag ggaaataaaa tgtttttcag ttattcatat 1380 

tagcacaatc aacccagagc tcacggcaca aactaaataa gttaacttgc cagtgtcttg 1440 

aatgaagaca rttaatcaaa gcatgcartt tccaactttc cagtagcttt caggacagtt 1500 

ggttttggtg atcgtttttt ggtcttttat tgttctcaac acgactggrt tctcaccgca 1560 

ggatttcaaa caaaatgaga caattaaacc acacctcgag cgaaggggcg cttaccttgc 1620 

agatiaaacac accggccttg ccttctctaa aatgcggaca cgtgcttttc ccgcattagg 1680 

gggggtctcc cggcgcgcgc cccgccgcca cctgttgagg aaagcgagcg cacctcctgc 1740 

agctcaggct ccgggcgcca gccctgcccc gcagccccag agcccgtcgc agctcgggtg 1800 

gtccctcccc ggcccagcgc tcgccgcctg ctcttcgccc tgcaagtttc aagaggcagt 1860 

tanttctcgc agcctccgcg cttgcaactg cctcctggcg ggggagtggg tgtccaaaaa 1920 

gccagcagct ggagaaactg aaaagatcac aagcgactta acgataagcc ccttcttcct 1980 

ttraaagacc gagaggaggg tagaggggag tagtgcctga gcccacgtga ccgagccagg 2040 

gagcccgacg gtctcaggaa cgcccgacgc cgcgcgtgac ctctaagtgg gagcaccctc 2100 

gaaccgactc ctggtccacc cacaaggata gtggcgcaca gatggcgctc cccgcagccc 2160 

cagtctcaga tttaagaggt ctggagtagg gcctgagaat atgcatttcc aaccaggtcc 2220 

tgggggatgc cgacactgat acagccagtc tggggaccac acttcgagga tcacgtcctc 2280 

agcccctgac tcacataagt gtcattcaga acagatgtct gattctaagg agccagtggt 2340 

gaaaccagag aggctttggg tttgtcaaat ccccagcagc aaacgtaacc tcgggccctg 2400 

gagtggcaaa gccgtggact agaggtggag ggagtgggtt ctcagctctc aagaggtcac 2460 

Sei te -38 - 



tgggaaggcc 
ggcaccgggg 
acccctcttc 
ttaatggggt 
cagtcttaac 
gacccacagg 
ccatctactc 
agaggaagac 
ggtgtttagc 
aggtaatgaa 
tgtgaaaagt 
tttctctcag 
gggaagagtg 
gtctcctgtc 
cttatggaga 
gctcacacct 
gtttgagacc 
gctggacgtg 
cgcttgaaca 
tgggcgacag 
tggtggtata 
caaccataag 
attaacatxa 
ccagctcagt 
ctgtcagcta 
gacagctacc 
cgcaagccta 
ggaaggttaa 
tccaggagag 
aaaatggctt 
gccaagcttc 
aacagcatgt 
gaaacctcac 
gactcaccct 

<210> 30 
<211> 4479 
<212> DNA 
<213> Homo 

<400> 30 

cggtggctca 
caggagatcg 
aaattagccg 
gagaatggcg 
ccagcctggg 
gcgtggtggc 
gaacccagaa 
ttcagagagt 
aggtgaagag 
taacggtctc 
catgacagcc 
tcatgacggt 
ggctgcagca 
tccaaacaaa 
agcatcgctg 
tccagaagac 
agcaggcagc 
ctaccttcct 
ggtccagatt 
gctagtggat 
ttcaaaatga 
tcacagaatc 
gtatacacaa 
gtctgttttt 
aacaaggctc 
tctttttgga 



tttcgcgtta 
agacctgtgc 
tcgggcgaag 
gttcccctca 
tggctttaag 
ccattgtgag 
aggtggcata 
gagggggggc 
caaaatagag 
gacaggctzcc 
ctccactgcc 
gttgaaaggc 
tgtcgctggc 
ctggggtctc 
aaatgttaaa 
ataatctcag 
agcctgacca 
gtggcacatg 
tgggaggcgg 
agcgagactc 
tacttgttaa 
ccaacaggca 
tacctgacgc 
gagatattga 
cctgaagcca 
gcttattgga 
taaggcaaat 
acaacctccc 
aggccaagga 
gactctzctxjc 
tgctctgtgt 
nctcttttca 
tgcaggaaat 
gtccatcaag 



gatcaaagat 
ctccggccac 
gcccctcgcg 
cccaccccag 
gcagctgttg 
aaccccccag 
agggacatgg 
atttggaggg 
agacgcaagt 
aaagatggag 
ccaccctaag 
agagtctccc 
aaataactgc 
tcaaagtacc 
tgatggttta 
cactttggga 
acatggtgaa 
ccagtaatcc 
aggttgcagt 
tgtctcaaaa 
caagatatct 
gaaaaagata 
tgtgctaagc 
ctccactttt 
cctagtgaga 
cattitiatigcc 
tctattaccc 
tatrggtcaca 
cccatttacc 
tctatgcaga 
tcactgtgat 
gagcagctca 
aatiaagcaga 
aaagacaatg 



seq_prot . 
cccaggtact 
ggcgcccttc 
ccccctggtg 
caggagcccc 
ctgatgaaaa 
tgtcacatct 
tttttgaggg 
aaagttttac 
gtgctctggg 
aagcatctgc 
tccaattttc 
tgtccttgag 
agcagacaac 
cagtccccat 
aaaaaatcag 
ggccgaggca 
acctcgtctc 
cagctactca 
gagccaagat 
aaaaaaaaaa 
ctggaaatgg 
ggtgagggtg 
acctgcacag 
ccagtgagga 
aaggaaccca 
ccgctttctc 
ccattttata 
tgacctatca 
tccataccat 
acactgcaga 
gcttccttca 
aacacccaca 
aaatgaatgg 

g 



txt 
tcctcgaact 
tccgctggag 
gcagccccag 
agggtgcgag 

tgaaaaggaa 
gtgcctaatt 
titgtgatcca 
agtaccagcc 
ttgataaaga 
ttcatcagct 
attaaagctg 
gcagagaagt 
atgccaatct 
atgctcttcc 
cttcatggcc 
ggtggatcac 
tactaaaaat 
ggaggctgag 
cgcgccactg 
aaaaaaaaat 
aaaccgtcat 
aagaaaatgc 
catctcattt 
aattgaggct 
gtaacaacag 
tcaataattc 
cttgagaaaa 
gtggctaaga 
gcctccaatc 
caaacttctg 
gaaggcattc 
gaagctgttt 
gagtgagtaa 



cacttgaagt 
gcacctgccc 
ttcccccagc 
actaggacca 
agtagcatgt 
aatcatcaaa 
gatctctaat 
tcagctgttt 
tgaggtccac 
aaacaatagc 
agagtcctgg 
tagtttggac 
cagcctccct 
tccttgccac 
aggcatggtg 
gaggtcagga 
acaaaaattg 
gcaggagaat 
cattccagcc 
cagcttcatt 
gaaggaattt 
atctatcaat 
aatcctcaca 
cacagagctt 
tggcttcata 
ttattttatia 
tgaatgttca 
aggacttgaa 
aatcagaatg 
ccacctgtgt 
tacacacagg 
attatacaag 
gtttccttct 



Sapiens 



cgcctgtaat 
agaccatcct 
ggtgtggtgg 
tgaacccggg 
tgacagaccg 
tcatgcctgt 
gcccagcaaa 
gctaaagatt 
gtggcacggg 
caaacctctt 
tccagccaag 
ccctgggaat 
tctctgttta 
actgaccaca 
actcagtctg 
agtgggtcac 
tggcaccctg 
ggcactctcc 
tgcaaggcag 
tgctgggact 
ccccaagccc 
cctgctatgt 
atgtctgctt 
ctggaggaca 
tatgatgaag 
atgggctata 



cccagcactt 
ggctaacacg 
caggcgcctg 
aggcggagct 
agactccgtc 
aatcccagca 
accttgtctt 
agcgcatgga 
ggaggagggg 
tgtccaggag 
taatictggag 
ttcctgataa 
tgttctggtc 
gtcttctaga 
tcccctggaa 
ctgcctccca 
agccctcggc 
ctgccttcct 
gaaaatgctg 
cagcctcctc 
tggttcctga 
tcctccaagt 
atcctcaggc 
attttatttg 
taaaacacta 
cagagatgtg 



tgggaggccg 
gtgaaacccc 
tagtcccagc 
tgcagtgagc 
tcaaaaaaaa 
ctttggaagg 
taaaaaaaaa 
taaggaagtt 
gcggaagggg 
gaaatgaagt 
tcatgagagc 
ctaacctggg 
acgtttattt 
ggaagtagca 
ggctggcagc 
aaagctgaaa 
tggggcagag 
tctgtcactc 
caggcctagg 
ottcccacta 
cactagggga 
gtgcccagag 
aggaagggtg 
ataaacaatt: 
aat.acacatg 
ctttttaaaa 

seite 39 



aggtgggcgg 
atctctacta 
tagtcggtag 
cgagattgtg 
aaaaaaaaaa 
atgaggtggg 
aaagaactgt 
ctgtgaagag 
agaaagggtg 
catxtgctct 
tgctagggga 
agtttcgggg 
acaattaatg 
aggttggctc 
tcagcaagca 
ggctaacttg 
caaaggagcc 
tcaggtggac 
ctgggaaagg 
agagagcgag 
aagagatggg 
atgcgtgtgt 
gatgcagtca 
gtttctatct 
cattaaaaaa 
tgttaagagt 



atcacaaggt 
aaaatacaaa 
gctgaggcag 
ccactgcact 
atatggctgg 
aggacccctt 
gcacaaagat 
ttgaagtgct 
tcacgcttca 
cagcaatcag 
gcaacatgaa 
taagtcctca 
ggttctcaaa 
tgaagcctat 
cagaagtctc 
tactttcccc 
ttcctccttc 
ccagacccaa 
gcccaaagcc 
tcctactggg 
ggtgacagaa 
gtgtgtgtgt 
tttacacatg 
gaatagaata 
tigGataatta 
gtaaaaggac 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4471 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



seq_prot.txt 

aaacagtgaa aaataaatct tcctcttatt ttgtcctcca gtctcccaat tcctctactc 1620 

agaggtgaga acagaacttc cacaccctcc agaacctcca cagttagaac tgtctacatg 1680 

tttccattgt ctttactttt attcttgcct gcacaaataa atgaattgct cattatggaa 1740 

acttcccaaa agacccgtta acacttcaat aggaagcacc aacagtttat gccctaggac 1800 

tttgttccca caatcctgta acatcatatc acgacaccta acccaatcct tatcaagccc 1860 

tgtcaaaaac ggactttaaa ccaagctgca aattttcagt aatctggcct tgcctttccc 1920 

cctctgatag caccatcaaa caaaccccct tactgccgaa agcaataagc ccggctttgt 1980 

tccatccact ggttgtgttg gtgatatctg gggactgcca ctgaacagac gcacagaggg 2040 

agcccctaca ggcaggggtt trtctgtctg tgcrtctggg agagtatgtc tcgtacattt 2100 

gtcgcgttga tgaagacttc acagctccat cagctgcggg caagggggtc tgaggcagtc 2160 

ttaggcaagt tggggcccag cggggagaag ttgcagaaga actgattaga ggaccccagg 2220 

aggcttcaga gctgggcgag gtagagagtc tcctgtgcgc cttctctcct ctctgcaatt 2280 

cggggactcc ttgcactggg gcaggccccc ggccaggtgc atgggaggaa gcacggagaa 2340 

tttacaagcc tctcgattcc tcagtccaga cgctgttggg tcccctccgc tggagatcgc 2400 

gcttccccca aatctttgtg agcgttgcgg aagcacgcgg ggtccgggtc gctgagcgct 2460 

gcaagacagg ggagggagcc gggcgggaga gggaggggcg gcgccggggc gggccctgat 2520 

atagagcagg cgccgcgggt cgcagcacag tgcggagacc gcagccccgg agcccgggcc 2 580 

agggtccacc tgtccccgca gcgccggctc gcgccctcct gccgcagcca ccggtgagtg 2640 

ccgcggtcct gagatccccg ggccggatgc gcggcggccc cagctcccga gcgtctgcct 2700 

cccccgccct gggctgcccg ggctccctgg gctccccggc ggctgcacgg agtcaaggcg 2760 

ccccgtcccg ggcgtccccc gcgggtgccg atccaggctg cccggagtcc ggagcccaga 2820 

gaggagagag acagctgggg agcctggtca ccgcgggcat ctcccctgcg ctgcagtcgc 2880 

ccgcctggcc tgccttcccg ttcctccgcc tcttgccctg acttctcctt cctttgcaga 2940 

gccgccgtct agcgccccga cctcgccacc atgagagccc tgctggcgcg cctgcttctc 3000 

tgcgtcctgg tcgtgagcga ctccaaagtg agtgcgctct tgctttgact gatgctgccc 3060 

aaggacctct gatcagcacc aggggagagg aggggctgct cagggagctg gggtcctccg 3120 

gattccatcc acagcagggc cagactcrcc ccaggaaatg ggacagggtg gcagcggagg 3180 

cttgagaacc acgggggttg gcactggctg gcaagggagg aagaggccgc cgggactgcc 3240 

ccagcctgcg ggcatctggt agatgaagct tgcttgggtc aatccatttc tcctggctgg 3300 

aaacccatgg tcttccattt gagaactaga tacgaacagg gtgaggcgag agggagaggg 3360 

aagagtgggt tttgggattg gggccagttt accctcaccc tggagtccct ggagcatggg 3420 

acctttgatg aagcctcctc ccgaatctct tccagggcag caatgaactt catcaagttc 3480 

catgtgagta tccaccccta caacagttgg ctgcacagac aagttgggaa ggcttcaggg 3540 

gacatcccct ccctgccctc tgctgcaggg ctgcgccacc ccttaccact tccactcccc 3600 

ctcgcttacc ccacctttgt tctctccagc gaactgtgac tgtctaaatg gaggaacatg 3660 

tgtgtccaac aagtacttct ccaacattca ctggtgcaac tgcccaaaga aattcggagg 3720 

gcagcactgt gaaataggta tggggatctc caotgcaact gggagagaaa tttggggaca 3780 

gggagggatg ggtgggaggc aagagcaggc aggagttagg agctggaggt agggtgggtg 3840 

acatcttcat ccctatgtga caagcataaa cacacacaca cgctcacgaa acagtggcca 3900 

cacaaatgtg aggtggggtt ggaaggagac cctgtccagt cttctggcag gtctgaaacg 3960 

acatctttaa aatgtccgtt ggcagccggg catggtggct cacgcttgta atcccagcat 4020 

trtgagaggt caaggtgagt ggatcatttg aggtcaggag ttcaagacca gcctggacaa 4080 

catggtgtaa ccctgcctct actaaaaatg caaaaatcag cctggcatgg tagtggatgc 4140 

ctgtagtccc agctacttgg gaggctgagg caggagaatt gcttgaacct gggaggcaga 4200 

gatctcagtg agctgagatc acaccactgc actccaactg ggcgacagag caagactcca 4260 

nctcaaaaaa aaaaaataaa agttagttgg aatgttcttc tctttctcat attctctcat 4320 

cctcctgtcc ccttgtagat aagtcaaaaa cctgctatga ggggaatggt cacttttacc 4380 

gaggaaaggc cagcactgac accatgggcc ggccctgcct gccctggaac tctgccactg 4440 

tccttcagca aacgtaccat gcccacagat ctgatgctc ~ " 4479 

<210> 31 

<211> 4492 

<212> DNA 

<213> Homo sapiens 

<400> 31 

ttgcaggctg cgttcagccc ttgtgcacca gggacagcaa ggaaaaccca agctagacca 60 

gcttcagggg tggcagcggc tcctacctcc agagaagaag aagacactct ggatgggttc 120 

acaggtggca ggcacaagcc agtccatcct gtagtcatca tagttgttgg ctxccaagtt 180 

gctctcctca ctggagaaca aggacagcca cgtggcgcgg gatggccggc gggagttctg 240 

gttgcggcca cggctgtggc ctcgttgtga acggtagcct ttgcggttgc gatgcctaaa 300 

cctttgtttc rtggccaagg aggggcgggg tgccatgcct gagatgtaga tgcggccagc 360 

catggctgcg tccacttgcc ctggcacacc gtgccagtcc cggctaatga actggggctg 420 

tctggtacca gctgtggcag ggaaggggtg aatgagaggt cttgggggtc cgaatcttgc 480 

cccttccagc ggggccatta gagttcatct gctgcacctt gcccaaagac acacagcagc -540 

gaatggcaga gccaggcctt tgactctcag tccaattccc ttccccctgt gctttccctc 600 

caccatatca ctggtgtctc gaccccaccc ccccaggccc cctaaactcc tcagccacag 660 
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cccctccctc tgctggctgt gctctcagct gtgctaggca aagtccaagg tgtggggtcc 720 

aggtagagcn gagctttccc caaaaggtca acagaagcaa agtctggcct gagcaaacag 780 

actttgatcg atagctccac aaccacagcc ccctccagcc ggcetggctc taccaatacc 840 

aaggggtttc agcccttttg agggagaagc aagacttgcc ctctctccat accagaggtt 900 

ctgccccaga agagaagctc gaagatgtcc tcccagctgt cccgctgcat catggcaaag 960 

tgttcaaaca cagccgacag ggagctgcct tcacactcct cctgactggg ctggtgctgg 1020 

aactggtact cccagtactg tttccctgag gagcagggtg gtgggcatta ggagggctgg 1080 

gccagggcaa gactggagat cccagaggct gttgaagtta ggatctccca gcatgaggtg 1140 

ggggtcaggg gtgggcacat gctgaggcct gtccccagtg aaatgtgaaa ggaatccaag 1200 

actgctccac tgcctgctca gtctcccacc accccctgag tacccttgaa gaagtagacc 1260 

cgctcccggc cactgtagct atgggcaggg agggccaagg ctgcatccac gttgtccggg 1320 

atgccatcga agccgtcaga gatatttcgg gggtaatcag ggtccaggac accatcctca 1380 

aagcgccagt actgactacc ctaagaggtg gggaaagtga aaaggggtat ggaggccgct 1440 

ggctgggcac agaggcagag tcccttgccc taaggcagcc gttcccaggt ccaggttcac 1500 

tgcccaggac ctggagtctt ggggctgccc tgtgctcaca gagctcccac caggtcctgc 1560 

agggccctgg gtccagcttc cctgtccacc ctgtccctgg gagcaatagc tctcaaaccc 1620 

tccctagatg ctttctaccc tggcccacag cccctggcac cttgaagagg taggtcttcc 1680 

cctgacagtt gatgcgggtg aaggcggcat cgatggggcc ctcgatgccc cagacatctc 1740 

ggatgagctt ggggtaccca ggcctcactg ccttttcgtc cagttcanag cagtactgcc 1800 

ctagagtgga ggagatggtg tgagagcagg gacgctcctg gggcagaccc gcatccccag 1860 

tacctgccct ggattcacct cggaaggcaa agagggaacc gttcttgagg tcggtgaagg 1920 

cgtcgaaggg cttcccactg cacagctcct cctctgctgg gggctgaggt ctccctggat 1980 

gaagggtctc aggccttgag tctatcccct caggcttaga ggcgcccacc tcaggcgcag 2040 

gggcctcttc ctcaggtttc agaacaggtg tctgctcagg attccctttg gactgggcct 2100 

ggaggtcaga ggtcagggag gggcccccca cctgttcatg gacagtggca ttgtttttct 2160 

cctcgccatc gtcatagacc gtgtactcat cctccggcat agtgaacaca tccccgcgag 2220 

tcactgcaga gagtggatgg tagtgagtct ccagcagcag ggggcacccc agcccaccca 2280 

cctgggctct gaacacacct tggggcttgc actcagccgt atagtctgtg cagcagctct 2340 

ggtagtaaga gcagagctcg tcacactggc acttcttgtc cacgttgaag ccctcagtgc 2400 

agcggccctt gcatgactct atgaggaagg agtgtcagtc ggtgccacca agcccagacc 2460 

accctcgccc tccctacatt gacccagatg gccaccaaca ctcccctgta ccttggtcag 2520 

ccagagcaac ccatgccagc agggccagta tgagaagggg tctcaggggt gccatggcag 2580 

ggcttctagc tcagtgcctg gcaagctggg ctctggtctc cctgaagtct ccgctctgat 2640 

gcctgaggaa gggagggaga ggcagagaca gggaaggagg gcactggaga agaggaactg 2700 

ccttttctgc tgctctgttt gctcaacctc cagcccatxc cctxtgcccc ctccagcggc 2760 

tgctgcagca aaggtcacat tcctggaaca ctgggcctgg gcgagctggg agataagacc 2820 

tttgccaagc tcagaatcat taggtcatcg gaaggggaat tagcaccgtg gatctggagg 2880 

gcagaaagaa ggctcattgg gcaacagctt tatctctctg agtcttagtt cattttcagt 2940 

aaaataggat taacaaagcc ttgcttcatg gggtgttggg agattacatg aactggacca 3000 

gacaaaatgc ccagtagctg gagcagccac aaattccttc tccaaacata gctattgatc 3060 

catgattgtt tgatcagaca catccctggc tgggtgctca ttcgccatta tttatttagg 3120 

gcaggaaaaa gggagtggga ggagagattg taagcactct ggggaatttt attttttagc 3180 

ataaaagaac aaagtttcat ttctgggttc ttctcttggg tgcatcaatc tccccaggct 3240 

ggaaatttgg gtaaatatca acccactgga ctggatttaa tttctgagtc ttttctgagc 3300 

aggccactxc ctgtccccag gctcagtctc cccatctgta aagagtgggc ttgttcaaca 3360 

cacttttaga ctctcgagag aaatatgatc cctctacctg gaaaatgccc actggcatga 3420 

aacacctgat tttgtgccca atttcaggtg tccacagaac ttcrgaaatc ctcccatcac 3480 

ctgaaaaaag tgatcctgag attttagctg actttttctg ttctgtaatc agaggggact 3540 

gtgatttggg ccaatttctt cctcatgggc ttcagtggag tcagctrgtgg aatggaacat 3600 

caattcccac cccataggca tgttgtgagg tttcgctaag aaacagccat aaaagggcct 3660 

taagagttat: gatgattgtt ttaaccattt agtagggagg gggcatxcag cggtgttgct 3720 

cccacgtacc gatgtcacgr ggggcgggga ggcggggcgg agaacggaga gcgtcctctc 3780 

attctccacc cccttcctcc aagtccagcg cagggggagg tggctgcgac tgactgaggg 3840 

gctagggaag gctggccagg ggcgtgggcg tggctgggga cggcttgggg gtggggctgc 3900 

gcaggaggct ggaggagccc cgcgggaccc agaggcgggg cgtcggcccg gggaccactg 3960 

ctcctccggg gcgtggctgc aggaggctgg aggagccccg cgggacccag aggcggggcg 4020 

tcggccgggg accgctgctc ctccggggcg tggctgctgc cgagcatctc ccagctcagc 4080 

cgagcccgng cccaggccac gctttgttcc agccgccgcc tcctctaccc tacggcgtcc 4140 

ggagccatcc ctcgcctgct cgctctctcc tttcgcccac tccetgcatc tgggcctgca, 4200 

tcacctttgc caaccgctcc cccgatcctg ccgacactcc tcccccaaac ttctgaccgg 4260 

cacccttgcc tggtaccctt ctctccattc ctccccctcc atcttctttc cccgacccct 4320 

Qtcgggtccc tcttttccca aaacccgggt ctctccgcgt ggccccgcct ccaggccggg 4380 

gatgtccccc gcggccccgc gcccatggtc ctgacgctgc ttctctccgc ctacaagctg 4440 

tfgtcgcttct tcgccatgtc gggcccacgg cggggcgccg agcggctggc gg , 4492 



<210> 32 ? ' , 

<211> 4448 

l <;212> DNA 1 ■ V ■ V, ^ f . 1 
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<213> Homo Sapiens 
<400> 32 

tcagcattcc aggagttgat gagcccctac ctgggcacca ctccctcttg ggcctgagaa 60 
ggccgggggg aacctgcaag gaatatgtgg aattctatat ggtgcctctg atactgcggt 120 
aatgtcaggg agaggtttgt tcaactggtc agggccctga aaccccgatc actttctcat 180 
ctctgcatct tgaaggctag cagagaggag gaaccctgag ctctctagga ggtgggtagg 240 
tggtggcagt acagactcct agatgtctaa acggagttgc agagagaagg cttrtctcggt 300 
agaccctgga agggactttt acggtagaac tttcatttta caagcaggga tagaggccta 360 
gatggggaca gagacgtgtc caaggtaaaa caatggacgg aggcagggca gggagaggcc 420 
aggtcctaga ctcggcctca gcacccctcc cacacacaca tcgacacaga agctggtcca 480 
gatttataat ttaatggctg tgcagatccc agtccctcat ttctgtcgct cacgtgccca 540 
ctggtctggg gtcagggttt tctgttcaaa ggcatggatg tgcgggagct cttctgctag 600 
gcacgcgttc accagcctgg ggcgagagat ggggttagga aaaggccagc ggtgatcgca 660 
caccccgagg ctagggtgac ccaagagact ggtgtcccca aggttgtagt ccccaggccg 720 
tgggaacacg tacttggaga gagggtctac atcgaaatac tcgaagcagg ggtcacgaag 780 
gcgggatcct caaggtccct gagccacgcc agggggagtg cgggagtcag gttgcagggg 840 
tagggggagt ggctggggtt gtggtacagg agctaggcaa ggtgctccca gggccttacc 900 
tgtgtctctg aagcagcggt ttcccctcga acttggccga caccaccagg actcggaagc 960 
tacaggagca acggttgagg gtcgtgtcct ccacctccta ccgagcggaa gaatatgaat 1020 
ggtgccgaac ccgccccccg agctcttttc cccttgtccg gcggctcaac tcaccacatg 1080 
ctccgcctcc aggtcccgct gcagcttctc gcggaggtat tcggcgctga gttccatggc 1140 
ggcagtccag ctggaacggc agcccagcag ggacacaacc ccagctcggg cgccggccac 1200 
gctaccttgc tgccttacag gagccacttc cgctggaaaa ctcacttccg cccttactaa 1260 
ggcgtacgtc aacgcagtac ttccgccctc aagcagccgg gctttccagc ggtccaggct 1320 
tttcgcgctt gccatgtgtc agccaatcag agcctgagga aggtgggact cgggcggagc 1380 
cgatgctgaa tggtacgcgc tcgccgactg gacagcagtc tggcttccgc ggtcggactt 1440 
ctacacccgc ctccagacag gagaggggca cgtaccggcg ctacggcttc ctgcaggctg 1500 
cctccggata gtccccgaga gcttgttccg aagcaagcac cctgcagccc tagcgatcca 1560 
gccctcccct ggaccctagg tcacggcaat caaccccctg ctgtggttcc cgaaccccaa 1620 
ggcccgatgg gtcccgcggg ggtcgcggcg aggccagggc gctttttcgg cgtctacctg 1680 
ctctactgcc tgaacccccg gtaccggggc cgcgtctacg tggggttcac tgtcaacact 1740 
gctcgtxggg tccagcagca caatgggggc cgcaaaaaag gcggggcctg gcggaccagc 1800 
gggcgagggc cctggtgaga gggggaggcc ttctgtgccg ggaggaaggc gtcccagagg 1860 
aggcggaccc cgcggggcac aggcctgttg agaaggaccg gccaggactg tgacagaggc 1920 
ggggcgtctg tggtggggac ggggcctgtc gcaggggagg agcgtgacgg ggaggcggtg 1980 
cccggggcat ctccgcggcg gaactcaggg aaggagctag ctggggccgg ggtgatgatc 2040 
caggctgggt tccagcatag ggctcttggt gggcacgctg gggtcgggtg gaatgcagga 2100 
gagagaggag gtgggacagg tgggtacctg ggctggaggc agggcctgag gtgggtaggt 2160 
gcagagggct gcacttctcg gctgaagccg ggaatgagga ccccgctctc gggtgggatt 2220 
ggaggggacc cgcggctgag gcgctgggct gcgacaggga catcaccgtt ctcctcctca 2280 
gggagatggt gctcgtcgtg cacggcttcc cgtcctccgt ggccgccctt cgggtaagga 2340 
aggagaccgg gcagcggcgg ccgggtgagg gcttgggttc cgcccctcgc tccgagccgc 2400 
cctgatgccc ctgtacgccg cccgcagttt gagtgggctt ggcagcaccc gcacgcctcg 2460 
cgccgcctgg cgcacgtggg gcctcgcctg cgaggagaga cagccttcgc tttccacctg 2520 
cgcgtgctgg cgcacatgct gcgcgcaccg ccctgggctc gcctcccgct cacgctgcgc 2580 
tgggtgcgcc cagacctccg ccaggacctc tgccucccgc cgccgccgca cgtgcctctg 2640 
gccttcgggc ctccaccgcc ccaggccccg gccccaaggc gccgcgcagg tccctttgat 2700 
gacgcggagc ctgagccaga ccagggggat ccaggggcct gctgctccct gtgcgcccag 2760 
accatccagg tgaggtcccc ccgaggaatg gatggctcta gagtccagac gacttcggat 2820 
ccagctcttt cttgagggaa acccactgac acgctttggc caccctatcc ccatctctaa 2880 
gatgctgatg ctataggact tacggccatt cctgagcaaa gcaggccctg ctcagggcct 2940 
tcaccarctc cccttcaaag aaaaggttct tcccraggcc atcagtactc cgctgcactc 3000 
cagtgtiggtx gacagagcat gaccctcttt tttttttttt ttttttttga gatggagtct 3060 
agctctgtcg cccaggctag agtgcagtgg cgcgatctct gctcactgca agctccgcct 3120 
cctgggttca cgccattctc ctgcctcagc ctcccgagta gctgggacta caggcgcccg 3180 
ccaccacgcc tggttaattt tttgtatttt tagtagagac ggggtttcac cgtgttagcc 3240 
aggatggtct cgatgtccta gcctcgtgat ccgcccgcct cagcctccca aagtgctggg 3300 
attacaggca tgagccaccg cgcccggccc tttttttttt tttagacaga gtcttgctct 3360 
gttgcggagg ctggagtgca gtgatctcag ctcactggaa gctccgcctc ctgggttcaa 3420 
gcgattctcc tatctcaacc tcccaagtgt gccaccacac ctgccgaatg tttgtgtzttt 3480 
tagtagggat ggggttcgcc atgttgggca ggctggtctc aaactcctga cctcaagtga 3540 
tctgcccgcc tcggcctccc agagtgctgg gattataggc gtgagccacc atgcctggac 3600 
atgaccctgt ctctaaaaca atgtaatgaa ctgcagctca ttcttgtgtg cacttttctg 3660 
cacccccttt cccttaacac ttactattgt ctggcatgct acgtgttttc ttgatctcat 3720 
tagcctctcc caccccaact gccacactgc actatcagag ggcagcactt atcactgctg 3780 
tragaacaaag tcctgcactig agccgatggc ccagaaattt tctttttatt tatttartta 3840 
s e i -te 4 2 
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tttttttttt tgagacagag tcttactctg tcgctcaggc tggagtgtag tggcgcgatc 3900 

tcagctcact gcaacctctg cctcctggtt tcaagcaatt ctccttcctc agcctccaga 3960 

gtagctggga ttacacgcgc ctgccaccgc gcctggctaa tttttgtatt tttagtagag 4020 

atggggtttc aaccatgttg gccaggctgg tctcaaactc ctgacctcag gtgatcctgc 4080 

ccacctaagc ctcccaaaat gctggtatta caggcatgag ccaccgtgcc cggcctaaat 4140 

atttaataaa ataatggacg atgggtgcct tctactgagc tcccggtaat tgtgagtgag 4200 

tagaggactt gccctgggga cattcagtga cctgctgggt gttgctgagc tgtgaggaag 4260 

ttcaggtctg gctgcagtgg tgaggctgtg actcaatcaa tcactgctga tgctcccagg 4320 

acctgcacca gcttagtcct aggggcaagg attttaactg tccacctcag tttcttcatt 4380 

tgtaagatgc aaataacagt cacccctgcc tcatgggatg gagctgtgta atgcccgcaa 4440 

cagtgcct 4448 

<210> 33 

<211> 4001 

<212> DNA 

<213> Homo Sapiens 

<400> 33 

gagttcaatg cttgtgattt ttcttttttt aaaaattttt gaggcaacat ttatatccag 60 

tgaaatgtac agatctcaag tggatcattc catcagtttg acaaatgcct gcaaggtgga 120 

acccaagctc ttattagtat ctgttaaatc catgttagag gttgatcaga gaagcacagc 180 

ctttaagact ctcaccctgc tttgcggaga atggcaaaaa tctgcgaaac atctcagcag 240 

agccaggaga gagtaacact ggtaacgtga agcacgtcaa gctaagaaag gaagcactga 300 

aattaattgt cctgaatgga acctgataag aatctgccat ttcatcccct caagactgga 360 

ctcaagttgg gagagaatat tgccttttgc cctgtctaag ctaatatcac tgtttagatc 420 

tacaaactcc cccacctccc aatgtttata tattaacctg cattacaagt caaaataaca 480 

cagagaatct ggaaaaagaa gaaggggaat gtttcccacg aaaagcacac cccttaaagc 540 

gtctggcctg agagctatac agcatgcata aaatctgata ataagcacat tatgacccac 600 

attgcatata acaaaaagta gggcacttcrt gaagggtcct gagaaaaagc cctgggcttt 660 

taaaacctgc ttaagaaggc gatcaacggt acaccacttt caaacgcagc actgacttgt 720 

atgctccaca gtcatcaaca gxgaaaacaa cagttttcaa caagagttgg aaacactgag 780 

ttgagggttt tttttttctt tttaaagttt gaggccaaaa tcgatattta ttcttttgaa 840 

ttatgcttat atctttttcc tggttaatac gcacgcggtg gcacaatrttt gaaaactacg 900 

taacgttata gaaccaccac ccagaaacga atcctgttaa caaacttttt gtattcaaac 960 

tttttccact gcatagtttt: tgacaatgct ttttagacaa tcnttctttt agcaaccagc 1020 

actttcaaac aaaaattaca gagaatagta agrttttttc ccctcctccg gcagatttga 1080 

cccagaaatt gctatgggaa gaaagtgtta attatattaa aaaatagttt gacagaaagt 1140 

atttaaaaag agaaagggag aacatcacgt ctttattttg gtgaattagc aacaaagaaa 1200 

aagattagca tggacgggta cttttcaaaa atatattttt tttcctctgg ctcccgctag 1260 

ggtggaggaa gttgtctctc tgtcagagac agggtggaag agagtgaaag gacaaatgat 1320 

tgagaggctg ccccctccca ctggtgcagg cgtgcggggg tcggatgggg ggccgcggag 1380 

gggggaggtg gccagcaggt gcctgggctc caggccccta cccccaccct cgtccccacc 1440 

ctcacccaca cccagggatt ggcctcgccc cctggcgggc gagcggtiagg tgtcgaagca 1500 

ctggggtggg gggtgcaaac cccgcgggca gcggaaaaga ggccgtgggg ggcctcccag 1560 

cgctggcaga caccgtgagg ctggcagccg ccggcacgca cacctagtcc gcagtcccga 1620 

ggaacatgtc cgcagccagg gcgcggagca gagtcccggg caggagaacc aagggagggc 1680 

gtgtgctgtg gcggcggcgg cagcggcagc ggagccgcta gtccccxccc tcctgagtga 1740 

gagaatgcca tacctaggaa tacagctaac cagggaagtg aaagatctct ataatgagaa 1800 

ttactaaaca ctgctcaaag aaagcacaga cgacatcaac taatggaaaa acattccatg 1860 

atcatggata ggagagagca atatcattaa aatggccata ctgcccagag taatttatgg 1920 

attcaatgtt attcctatta aactaccaat gacattgttc acagaactag aagaaactat 1980 

tttaaaattc atgggccgtg cccggggcgc tgcccccgca gcctgcggcg cttccgcccg 2040 

cgcccccgca gggctccccc tgcgccgctg ccgccggggg ctcgggcgcc tgcggtccgg 2100 

cgacggcagt ggctgcagcg ggcacggccg aaggaccggg aggcggtggc tcggcccgaa 2160 

tcgccgtgaa gaaagcgcaa cuacgctccg ctccgcgggc caagaaactg gagaaactcg 2220 

gagtgtactc cgcctgcaag gtacgcgctc gccgctctcg gaccgcggat gggtgctagg 2280 

ggcccagccc gcgggacccc cctccccctc ccgcttccac ctccgcctcc cgcctcctgc 2340 

ctctcgcctc ccgcctgggg ccgctgcacc gcggaagtgc tcttgtcgcc cgcgcccaat 2400 

tagcttcttc ttggataaga gtcctgctgg gtttgaagaa gggggatcac taagacggag 2460 

agcccttcat tccttccgct tgaaaggagt agcttcggca tccgagctcc cgggctacct 2520 

tggggttttg ttgctctgta ggagctcgct ggagtcactt tcccgggagt ggagcggtgc 2580 

tctccatgtg gcgggtgacc caggggcaag gaaaaatctt ttgggggact gagtggtccc 2640 

ttctgcaatc gatcccttcc tgtgtcccac tgggaggaac tgggtgcaag gagtgggggt 2700 

ggagaagact cccggactcc tggcgctttg ggaaagcgaa ggggaggaaa gcgcggggtg 2760 

ggaaggtggg cagagtxcga ggcgagggtc ttgtggtgcc tggcgccctc ggctgggggc 2820 

ggaggcactg GGgcgcgcgg tgacagccct gtcattattg tattattact tgtttagtta 2880 

aaacgctgtc ccagctgctc gaggtgtaaa caggatttgg caaagtigaca gcccagcgcg 2940 
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gcgggcgcta 
ttaggcaccc 
gaggattgcc 
gacagggtga 
tgaggcaggg 
tgtaaagagg 
accacttctg 
ttcttggtac 
acaccatatg 
ttgttgagtg 
tcctggggaa 
agtttgcatt 
ttcagacctg 
ataataatgc 
tgcttctaag 
caggacaccg 
tgggtacagg 
agaacttatg 



ctgctcagtt 
tgtcacctcc 
cgaggccgcg 
cagctgaggg 
aagagacaca 
gtcaagttcc 
aattttgaca 
ttgcaggagc 
ctcaggttta 
aatgtaggat 
gctggttaaa 
tctgacaaac 
gacaatatat 
tatgtggtta 
aaaggtgcaa 
actgtagtag 
gcatactgct 
taatcaaaca 



tgaggacacc 
agctgcagat 
gggcaggcrtg 
gggcggggac 
ctcacacaca 
cagattccag 
taactataat 
ttctcagtca 
tgttaagcac 
ctagcccagt 
atgctgatta 
tcccaagtga 
tatttgtccc 
tttctgaaga 
gcccaggagt 
gcagaagggc 
tgggtgatga 
ccacctgttc 



seq_prot . 
catttcttgc 
ccagatgtgg 
gccctgcgct 
ttggccttgt 
ctcacacctc 
attcccgatt 
ttattctact 
caatggaatg 
ctctgtatcc 
gctcctcaaa 
tgaatcaggt 
tgctatgcta 
ctttgatttt 
aggaatgcaa 
tgatcatgaa 
gagctatgta 
ctacaccaaa 
cacaaaaacc 



txt 

tacccactct 
aggctgtgtt 
gtcagcagcg 
cagctcctgc 
cctgtattct 
gtcacaacca 
agaacatcta 
tctatgcttg 
tacactggtt 
ctttaatgtg 
ggtctgaggt 
gtccacagac 
agcagaaaag 
aaaagatgaa 
gggcattaga 
cagaattaag 
atctcacaaa 
t 



ccatgccagt 
cccgaggtgg 
gctttigtttt 
ggcctgtggc 
tcctgcctgc 
agctgcagtg 

ggttatactc 
ctctttttcc 
caataaatat 
catgggaatc 
ggaagctgag 
cacattccaa 
acagaaacaa 
gtggggagtg 
cgaatgagaa 
aacaaggtca 
tcaccattaa 



<210> 34 
<211> 4322 
<212> DNA 

<213> Homo Sapiens 



<400> 34 

ggcaagtcgc 

agcagccctc 

ctggtgctgg 

gaaccacacg 

tcctccctct 

actgggttgg 

gcaaaggtca 

catggggcat 

gtccttcact 

aatggacaca 

cagagaacag 

ttgaacccac 

ctxccacctg 

tgtccctctg 

taatagcttt 

ctgtcacaag 

ttgtgtctca 

ggacctggaa 

ctaatgtctt 

ggtccctaga 

atgaaggagt 

caggagcaat 

taggggcaac 

cgcccgacca 

ccgggacagc 

ggaagagcag 

tgggtgtggg 

gcctccccac 

gcgcgctaat 

ggtgtcccgc 

gcctggcagg 

gggtgaggtc 

ggcggctgta 

gggccacacc 

cgcgcgccgg 

aacgggaggt 

cgcccttcct 

ggtcagcgaa 

tgtgtccctc 

ccacccgccc 

ccgggccctg 

tecgagaccc 



ttctcctttc 

acctcttgag 

gtagccccta 

gagtgggaac 

ggctgcagtc 

cttactgtgg 

cttgaggctc 

gatgggtggg 

cagggaagtg 

accagggcag 

catgctcaag 

ctggtctggc 

aacaacgtgg 

ccccatctat 

caccaccttc 

cccctttctc 

gctgggaaga 

ccctcttttt 

ggctttctga 

ctttgagcct 

tatggctgac 

agagaggtct 

tgccagtccg 

gcgagaacct 

gggcttggcg 

gagtggggcc 

gagcggggga 

cgggcctcag 

ggcgggagga 

gcaggcagag 

gcgaggcctg 

gctcggcccc 

gcccacatct 

tcactggccg 

ggagcggcgc 

aagtgagggc 

txccccttcc 

agtcccgccg 

cgctagacag 

gtgtgtctcc 

tgtccgtccg 

gatccttctg 



tctgagcctc 

aactgttxag 

tcagcgatag 

aaaggtrctgg 

cctggcacag 

acctgaagga 

accttgttita 

gtgggatggg 

ccgtgcatta 

gtagaggttt 

gcctgggtgt 

cttttctccc 

cagcccttct 

cccgcaggag 

tcattgctct 

agctcaggcc 

gctgtctaga 

tgtctcctgg 

gctctgggga 

aggggacact 

aaccaagttc 

caaaggtggg 

agtaggggca 

ccgaccctgg 

gcgggcaccg 

aggagaggcg 

acccccgccg 

cgtcctcctc 

tgcgaagagg 

gcggagggac 
gcgggggcgg 
gccccctxgc 
cccgagcgac 
cttggcccat 
ccccgccgct 
cgggtccggg 
cgccccagtt 
cgccgagacc 
tgggggctga 
cgctcctccg 
t^tgtcgcca 
tccctcgcca 



agtttgctca 
actgcccaca 
ccatgattat 
tggctggacc 
ggtgggccta 
ggagccagaa 
ccctttcctc 
atcaggacac 
caactgggca 

gggggtgggg 

gaagagaaag 
catccatctt 
gtcccccatc 
cccccttgag 
gtgagaagac 
acctcaggga 
ctttctctat 
gaagtgacat 

caggggaatg 
cggtgaatag 

catgtctgga 

ggcaggaaag 

gttcccagaa 

agtctaacca 

cgcccggccg 

ggccagctgg 

gcccctccca 

tgtaaaatgg 

gccaggcagg 

gctgtccccg 

ggcctggcgg 

acttccgtgt 

ccccggcgcc 

cccagtcagc 

gccgccgcga 

cgcgggatcg 

cccgtttccg 

tgccggtgac 

ggtccttctg 

aggggcccgt 

cccactctgt 

cccgatccgt 



tcaccaaaac 
tccagcaccc 
tcatttcctt 
tggagctcct 
ttgtttatgt 
ggaaacccga 
cagtccaacc 
agaacatgct 
tcagggagga 
tgggaaatgg 
tctttcccaa 
ccgcaatctc 
cagcctctcc 
cctgtcatcc 
caaagtccaa 
ttgtccccca 
cccttctagg 
ttctttggat 
gactccctag 
atgaatgagt 
aaggaggctc 
cagaagactg 
ccgcctgagg 
gctcagtctt 
cgtccgagct 
ccagtgggga 
ggccccagcg 
gccccttctg 
gcagagggac 
attgcagctg 

gggcggggcc 
gccgcggcgc 
cgcccgccgc 
gccgcgccga 
cccttggcgc 
gggctacccc 
aaactgagaa 
tccgtccccc 
tctgcccgct 
ccgtccctcc 
gttctccccc 
ccgtcccttg 



tggcactaac 
agcacacagc 
cttccatggt 
caagggtcag 
tgagggagag 
tcccaagtgg 
ccagggaagc 
aatgaatatg 
gggacatttg 
cattcaggcc 
aggatagtcc 
aaccagggtc 
tctgccctgc 
tgcctggttg 
tggcctgtct 
cttccatccc 
tggttcccca 
gagtaraagg 
aaccccaact 
gaatgaatga 
aggcgcctgc 
ccaatcgggg 
agcagtgcag 
tggggcagag 
ccagccaaca 
gtcccgggcg 
ggcggaggga 
agggctggct 
cgggagactc 
gcggggcgag 
ggcggcacct 
cggagcccga 
gcggaggccc 
accccgtccg 
ctgcccctgc 
gagggcgcgg 
actgagccgc 
tgtccgtctg 
cccggcccct 
cgacttctctc 
trgcgcgcctc 
ccctccgcgt 



3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4001 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2,400 
2460 
2520 
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gctccatcgc cccaccccca cccacacccc tacccccgcc cgccgccccc tcgatcccgt 2580 

cctcccggag ccgcgccccc ggcccacgct cgccgccccc tgccagtcca gccgcctctc 2640 

cgctctaccg cggggcccag gccaggccgc ccgccgggcc tcgcctcccg cccctgggtc 2700 

cctggaggcc cgacccgcgc ggcgcctcct cctcgctgtc tcttctttcg gtctcaagag 2760 

tctttctcgg tcccttcatc cgtctctggg ctgggcgtgg ctccagtctc caccatcccc 2820 

cttcccaccc cacccctxcg cccctctgag acccctccca gggtggcccc tgcccgatac 2880 

ggtctgtact cgcccccaac ccagatgcca ctcccgttgt gggggctcca ccagatgagg 2940 

ggaggaggtc gggggggggg gtcctcctgg ctggtggtgg agcggcagaa ggagagcacc 3000 

atcgqaactg ctcccacttc cccagttccc tctctcttgg ggcaggggag agccccaccg 3060 

agccccgcct taaacctggt cagaaggatc tgctgtgagt gggcgggggg tccccttctc 3120 

tgacgcctac cctgtatgcg tggaagctgt ctccttgtgc taagtcactg gtatctgggt 3180 

gtgtgggtat gttgtgtgtc tgtgtcagct tctgtgctgt ctgtcactgt gggacttgcc 3240 

tgtgcatgtg tgtccttttt cgcatgtgtg acttgcttgt gtgtatgtgt gtatgtctgt 3300 

ttgtgtcctg aggctctgtg tgtgtctttt gcaactgtgt gtaccattct tggtgtgttt 3360 

gttgtcacca gactcgtgtg tgtgtgtgtg tgtgtgtgtg tccttgcctg ctagtgtgtg 3420 

tgtccctgtg tgtctggagg tttcggtgtt gcacactggg tatatcatgg tgaccaactg 3480 

gccatgagta cgttagatct gttaattgtg gaagcccctg gtgttgggtg cctacccacc 3540 

agtctcatgt gtgagtgtat ctgtataagt gtgtgtatgt gtgtgtactc acgtgtggct 3600 

ctccctctgc cactgagttt ctgtgaccag gcccacctgg tcactaaggg aggggactga 3660 

tccctggggt ttcctcgttc ctgctgtgga gtgagctggg gcttggtgag caggccagca 3720 

gagggaggct gtatggtgca tgtgtgatat gtgtggttgt gtgtgtgcgt gtgtgtgtga 3780 

cagagaggtg tggtggtgtc tgggcagctg tacctggctg atgggtggtg tgagggctat 3840 

gcgatgggac agrgctgggg taagcctggg agtgtgtggc cacggtagga gtgtgtgact 3900 

ggtgggacac agtctatgaa actgccggtg tatgcttggg gagtgactgt gattgtgggg 3960 

aaccgagagt gcctgggcgt gggtgttagc ggcaggaggg ggtgtgacaa ggtgtgtggt 4020 

gacagtgctg ggagggaagg tgagagccca gtgtgggggt gtgtgtgtga gtgtgtggga 4080 

gaccatgagg atgtgggtgt gcacatggag agggctagag ggcaagtctt ggggtttgta 4140 

gtgttgtgac caagacctcg ggctctgtac ccaagaatta tttgggctcc aatacttgct 4200 

tgtgcctgtt aagagcaaga gacccaagct aggcctctca aggctgagtg tccttatctg 4260 

cagaatggag agagtgagga caaaagttcc tgtgttaaag gtcctttaac acaatcttgg 4320 

cc 4322 

<210> 35 
<211> 8467 
<212> DNA 

<213> Homo Sapiens 
<400> 35 

atgaatcaaa cctttctctg gcaccaagaa gggacataca gtattccgag ctctgccctt 60 

gttttcttat caagttttct ctaacagctg gaagtggttg aagggggctg tcttaaggag 120 

accccctgct cttcttttga cagctgatca aaagaaaata ggtcacgctt ctcaaactta 180 

tgtccaccct gtccttaatt actgtctctt taaacaaagg caacatctgc gcaacctcag 240 

ctagcttgaa tcttcggagg caaacagagc gcaacctgct ttggaagaat cttgttacgt 300 

ttaaggcttt atgccgggtg ttggcctctc tgtcttcaac tacccctcca ccccgactcg 360 

gactgcagaa atctccaact ctctgtcccc cagttccact cccacctcca gacctggggt 420 

tgccaaacaa cggaaattct aggaagacgt aggtgcggtt tttaaagcct tggcttgtga 480 

gcgttctcag gctggccggg gcgctggtct tttcagaaac aaggtttgaa tatgcaggtg 540 

tcagccagga ttccttgtcg tctgcccacc gccgtttctc tcccgggatt tgagagaagc 600 

gggagtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgcgcgt gtgtgtgtct cgtgttagga 660 

gaaaggtccg tggttgccgt ctcagctggt ttgcttactc tctgccccca ggagaagttt 720 

gctcacttgg gacaccagcg accctcctca cctccacctc accggttcta cactctcccc 780 

acttctttcc aagtcgctga tgcagaaagc acttggtcac cttgaaacgg cagctcccgg 840 

aattctagtt ttgttttgca atctagcagg gctcttcgga agcctcataa ctgagattta 900 

tgggctcacc gggtaattaa atgatgttat tgtgttaact ttgcatgcat tactgctgcc 960 

cgcgccgccc gtgggccgcc ggccggccgc agctcgctgg cgacgagggc actacagttg 1020 

ctctgaccgc gtagattatg catgtcccgg cctcgggaat ttaccatgca ttagaataca 1080 

ttagcgcctg cattttaaaa ggctaaacta ttggctccca gctagggact ctcggtaagt 1140 

ggcttgttag tgacgagtgt ttgtctatac tggcacatag cggagtcttt tgctcccggc 1200 

ttactcgcct ccaggaaagc tttggggtga ggcgaaggcg attgaagcaa tgccccttcc 1260 

cccagatcgc agctgctcag gggggacaca gcacggcatc tttcaccgaa tctctctcgc 1320 

tcgctcgcac tccagcctcc ctctccccag cgaccccccc accttctcct ccctccctct 1380 

ccttgacgtt tgattccagt agcaaaggag gtaaaaaagg caccgagccg tcagccaaac 1440 

ctgaaaagtg cggccccgcc ccctccacag ccactggtag cttcccgtgg aaggcccgcc 1500 

tcccggggca gctgcggcct cggagtggtt gcgcttggcg cccgtcgggc gtggccccgc 1560 

cccaggtccg ggagggtagg ttggctgccc cggcgagcgg cagagccctt ctggacagct 1620 

cccgctcacc caaacagaag acgtcggcgc cggagcgggc tcggacatgg cgaggctgcg 1680 

agccggcccg agcggcgggg cccggtgatc cctccctccc tccccgtccc ctcccctctc 1740 
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ccgcacgcac gccccgtccg cccccacccc gcccccaccc cgggcgagcc cgcccgcagc 1800 
ccggggcgca cacccgcacg cgcactcctc tccactcact cccgcgcccg cccccactcc 1860 
cgcagccgag ccccgccacg cgcgccttgc ccgcccgccg gccgcccccg ccgcccccgc 1920 
cgcccccggg ccctgatgga ctgaatgaag gctgcctaca ccgcctatcg atgcctcacc 1980 
aaagacctag aaggctgcgc catgaacccg gagctgacaa tggaaagtct gggcactttg 2040 
cacgggccgg ccggcggcgg cagtggcggg ggcggcggcg ggggcggcgg gggcggcggc 2100 
gggggcccgg gccatgagca ggagctgctg gccagcccca gcccccacca cgcgggccgc 2160 
ggcgccgctg gctcgctgcg gggccctccg ccgcctccaa ccgcgcacca ggagctgggc 2220 
acggcggcag cggcggcagc ggcggcgtcg cgctcggcca tggtcaccag catggcctcg 2280 
atcctggacg gcggcgacta ccggcccgag ctctccatcc cgctgcacca cgccatgagc 2340 
atgtcctgcg actcgtctcc gcctggcatg ggcatgagca acacctacac cacgctgaca 2400 
ccgctccagc cgctgccacc catctccacc gtgtctgaca agttccacca ccctcacccg 2460 
caccaccatc cgcaccacca ccaccaccac caccaccagc gcctgtccgg caacgtcagc 2520 
ggcagcttca ccctcatgcg cgacgagcgc gggctcccgg ccatgaacaa cctctacagt 2580 
ccctacaagg agatgcccgg catgagccag agcctgtccc cgctggccgc cacgccgctg 2640 
ggcaacgggc taggcggcct ccacaacgcg cagcagagtc tgcccaacta cggtccgccg 2700 
ggccacgaca aaatgctcag ccccaacttc gacgcgcacc acactgccat gctgacccgc 2760 
ggtgagcaac acctgtcccg cggcctgggc accccacctg cggccatgat gtcgcacctg 2820 
aacggcctgc accacccggg ccacactcag tctcacgggc cggtgctggc acccagtcgc 2880 
gagcggccac cctcgtcctc atcgggctcg caggtggcca cgtcgggcca gctggaagaa 2940 
atcaacacca aagaggtggc ccagcgcatc acagcggagc tgaagcgcta cagtatcccc 3000 
caggcgatct ttgcgcagag ggtgctgtgc cggtctcagg ggactctctc cgacctgctc 3060 
cggaatccaa aaccgtggag taaactcaaa tctggcaggg agaccttccg caggatgtgg 3120 

aagtggcttc aggagcccga gttccagcgc atgtccgcct tacgcctggc aggtaaggcc 3180 

ggggctagcc aggggccagg ctgctgggaa gagggctccg ggtccggtgc ttgtggccca 3240 

agtctgcgcg ccgagtcact tctcttgatt ctttccttct ctttcctata cacgtcctct 3300 

ttcttctcgt ttttatttct tcttccattt tctctttctc ttccgctctt cccctacttt 3360 

cccttctccc ttttcttttt ctttcttact ctxtccttgt ccctgagctt tcattgaccg 3420 

accccccccc atttcattcg ccctcccctc aatgtgccaa cctttgccct atttccgatc 3480 

ttcccaggta ctgggaggcg ggatgggggt gtgcgttttc ctctaggagc cctgtctttc 3540 

caagacccac agaaaccagg acctgccctt attcaaaacc ccatgcactt caagtctctt 3600 

ttagacaaca catttcaatt ttccgggctg actagtctcc ctgtgcagag gcagttgaga 3660 

ggctttgctc tgcagaggga aaagagctct ctactctccc acccaccata taggcaaact 3720 

tatttggtca ttggctgaag gcacagcctt gcccccgcgg ggaaccggcg gccaggatac 3780 

aacagcgctc ctggagccca tctctggcct tggcgttggc gcagggactt tctgaccggg 3840 

cttgaggggc tcgggccagc tccaatgtca ctacctacag cgagggcagg gtgtaaggtt 3900 

gagaaggtca cattcaccgc tttgggagga cgtgggagaa gagactgagg tggaaagcgc 3960 

tttgccttgc tcaccggccg tccttgcccc ggtcccagcg tttgctggga tttgccagga 4020 

tttgccgggg ctccgggaga ccctgagcac tcgcaggaag aggtgctgag aaattaaaaa 4080 

ttcaggttag ttaatgcatc cctgccgccg gctgcaggct ccgcctttgc attaagcggg 4140 

cgctgattgt gcgcgcctgg cgaccgcggg gaggactggc ggcccgcggg aggggacggg 4200 

tagaggcgcg ggttacattg ttctggagcc ggctcggctc tttgtgcctc ctctagcggc 4260 

caagctgcga ggtacagccc tctattgttc taggagcaca gaaacctcct gtgtgggcqg 4320 

c 228i g £ gcg a 9 c tagaggg aaagatgcag tagttactgc gactggcacg cagttgcgcg 4380 

cttttgtgcg cacggacccc gcgcggtgtg cgtggcgact gcgctgcccc taggagcaag 4440 

ccacgggccc agaggggcaa aatgtccagg tcccccgctg ggaaggacac actataccct 4500 

atggcaagcc agggtgggcg acttcccatg gatcgggtgg aggggggtat ctttcaggat 4560 

cggcgggcgg tctaggggaa caattcgtgg tggcgatgat ttgcatagcg cgggtcttgg 4620 

gatgcgcgcg gttccgagcc agcctcgcac agctcgcttc cggagctgcg agctcaggtt 4680 

tccacccccg atcccccggg ctttcctcgc accgctgagc ccagcttgtg gggtgcactc 4740 

gaccaacgcc cgacagggct ggggaatgtg acaggcagca ggttcacccg ggcttgggga 4800 

gggggagttt ccgctttgac agcattttcc tttgccgtct gctggtggat tcctattccc 4860 

agtcggtaat cgccccgcag tgttgatcta agaaggtaaa gaaaactagg tttccctgca 4920 

aagagcctcc cccaaatcgg cggactccgg atactttgag tggatttaga aatttatgta 4980 

atctttctcc tttagtttat ttttcatcct ctcctacagt tttctctgat ttgctgttgg 5040 

ttcggggcaa gataaagcag ccagtagaga gcgataataa tagcggcggg aaatgaactg 5100 

gagactggct gacagttctt aacattttgt catagatccc cccgaatgtc ccaggctgtc 5160 

^Sl9S}922 t "tagtaccc gccggcttct tgggcaccgg ggaccagaag gaacttggca 5220 

gctggtctta ggggtacagt taaaggcagg atgacagcta ttctcctgct catctcagag 5280 

cgctgccgec ccctcatgcc ggtcgcgcaa agaacacagc ttttaaaaaa cacgtgcctt 5340 

§S55 atat a §gtctgaaa gtgatgagga aagtaatgct tcgcctatta gcgagtttca 5400 

gcttttaaaa tgatcccaag cgttgctgag atgagaaagc gtggcatccc gggggtcctc 5460 

agccccaccc gcgcccatgg tgcaagtctg cagggacagg cccgggacag cactgcccac 5520 

gctgctagat tttccgcaga ggatcgctga agctgccttc gtgggagaca gaatgcctcc 5580 

tccagcgagt ggaaaaggcc tgctgaggac cccgctttgc tcgagcattc aaatgtgtgt 5640 

ctgttttatt accctgggtt gaaaagggac aagagcttta gcctttttat ctggccattt 5700 

tatcagcaac tacaagtgtg ttgagtggtt attattacat aggaggcttt tcagtttggg 5760 

gtcagtagat cagtctcttc agacactgat gcagaagctg ggactggtaa gtaggtatta 5820 
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tgtgctcgga gcgctagggg acaggagcaa atggagaaga aaagcggagg ctttctccgc 5880 

ccggagtatc gatcggaatx cccgccggta cgccgcagag ggccctcgcc gttgggcccc 5940 

gggggtttaa caagcccagc cgctccgcag gcggctcggc cggactctca gaccggtgcc 6000 

tggaagacac cgtccctgcc cccctcccgc caaacctgcc tcttctcttt ctctcatagg 6060 

ttataggttc cctttctctc tcattttggc cccgcccccg ggtcctgcca aacagccaag 6120 

caggccgggg tttagggggc tcagaatgaa gaggtctgat ttggccagcg ccggcaaagc 6180 

tcacccttag gcgaggtcac aacagaggca ggtccttcct gcccagcctg ccggtgtagt 6240 

cacagccaag ggtggcactt gaaaggaaaa gggagaaaac ttcggagaaa tttagattgc 6300 

cccaacgtta gatttcagag aaattgactc caaatgcacg gattcgttcg gaaagggcgg 6360 

ctaagtggca ggtggttgca accccgcccg gtcgggcctt cgcagaggtt ccccaagacc 6420 

agcccttgca gggcggtttt cagcaacctg acaagaggcg gccaagacaa atttctgcgg 6480 

gttcgagcac acactctcgg gcgttgggcc ccagagacct ctaaaccaag cacaaacaag 6540 

aagggagtga gagaacccag gctagaactt gcacgggcat cccactgagg aaaagcgagg 6600 

cctcggtggc aggcatgttm tcttccgacg cccgaaaatc gagccgagcg cccgactaca 6660 

tttactgcag aggtttccgc ctccagtgag cccggatccc ccagcggcct gcccggagct 6720 

ggtctccagt ccccgccgta gtccgacgca cggccctctc ctggcagcaa gctcccagcg 6780 

gccagtctga agccaattct gttcaggcgg ccgagggccc ttagccaacc caccatgatg 6840 

tcgcctgggc cacctgatgc ccgcagcggc gggacacggc ccgggcagtg cgcagtggct 6900 

cctgctaggg gcaccgcgng cgtgcttgtc tcccgcugcg ccggggacgt ccttgggtga 6960 

cacgggccgc tgggcacctx ccaagccgag gaaacggacc cccttcgcag agtctcgcgc 7020 

ccacccccca acctcccacc tcgtttctcg ctgctagggc tcccgactca gcccacctct 7080 

cctggcggtt tagttaggga tcagagctgg agaggctgaa cgcaacccgt gccagtacgg 7140 

aacagacgat atgtttgcct gctagctgct tggatgaata attgaaaagt tcgctgcagt 7200 

ctgtgcttcg tcaagtcccg ggtgccggga gaacaccttc ccaacacgca tcagggtggg 7260 

cgggagcggg cagaggaggc gggacccgag ggaggagagt gaacccgagc aggagaagca 7320 

gcccaggcag ccaggcgccc tcgatgcgag aggctgggca tttattttta ttccaggctt 7380 

tccactgtgt ggttatgtca ctttctcaaa caaatgtgta tatggaggga gatcgatgct 7440 

gataatgttt agaagattaa aagagcatta atgctggcaa caataacgta aacgtgtgga 7500 

cccagatttc attgatctgg aacttgatcc ggcgcgtttc cagtaagccc gacggcgcgc 7560 

tcttcccagc agagcgctca ccagcgccac ggccccgcgg ttttccagcg gtgccgcttc 7620 

gccagctctg cgcgggttct cccgtctgac cgcagctcct cccccgcgag gccccagccc 7680 

gccttacttc cccgaggttt tctcctcctc tcgcggggct ctctgccctc tgcaccccct 7740 

cccccgacct ctgcaccacc cgcccctgtg cgcacacacc gctacttgcg cttccggcga 7800 

tccgcctggg cggctgggtc cgcgaagcca atgcgctgaa cggtgcccga gtcttcctaa 7860 

ctatcctgtg cttggccgtt gccactgggc cctggtgact aagcccaagt ttgaaaatga 7920 

cgtggctaaa gctgcctgct aacggcagag ctnaatcagc cgcagatccc ctcctatcct 7980 

catcggttct cgtactaaaa aggctcacgc gcagactttt tacgcaggtg cattcgttgg 8040 

acaattaaac gtggccctag taacaaaagc ctgagcttca tccctccgag tagcaggctc 8100 

tgcgctggac tggtcgggtc taacagggag aatcttgtgt cctaccttgg ctgggacagg 8160 

aagtaagaga agcaaactgg ggaacccctg ccccacccct tcccaccccc tggacgtcct 8220 

gggccgaggc ccgggtcact ggcccatcgc gggtaaggag gtgtcctgrg gaacacctgc 8280 

attgactgga aaagaaagaa ctttaaagct cttccttccc gcttgtgggg gacaccacca 8340 

caccctgcca accactcccc tggccaaatg gtggcttgtt tttccagaag gagcactaag 8400 

gtgaatttta tggaaaaaaa ataaaagatc agggacctaa gtgggagtag gatgagaact 8460 

aattctt 8467 

<210> 36 
<211> 6456 
<212> DNA 

<213> Homo sapiens 
<400> 36 

taaaaccctt caaaactgaa aagaagaaag gggcaattgg agaattccca ctttttctgg 60 

ctgtctcctt caagtcgccc agtttttatg aacagcatct agccttactg tcactatcaa 120 

caacccttaa aactagccaa tgcttcggcc tctagtattg gaaagtcttc caaataggat 180 

actggaaact tctatttata agcttggggt ggcgggcggg gcggggaggt ggagagagag 240 

ttgccatcta caggtttcta ttttggcctg aagactcaac tgcagtcatt agagtaaggg 300 

aatgcccatc tcctggtact tgttcgccat ttcctcctcc cccagagaca aatatctttt 360 

cgtctttttt aaaaaagtat atattttaaa gcaagaatgt gatttcatct ctcttctttg 420 

agctcatgtt tgctacctcc aggaatagcg tgtggactag ggccagatga acttcaactt 480 

gggctgcaga tttacgaggt tctgttctag tgccaaaggc tcttggtagt aaatagtgag 540 

caaaatagat acctgtctcc tgatggatct tgccgccccc tctttttttt tttaagttat 600 

ttattaaaac cacacacacc ttgcaaagaa aaagggaaac tggcagtctc tgtagaggaa 660 

gccggtggca tcgctcagag ccacaaactg tatttctaaa cagccctttc cctggttccc : 720 

tctctcctgc cccacttttt ttaaaatcca gactgtaaaa aacacatcta ctgacactca 780 

ctttacttta aaaaaagaag agaaaaagta aagcgttaca agactttcct cctggaaact 840 

ataaactgaa aaaaaaatcc atraaaagatt aaatcctrggc gggttgtggg gtggcggggg 900 
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ccggcgggga 
cagcttggga 
ccacccccaa 
cttttttcat 
taaaactcat 
ataaaggcaa 

ggggtggccc 

agccgatttc 
tcccccattc 
ctatttttgc 
gccgggtccc 
aggaacgcgg 
tctgaagcct 
atgggaaaaa 
gagtcacgca 
gttctcccag 
aggtgcagtg 
txtcctcggg 
gaaagaataa 
cagggaacac 
agacacgata 
gagttcgctt 
agaccgaggc 
ctcccggagt 
agggagggaa 
gagggaaaaa 
agaaagagcg 
actagcccga 
txccctcctt 
caaaagcctc 
aacaatctat 
cctgacagct 
ggagccaaag 
tgagcagcgg 
gcagcggcgg 
catcaccact 
agcttcggac 
utttactgcc 
atatttcgag 
aggggcctga 
accatcacgt 
ctggaattct 
tcccggggtc 
tgcacaaaag 
gttagagagg 
gaatattagg 
tatttttatt 
tggcttggga 
tgccttttgc 
tgnacgtgcc 
atactaatac 
cttttttttt 
agtgtgtctc 
tctttccagg 
tgttgcaaga 
tttctacaaa 



gggggcgcgg 

ctttcagcac 
accccagcca 
ttttttaaaa 
tattgcaaca 
cgcaacccac 
cragctcctg 
tgaggctcca 
ttactccctc 
aaaatcactc 
ctcagccacg 
gctggggctc 
ggcgtccact 
ataaaataaa 
gcgcccaaaa 
gctgtcctgt 
cgcacccctt 
atgagcaggg 
ttttcaaagt 
caaaacactc 
cttcatctca 
gcattttttc 
cgatggaaga 
tacccagaag 
gcggaggcca 
acacagccgc 
agagcgcgag 
atcacccccc 
cctctcctcc 
cccaaaccgg 
atcctatggt 
agaataaata 
catttggcaa 
cggcggcagc 
cagcaacagc 
tctaagaacc 
tttaccactg 
cctggccaac 
gtcctcataa 
aggtggctct 
gggctcctcc 
gagaactaat 
tggggctaat 
ggagcgagag 
cgagtaagag 
caccacgaga 
taggacaggg 
tttttggttt 
aagtttgtgg 
ttttctttcc 



agatttttgt 
ttaagagatc 
tgtatttaga 
ttgaagctga 
ttctcttttt 
tgggtcctag 
ttttcccctc 
taaaagaagg 
tttggaccag 
tcaagcgact 
ttaatttctza 
tataatggag 



tattgaaaga 
ttttttttcc 
tcctgcactt 
acagcaagtg 
aacgtgtgtg 
attctgtgag 
tttcgaggct 
agccggaggg 
acagactgcc 
gttggggggt 
ctctattact 
tttgtttgtt 
ctactcatcc 
gtgtgagggg 
aagcagcttt 
tattaaatta 
ataaagatat 
gagtgatcag 



agtggagatt 
ctggtttggg 
ccaacacaac 
aatcggtrttg 
ctagttccat 
agttctcaag 
ctgccccccc 
actctaccca 
gaggagagcc 
acaaagatct 
gccacaaagt 
tgttcgtctt 
acgctcaggc 
atttaggaaa 
tacaaacacc 
cgaggttccc 
tcccacccac 
agagcgcgcg 
gttcaacatc 
ggctctcatt 
tctggattta 
cttccaaatc 
gagccagcgg 
gacaggggaa 
ggagcgacgg 
acgaatccag 
cgcgcgtcct 
aagcactgtc 
acctcctttc 
ccgattcact 
gggagagacg 
aatatataca 
gagcggaaaa 
ggcaccagcg 
aataatcacc 
ccagttctaa 
ctacgtggca 
caaaagccct 
gcagagcgtc 
ccctccccgg 
tctgtgacgt 
ttgctattcg 
tccttctttc 
gtttgaacca 
aaaaataaaa 
aaaatatttg 
gtttttgctg 
ctgtattttg 
tgttacgtaa 
ctatctaatc 



txt 

ggtggtcagg 
tgctcaactg 
gggaacacaa 
tgttttccat 
ggttccaata 
cggtcactac 
cccgcccgcc 
gggccgccgc 
aatccaacca 
cccgcgcccg 
cctcctgagt 
caaggtaagg 
ctttttacag 
agccattggg 
gaaatcccgc 
cgcacactga 
cctgtcccgc 
actccctcga 
agctccccaa 
cggctctgaa 
aacaaaaaca 
tcgaaggcag 
cgggcagcct 
aagaggcgag 
gcagtttgca 
agccgtacgc 
tggggccaga 
tgctccggcc 
aacaacacaa 



agcttaaatt 
tttggtgcaa 
cagagaaatg 
gtgggtttaa 
tgggggggag 

ccaaatacct 
cgtttcaagg 
aattctaaag 
acaactacaa 
ggggtggggg 
ttctttttaa 
cagagctgct 
ctgaataagg 
aaaataatat 
caaaggtcag 
cagatgaatt 
gtctaaaata 
agctgtgaaa 



seq_prot. 
ggctctctga 
gtcatttatc 
catcgtagaa 
gttgtgtttt 
ttttcgccag 
acatttacca 
gccctttctc 
ctccctcccc 
acaggttgca 
ccctttcgcg 
gcccttctct 
tctcctcgcc 
ccgcagttcc 
gggaggcaac 
gcagcggcca 
tgagtccccc 
cccaagaagc 
gaggttcccg 
cccgccccca 
aggaagatca 
tgaccaaaaa 
tcggttcggc 
ggcgagcgag 
gggaaggaag 
agcaaggaaa 
agagatcaca 
ctccgcggtc 
tcgtcctctc 
caaaaaccaa 

caaagacaac aataataata 
tgggactaat cttcggcatt 
tttatatcaa tagatacaca 
aaaaagaatt aaaaggtaaa 
gcaacagcgg cggcggcggc 
tggtgtccgg cctttcctag 
gaatcaacag agctcaattc 
ggggaggact tggtgtcagc 
caaagccaca agattttttc 
tcggatntgg aggttccggt 
gcccaagacg atggtatggc 
cggcgccttc gctgtagcaa 
gtgacataag agggggagtg 
ttacccataa actcagcaga 
ctgggaaaag tatgttatat 
taaaataaac atcacagctc 
ccaagcagtt ttcggtgggt 
ttgttctggg tttntttctt 
atggtttatg gatttttgct 
atcacaggat cggcatcggt 
cctcaagcgt tttaaagatg 
tttggccata tgtaacaatc 
ttttcttttt cccccttgga 
ttcaaaggat ttgttttggt 
tctgttattg ttgactcttg 
ggtgggggtg gcggggtggt 
gtttgtgttt tgttttctct 
tcgttgtaaa aaaatctctt 
gcctgccgtg ccagtatcac 
tgcactacac gcagcacagt 
gggggtagag aagaaggaga 
aggcaattgc agtgcagaat 
agtttcctta tttaaatgtt 
gagactcttg tttcttttat 
ttcagattcc tcaagaatta 
gctgttctga gccttggcta 
cttaaagctt ccacaaaatc 
atctacctat ctcaacccaa 
gattgctgtt tgctgggata 

48" — 



ggccctgtga 
tcaggacccc 
cacagagggt 

gggggagctt 

acacggcatc 
atccggcagc 
cccggagctc 
cgccgcgcct 
acctcgcaat 
ctcctcccgc 
cttgcacata 
acctcgggaa 
agctrtgcacc 
agccaacaca 
cacctttctc 
aaggcatcgc 
catcagtctc 
ctacaaccaa 
aacacagggg 
aggaaatagt 
aaaacccaaa 
ggaatctaaa 
ccctttttgc 
gaaaaagagg 
agcgagaaaa 
aagcagcagc 
cagcccccag 
gccccctaat 
gggagggtgg 
ataaatacat 
tattttaaca 
tagaaaactt 
ataatgatca 
agtagcagca 
aaacttcttg 
tcggaatttg 
tctccgagat 
actggccggc 
tcgaggctcg 
ctgctccgcc 
agctcggcct 
cgctttgctt 
tcgagctaaa 
atatagtagg 
ttuccaactia 
tcatttgctt 
tctggtgtgg 
tctgattttt 
tggatttttt 
tattatttca 
ccagccccca 
cttttgctga 
ttggtttgtt 
ggaaatttct 
atgtgtgtgt 
taaggtcttg 
cagtctgtgt 
agatactgcc 
cttcttagct 
aattctttcc 
catttcattg 
ggttcgggtt 
gtctctctct 
gcccaaaatg 
tgagagtcct 
tcgtcacttg 
aatttgggtt 
tttggaaaat 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 



s eq_prot.txt 

ctctttcatt gacagtcccg taagaagttc catctatcac cgaaaatgtt gttggcaatg 5040 

caccccccta ccccacaaca cacacacatt tttaaaggcr gtattttagg ttttaaaacc 5100 

cagaaaggtg caaagaggtt gattattttc cacaagggct ggagcttagg aaatatgcgt 5160 

taaagtgtct gtccctgtga aatctgaata ttttaatact tattgacaga tggattactg 5220 

cagcctctgc agaaaagcct gtcttgtgaa cggtttttga aaaatataca taggcctaat 5280 

cctqtcactg ttqttgaaat gtacgtttta cccccaaaga cacatgatat tgtcaagttc 5340 

agatttattc tgccagaagg cagactatct gctttttcat ttgagtcccc ttccttctca 5400 

ttttcaaaga aattaaaatg aatcccgctg gatgcaaaat gacaccccga ttttgtggaa 5460 

gggattttct ttgtgtgttt atatgtgtaa ttttttcttt tatgagcaat tagcacttta 5520 

qgatttctca gacgtatcta tatggtttgg aatgtgaatt gtatggctct gccttactat 5580 

tgggccctca aaagattatt tttatgattt ttggtcactt ttttctttgt caaaactgcc 5640 

attcagcctc atccaccgcc cccctcaacc tgaaacacag cttgtccctc ctgtgttccc 5700 

rttgccactc cactcttagc cccatggggg cgctgcagag caaagggaaa tctgctctct 5760 

aaggcagtag tagaaccagc cagagtgggg gctgggaagg aattxaccct tctcatacct 5820 
cagccctctt taattctctg cagtgagccc ctagagggag aacccaccag ggctggttca ^ 
tatataatta acctcatcac cagggacctc aaatcttccc agcatattat ttctgggacc 
aagtgccaga gtgccacatt ttcctactgc tttttgccaa atcaatgtgt ttacactgaa 
agaaattgac cagaacagaa gaccagttag aaagaaagag aaaatcarta ttgaaaattt 

tctaaagata ggcacttact tttcggtgag tctcttttac tttgaactga atgaatggtg 6120 

aggaaaattg tgcacccgcc tctcaatctc ttttcaattc ctaggaaagt aactttgtca 6180 

aatattatac cagttgaatt tcttccagaa tgtcatttct tctttcattc ttgacctctc 6240 

ctcagtatcc agttggcatc actatgtggg cctccctcac tcccgcagtg ccgtgttaca 6300 

ggattaaaga aagggctgac ttgaacgtat tctgattctt ttaaatxatc cttaagaagc 6360 

tctaaagagt gattccatgt ctggggtcgt atgaataatt cctgcagact tcggggcact 6420 

acatttcaac tagtcccatg atttaatggt tactgg &4 5b 

<210> 37 

<211> 4001 

<212> DNA 

<213> Homo Sapiens 

<400> 37 



ctttcttgac catgaaaatt tagatctcta tcattcttca gaaatgtgat gagtgtgaag 

accactttga atatxctcaa gtxatcagtt tagctttttt cttatggatc acagcactga 

ctttgtcaac tgataaaaac aactagaaaa tgctttaacg atatggcaca aaccatgaac 

atcagaattc attagtcctt cttctataac ctccctttat taacatgtaa ttgtaaaaga 



5880 
5940 
6000 
6060 



tgtatttctt ccttrttcta gtxcctacca atctctgaaa aaaaattcct rtaattagaa 60 
attctgtgag actgttcttt ccttagtttt gactttaatg tccaagaaag atggcatcac 120 

240 
300 
360 



900 
960 



agaataagaa gggaatgggg gtiaatggatc taggctcaaa acacatacct cacctttgcc 420 

tctctccatg gtttttcttg gcatgtatgc tgtacattct caatacttct aagacactaa 480 

aaactcatta atttaagcat atttaattca gattcagttc tagttcagat ataatgcttg 540 

actctcaaga gttcaaaaca caactaaaaa tgcaaatatt tgaaattagt gctatactga 600 

gagctcccca aaacttaaat attaagacac atatttttct tactaagacc tgactcccaa 660 

tatatatttc ttaaggtaat taaagggaat acttattttt gtttgaggaa aaaatatttt 720 

gtaagctaaa acttttatzta acacagtacc ctattacatg acttcttcct cttattccaa 780 

aataatgggg cttaattttt tgaagcgcag tctttccagt aataaaggcg gacaaatttg 840 
cttccttttg caagtagaca ctttttaaag gtaagaaagg gaagattttg tcagcccagt 
ttattacagt taatgtctct ttcaccaagt gtccaactgc cccgaaagcc agatgtttca 

gaaagccaga tgtttcacag ctgttctggt ggcaattact tcaagacagc ggtatcaaag 1020 

gctctgattt ttagcttcct catctaaaaa atggggacat caattacttt ataacaaagc 1080 

tctgatgata caatgagatt: acaaataaaa tatcctgtga tgtttgactc actacttaac 1140 

taaaatcttt ataatggatc catattcctg gcctagctac catacaccct gtcaagtggt 1200 

ttccttctaa gaaaaactgg ggatgggggg gttagggggg tgggagagca gcagggagta 1260 

qqaggtaggg aaggaggaaa acaagcaaaa tcagtacata tacaagctxt cccatctttg 1320 

attgcctgcg ttttttgtgc tttcctttct caagtttcct caggctcatg ttctgaatct 1380 

cctcccccac aaatgaaaca caacggatac gaattacaga gaattttacg gaagacactg 1440 

gagcttaagt ctgcagatta gttttttgtc tacaagcgga ggcgactgga aaatcaagcc 1500 

aaataagcgt tggataactx taaacagcca aagaagcttc aatgggatag ggtccaggtt 1560 

caccaaagga ggcaacacca gcaactgacg aaaccatagc gacggctgac cgaccatgac 1620 

tcgaggtaca agacgtcaaa cctaaggcta ctaatcaacg tgcgtaagaa gcggccccaa 1680 

ccgacccggg cgatgccaac cgcgcaacca ctggtcgccg cgccacccca ggcccaggaa 1740 

tagcccagtc cgcaacaggg agacctcaac cagggagagg aggagcgggg caaaggggct isoo 

qggtgtcccc caagccgctg cgcgggacgt ggaagggacg aaagaggtga ggaagagtag i860 

taaatcaatt caatacactc acacttctgc tgtgctccca ggactacgag aagcgggtag 1920 

ggggcgactc ccggcctcrt tcacttcgta aaggccttca actgggtctc tcggtcaacc 1980 

cctcagctac egccatcttg aaacctcgcg cccctccgca ctcctcacgt canttcccat 2040 

tfcctctgtcc cgccttctct tagcgccgct tgaccctgga accatagaga atacccaaga 2100 
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attccaggac 
agagcccgcc 
tgagtggctg 
tggtacgttg 
ctgtcttttg 
cgatccttta 
ttaatccgct 
aatagacagg 
tagagtcgca 
agaaatacgg 
tattggctat 
acgtacggga 
ttccgtgagt 
tcttagagga 
attcttacca 
cagaatcttt 
tgttaggaaa 
atattgtagg 
gcgaaggtga 
gtgatcatag 
gtatgttgga 
taagatatgc 
ctataacaaa 
agagtcctca 
attcagcgga 
agtgaaatgg 
ttgaaaatga 
aatgacaaat 
tgtgttttgc 
ttaattatca 
agttttgatt 
tacagatcag 



cagagtctct 
tcattgcgct 
acggtcctgt 
agccgtccga 
gaattacctt 

ggggtgtctc 

ttgtaagcat 

cagaacagaa 

gattgcttat 

ggatacatat 

attcccgact 

ttcttcactt 

cagcccccac 

tacgcttggt 

ccagttataa 

aaactgccct 

ttcttgattt 

ttaaagctga 

gctaagttgt 

ttttgcaact 

aatattaaag 

tgagcagttt 

cacagtctgg 

tgggaattat 

accattaata 

cccacctaat 

aattatattc 

gctttgagta 

tttcttgtct 

cagttggtca 

tttaccattt 

cttcctgtcc 



tccggttcca 
catctagtac 
ctctcgagtt 
gaacggtaca 
agtttttgca 
cgtgtgtaaa 
caggttatcc 
cgtttgtaga 
atgggcggtg 
gcagtcggtt 
ggtcttttca 
aactctagcc 
ttatttccct 
tatttctaat 
atatttcaaa 
tgaaggtata 
gattttagac 
taaaccaggt 
aatgttttgc 
cgttttgaaa 
tctttcatgt 
ttatttcaat 
catcaacagc 
tggttggcaa 
ggcttccata 
gcgtcatgta 
ttttttctga 
gtgaccttct 
tctaaatgtt 
gggttgattg 
tcaaatccat 
taattgaagc 



seq_prot. 
tacaatactc 
aataggaatt 
ggaggcatct 
ttctgcagat 
tttttctgtt 
cgaataccca 
catgtgagcg 
ccacctgaac 
atttattatt 
ttaagttata 
ccagccaaaa 
ttcgcccttt 
taaaggtaag 
cttaacctga 
atattgatgg 
agatcggtcc 
tgaatttgag 
agtacataaa 
aactagtttt 
aacaatgcaa 
agctaaaatt 
acatttcttg 
atattaataa 
aggatttaga 
gcctttggcg 
gcaaaggcag 
aagcttataa 
attctgcatc 
tcagtgtttg 
cagtgactga 
ccggtttaca 
acattgttac 



txt 

aattggttgg 

agaagaagat 

tgacttagtt 

aaagaagcgt 

tcttatgacc 

gataacattt 

cctagtacgt 

aaataaatta 

tgttagtagt 

tgagtggcaa 

gtcctctatg 

ccctaccttc 

tttaaacatt 

ctatcccatg 

agatacatac 

tatacctaaa 

tgtcagaaag 

aaagaaaata 

ttttctttcc 

gaagagaaag 

ttatatgtag 

tacttggatg 

ttttgtcagt 

aaacaaacac 

ttttctaaaa 

ctgcctcctt 

taaattatga 

ctaaatgcaa 

aacattcact 

aacaggacct 

ttgtgtaatc 

a 



ttgtcggcta 

agctgctaac 

gctgggagcc 

tctgcgtttt 

caagcaggtg 

gccactgaqt 

gtggaaattt 

gcacccgtat 

gttttgaaga 

ggttgatata 

atgtcactct 

tatcccctct 

taaaaaacta 

taggaaggct 

taatrttttag 

cttacctttg 

caatgacgag 

atcttgaact 

taacacatta 

agaaaacagt 

ctactatttt 

ggagaatcaa 

atagactaaa 

atctgcagag 

tcaggtaatt 

tttgctagtg 

ctttctttta 

tatgtgtagc 

tgagcgatat 

tgatgttttt 

agtagacgtt 



<210> 38 
<211> 4341 
<212> DNA 

<213> Homo Sapiens 



<400> 38 

tcatggaagt 
gtggggtgcc 
gaatgggagc 
ccctttcacc 
aacagaactt 
caaagaaagc 
ttcagacaaa 
ttatattgac 
gtgataagct 
ttaaaatttt 
^gtgcgtcca 
tcacctgggt 
ggccgtttta 

gg^gggcagg 

cttcttaagg 
atcacaatgg 
tactttaggc 
gatgcgatgg 
tattactatt 
aggccagatg 
ggaagatggt 

aggggctctg 
ccagtgagcc 

cgacgggggg 
cattagggag 
ttcagcaggc 
gtgctgcagg 
acagggtctg 



ctttagcggg 
aaccgctccc 
tcccaaaggg 
tttgcagtaa 
ttaattcggg 
agctagctta 
gacagtaaat 
tagtcatcct 
tttgagataa 
agcctatgtg 
tgtgaagaga 
gcaggcgggc 
taggatttgg 
ggtgaatctc 

gtggggaaga 

tggaatgtca 
catctggatg 
cctggcctga 
ttatttattt 
agccagactc 
atgggtgtgc 
cagccattgg 
ctgggaggga 
aaggacatgc 
cctccggagc 
ctgggttcgg 
tgcggggcca 
tgagtttcat 



ggcttgtaag 

aaagggaaat 

aggaccaccc 

aactctagcc 

ccaacagcag 

tatgtcgttt 

tacttaaccc 

ccaagctgga 

gcttgcatct 

ttacttctaa 

ccaccaaaca 

tgagtccgaa 

gaaggtaatg 

acaaagtaca 

ttacaaagta 

tcagttaagg 

tatacctgca 

caactattac 

tattttattt 

cagggaggcc 

ctggttaggg 

cacacaatgc 

actgaagacc 

ctaggttcaa 

accttgatcc 

gcccgagaac 

ggagctaggt 

ttcttcgcgg 



ggactgtgag ctgagctaag 
gccatctccc actcacagcc 
atgggtctgc ttgactcagc 
aaggaagaca aagagacctt 
gctcatgccc aaaatggctg 
gagatgggaa aaacaaggta 
gtgacaattc tgaggaaact 
ctagggttgg aggctggggt 
gcaacttgtt acaatgctgg 
atagcttata ctaaatgtta 
ggctttgtgt gagcaacatg 
aagagagtca gcgaagggag 
gaaaattaca gtcaaagggg 
ttctcaaggg tggggagaat 
cattgatcag ttagggtggg 
ctgtttttac ttcttttgtg 
agtcacaggg gatgcgatgg 
ctatgttatg tttattattt 
tccttccaca cacccgtttc 
tagaagtggg caaggggaaa 
gtgggagtgc tggacggagt 
ctgggagtcc ctgctggtgc 
cccaattacc aatgcatctg 
ggatacgtgc aggcttggat 
tcagacgggc ctgatgaaac 
ctgcgtctcc cgcgagttcc 
ttcgtttctg cgcccggagc 
egcggggcgg ggctgggcgc 

serine 50 " 



gagaatggag 

agttagccga 

cctccccaac 

tggagaccaa 

ccaaccccaa 

ggatacaggt 

ggcaattcag 

cccgaggcag 

gaggggctgc 

actgttttca 

gctgtgtatt 

ataggggtgg 

gttgttctct 

tacaaataac 

gcaggaacaa 

gatcttcagt 

cctggcctgg 

taagctttat 

caccctggag 

cgggaaagga 

tcgggacaag 

tgggatcatc 

ttttcaaaac 

gactccgggc 

gagcatctga 

cgcgaggcaa 

cgccctcagc 

ggggtgaaag 



2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4001 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



seq_prot . txt 

aggcgaagcg agagcggagg ccgcactcca gcactgcgca gggaccggtg agtgtcgctt 1740 

ctgggggcag cgcccagtaa ccgcgctagg agcgcggaga agggcattgg gagagcggcg 1800 

ttcgtggcgg agactagcgc tccggagcac gggcacgacg ggggcacctt ctcggctgct 1860 

agtaactaac aataataata atcataatca tagcaagggc gctgatgggc gggctcggag 1920 

cacgcctgat tctggttccc accaggctgc ccaggctcct gatgacgcat cagaaacatc 1980 

cccctaaccc gcggccttcc tgcaggagag gttgggaagg ggtgggggac ggggctcggg 2040 

ggaggtctcc gagggactct agtaagcggg gaagggcgcc gggaaagttt cagatccacg 2100 

gctgcgcggg ccacgagccc acccgaacgc cgaccactgc tttccgtcga cttctatttc 2160 

ctgggaacgc gcgaaagcaa acccaagtca gactgcggag gtcgctgggg agggaaggtt 2220 

caaggagttc tcgccgatcc tgctgaataa agggggttcc gagctgggcc gagatggggc 2280 

atgcgcggga agacccctgc ccgctgttcc cccccaccgc cccagtggat gccatgcctg 2340 

gggcctcccc ggcgcgtggg gctgacgcac cctcggggtx catcgtagtt ggccgggatc 2400 

gtggagtggg tgcggtggac gaagggaggc aggacagtcc cgggggtggc agaaggagcc 2460 

cgggcacagc tgagacctgc gctcccatcc caccaacact cacagcaggt gctgccgagc 2520 

tgggcaattg ggatggccca agttatttgg ttaaatttta aatcacgttt gttactggga 2580 

agtagagtcc agtgatgcta accgcgcctc tacctccacc accggtgtca gtcccaaagg 2640 

gctcctaaaa tggctgtgtc atctttcagc cttggaccgc agttgccggc caggaatccc 2700 

agtgtcacgg tggacacgcc tccctcgcgc ccttgccgcc cacctgctca cccagctcag 2760 

gggctttggt aggtagcagt gcatttggtc taaagggcaa gatgttctct cttttattca 2820 

taacaaattt aaataccagc agggtttggg gggaaaaacg ctttcagaag aaaaggtgaa 2 § 

tgtcagtcct gcaagagtta gttttaaaac tagactgaat tggcacatgt atacctatgt 2940 

aacaaacctg cacgttctgc acatgtaccc cagaacttaa aagcttataa aaaaagaaaa 3000 

aactagactg gattatgttg ggaaagtgta gcctcttcca tcttaggcat ttcctagaac 3060 

gtaggcagta ggtggtcctt attaggagtt ttgggagagg aagggggctg aatcctacct 3120 

cccatccctg ctcctctatg gggtctgagc tgaggaagct tcaccacaag gagagaaccc 3180 

cctgacaacc ctggatgcca cctttaccct cactgcagga attctgtggc cacactgcga 3240 

ggagatcggt tctgggtcgg aggctacagg aagactccca ctccctgaaa tctggagtga 3300 

agaacgccgc catccagcca ccattccaag gnaaggcaga aatgaagtgg gccgttgggt 3360 

tctttctttt ctttctttct ttztttttgag acaaggtctc actctgtcgc ccaggctgga 3420 

gtgcagtggc gccatctcag ctcactgcaa cccccgcctc ccaggttcaa gcgattctgg 3480 

tgcctcagtc tcctgagtag ctgggattac aggcacacat caccacacct ggctaatttg 3540 

tatgtgttta gtagagacag catttcacca tattggccag gctggtttca aactcctggc 3600 

ctcaaatgat ccacccgcct tggcttccca aagtactggg attacaggca cgagccactg 3660 

cacccagtca ggttcatrtt agttgttatg ttaaccaggt ttcctgcacc tgtgcgctaa 3720 

ctttcacttt cccaaaaggt ttcagggtga cccagcaggc aatgagtgat tctcaaattc 3780 

aggatttatt gugagagatt cacacacaca attgagcaga cattcacagt acaatgatta 3840 

aagggagtga tagggtaagg acccacagtg gaggctctgg aggccagccc actgacagcc 3900 

actccaggga gtccagaagt cccgctctag tgctgggtgg tggagggaaa tctgttcctc 3960 

cagggacctc gtcctcggct gcccagctgc caaagtcagg aataagcttt cagaaatctc 4020 

actgccaaga ttccgaaaac gcttcagaca ttgctagtcc cttgtcgctt ttgcgatcct 4080 

ccacaggtgt gcgtgccact gggtccttat tcactggggt ctctggtggc attgggccac 4140 

agcaagtgtt ccctcatccc cttagtctac cacacacatg cttaccactt tgaagaaaaa 4200 

cccctttact atgagcgaaa gtgagaaaca cgtatgttta ttgtttctaa agaaagaaac 4260 

ttaatatggg cttaatgcta cctagtgagt gcctccattt tgagacatta gggtcacaag 4320 

tcattattat atatcatggg c 4341 

<210> 39 
<211> 4433 
<212> DNA 

<213> Homo sapiens 



<400> 39 



60 



ggcagaaccg gtgtccacag ccctcggggc tcaacctcat caggcaggag gagctgagct 

ggccctgcaa ccgaagrcta aggtcagtga tggcagcgag gggcagagcc cagggcaggg 120 

gaaaccagag cccccaggac gcggccagca gagccctgtc cctgctgcag cgccgggcgg 180 

gggcctggcc gccatggccc ggcttccact caagacggct gtcgaggagg cccgcagaga 240 

ggcattagga cagcaacggg gcagtgccac ccccgcggcc ccccgagcgg aaggaaagga 300 

gcctcccagg ccaggcactg ccctcctggg caggagcgaa gcagggggga tgtccgcccc 360 

cctcctgatc cacttcactc ctcggagccc tggcagcgaa gcggaggcgg agacaggtgg 420 

tgtcagggcg tcetctcgcc aggccgcagg ccccgcgggg caacaggaca ctggcccctg 480 

gcaggcgggc gcggggccct cgggctcgat ggggagaggc cggggccggc ggcggcggat 540 

ggacgctggc tcgggagaca gagcccgccg cccccggaaa ctggacccgc tccgcctggg 600 

cgccgcgggg agcgtgnggg acgcggtgga cggggccgcc gccctggacg cccacgcgcg 660 

cggcctcccc acaggacccc eactcgccca ggagcccgca ctcccggcgc tgcccgctcc 720 

ccgcgccctg cagcctgggt ctcagacgga aggctctggg gccaagggtg gctggagcag 780 

ggaggcctcg ggggtccctg cccccggagg aggctggccc tgggtcagca gggaggtccc 840 

gggcacccgg agctttggcc cagccccaga ctccacgcgc ccctggctag agagtccgec 900 
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tcaaggtcgc ccactctcgt cccaagggcc gggtgccaca ggggcctacg atgccggcga 960 

ggccggggct gacagctccc gagataacag tcctgccgct gacctggggc ccacccqgcc 1020 

cccggagcaa gcaaagccgg ctgcagccgg ccacagccgc gcgccctccc ggagccgtga 1080 

gcctcgcccg cgctccgcct ccccgcccgc agctcccggc ccggggttcc cacctgaagc 1140 

cctgactctc ccctctcctt cagacttttt gcccctggag gttacccagg atccttccgt 1200 

gggcgaaaat ctcagagcgg cgccagcccc aagttcagcc tcagcccaag tcttaacttc 1260 

agctccagcc tcagtcctag ccccagccct ggcttcatcc cccagctcag caccaacctc 1320 

agccaccacc tcaacctcat cccccacctc agccccagcc ccagctccaa cctcagctcc 1380 

aacttcaacc ccagccccag ccccaagtcc agctgcagcc gcaactccag ccccagcccc 1440 

agtcccagtc ccaaccctca cacccccatc cccagcccta accccagtcc caaccccagc 1500 

cctaagccca gctccaactc cagccctaac cccagccgca tccccagccc taaccccagt 1560 

cccaacccca gccctaagcc cagctccaac tccagcccca accccagccg catcccctgc 1620 

cccagccccc acctcagccc caaccccaac cccagccgca tcccctgccc cagctgacgg 1680 

gtcaaagcct caggagagtg tggctctccc caggcgctac caggaggggc aggtctcagc 1740 

cagctgggga aaccttattg ccatggttct tagaagccac cccttcccca ggcaagacag 1800 

gccccaaggg agtgtcccga gggcggttcc cgggagcccc gtgggtccca gcacttccac 1860 

acactctgag gacagacacg gcccctcttc ttcagtgggg acagtcatag ggacaggtac 1920 

agggggcctg gttgaggctg gaggtcagcc acagccaaga agctccgaga ccaacggatc 1980 

gcccagccca gaccctcccc caggcctaag aggagaggga accagggaga aaagtctaga 2040 

cccgctgccc caagccgcga tgcccagggg ccccgcacag ccccccgcgc agaggccgcc 2100 

tggccccgcg gcctcctcct ctgcgaggcg ctcacagccg gtaccccagc tacggaaacg 2160 

cagcaggtgc gaaatcgccc cgagctcgga gcaggaggtc aggccggccg cctcggggga 2220 

ccctcaaggg gaggcgccgg gggagggggg cagccctgcc ggccgcagcg gggcgctcac 2280 

ggaaaagcag gaggaggccc ggaagctcat ggtgtttctg cagaggcccg ggggttgggg 2340 

ggtggtggag gggccccgga agcccagctc ccgggccctg gagcccgcca cggcggcagc 2400 

cctgcggcgg cggctggacc tgggcagttg cctggacgtg ctggcctttg cccagcagca 2460 

cggagagccc ggcctggcgc aggagaccta cgcgctgatg agcgacaacc tgctgcgagt 2520 

gctgggagac ccgtgcctct accgccggct gagcgcggcc gaccgcgagc gcatcctcaq 2580 

cctgcggacc ggccggggcc gggcggtgct gggcgtcctc gtactgccca gcctctacca 2640 

ggggggccgc tcagggctcc ccaggggccc tcgtggcgag gagcctcctg cggcggcccc 2700 

tgtgtxcctg cctctacctg cgcacctgca tgtgttcaac ccccgggaga acacctggcg 2760 

gcccctgacc caggtgcccg aggaggcccc gcttcggggc tgcggtctct gcaccatgca 2820 

S^SSi 9 5 tt S t 99 c 99 ggggcatccg tggctccggt gccaaggccg tctgctccaa 2880 

cgaggtcttc tgctacaacc ctctgaccaa catctggagc caggttcggc ccatgcagca 2940 

ggcccgagcc cagctcaagc tggtggccct ggacgggctg ctctatgcca tcggtggcga 3000 

atgcctgtac agcatggagt gctacgaccc gcgaacagac gcctggaccc cacgcgcgcc 3060 

5«H CCCgca 99caccttcc ctgtggccca cgaggctgtg gcctgccgtg gggacatcta 3120 

cgtcaccggg ggtcacctct tctaccgcct gctcaggtac agccccgtga aggatgcttg 3180 

ggacgagtgc ccatacagtg ccagccaccg gcgttccagc gacatcgtgg cactgggggg 3240 

SS£ 9tac J9 c 3 tc 9 acc tgctgcgggg cgtgggcgcc gccgtgatgc gctacaacac 3300 

agtgaccggc tcctggagca gggctgcctc cctgcccctg cccgcccccg ccccactgca 3360 

ctgcaccacc ctgggcaaca ccatttactg cctcaacccc caggtcactg ccaccttcac 3420 

ggtctctggg gggactgccc agttccaggc caaggagctg cagcccttcc ccttggggag 3480 

caccggggtc ctcagtccat tcatcctgac tctgccccct gaggaccggc tgcaglcctc 3540 

actctgagtg gcaggcagag aaccaaagct gcttcgctgc tctccaggga gaccctcctg 3600 

ggatgggcct gagaggccgg ggctcaggga aggggctggg atcggaactt cctgctcttg 3660 

!"" gga " a ^" cccct tctgctttaa aggttgtcga ttattttgaa gcccagactc 3720 

cctcagcctc tttctgcccc tcactccaca cccagactgt ttcctgactc aattccgtac 3780 

ctacttacag accctctcag cttgctgaca cccccotgtc tgtgggactc cctattccct 3840 

agagccaggg actgatgcgt ctccacagac aaggacttgg ctcgctggag ctctgctgag 3900 

ccgagagagg agggggtaga aaacattcac acttcctatg ctctgtcagc aggacaggga 3960 

gcaaaaacgt ccccaggcaa cgccctcgcc tctgggactt tctgcctgtc ctaaggcctc 4020 

cccaggtacc aaccccgtag ctatctgggt ctgtttggca ctgtggattc tcaagggcct 4080 

agaacccttg cctctgaaac tggtccgctg gtgcagccct gctgtctgca gctcctgccc 4140 

atacccccag cccacaccag gccaggccca ctccgggctc accaccctct gcagccttqt 4200 

ggggctctcc cagcccctcc agaagcccac cccacttctc gccaaccccc gatctctaaa 4260 

tgaggcctga gcgtcaccct agttctgccc ctttttagct gtgtagactt ggacqaqaca 4320 

££2222£ c « ttctc « t gtctataaaa tgtggacagt gglcgtctgt cicccaigag 4380 

agttgtggga gacaagatca cagctatgag cacctcgcac ggtgtccagg atg 4433 

<210> 40 
<211>. 4494 
<212> DNA 
<213> Homo Sapiens 
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actataaatt agcactacca caaaggtaag atgattgtga ctaggttaaa ccctcaacag 
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agggaaataa aacttaagag gaagctagat ttaaaaataa ataaataaat aaatattagg 120 

gtagtgaaac atcatataga rtagggtttg rtaaactcac acctctatca acataccagc 180 

caaaatcatc aacccagata ataaccatgt ttcccccgat agccaaggcc ccattzatcat 240 

tgcccttatt catttctttt ctacagccac ctgcagcaga cctatrttct aagctctata 300 

aaagacagtc aactcgtaat tacatctttt cagtggacac agagagaacc agataaacca 360 

ccctcaaacc tcagtagact gaattaccta cccttctacc aacacttcgc aaaatggcat 420 

aatgtgaatc tcacctccac cctttaaagg agttaacaat agctactgaa aaggtcccaa 480 

aagatggcag aaagagcata ttaatgcaat taaacagacc aaaaactaaa ttcgaatctg 540 

cctacatgct tcctcttttt atacagaaag aaatattgct caacaaagtt acatatttaa 600 

gttttgacaa aagcttcaaa aatacggtta tcatatgctt ttaattcctt aaatgtatgg 660 

tttttaatct gtagtaagac cattcgggaa aactggaata attctacaaa taaatgtaag 720 

taagtcatga gagaaatcag aattctctta gttaagtgca tttttttaaa acatggaaaa 780 

cgaaattctt taactcttaa aacaacctgt cttaactagc cctaaggtga ttcctgagcc 840 

acttgtgggt aaaaatcaga agagaaaaaa gttcaatgac cttgcaaatt agaaactaaa 900 

aaactaatgg tgtgatagtc tgtacagcct cagtggaatt acaggaaaac atctgcccgg 960 

aaattgcttt ttttcttttg gaaaattact aggaggctgg gggaaaagag gggaaaaacg 1020 

ggtaaggaag grtgcaaatg ttaagatgcc aaacaggaac ccaaccgaag tccgaagtga 1080 

aatacagcct cctctggccc aactgtcatg caaacaactt ctaagcatct tcagatgtaa 1140 

agaaaaacag tcattgtgca caggcgttag ctgaatgttc gagagttgga tgaatgggga 1200 

agacttgtta tacacctagt ggtgggtgga tttcactagg ctgcagcctc cacacagctt 1260 

ttctttgctg cactgcagag gcgccatctt ctccgtctta atcttcactc aattcgatcc 1320 

ttttatctgc accctaaaaa aaagaccctt cggaccctcc ccagcatcct gtgcccccag 1380 

gggaactaaa ccagagtccc cagagctgcc agggaagagg ccgcgcctta atcagacaaa 1440 

cgcctccttg gatgcacagg acagagcccg tgcctcccca ccatccgcca ctgccctttc 1500 

cagacgcaca tcccactctc cgccttattt cctccccgcc aaggacttta cagcccagca 1560 

cagaattaag cctcggaggt gatccgaggg tgggaagcag aggggccccg aggctccgcc 1620 

agaagccccg gcagcgtctc tgccccctcc caggcggacg cgcagccccg ggaagggagc 1680 

cgggcgggag cctcggtggc gcactgagac taacgcggct cggcccacgg cgagggcggg 1740 

gtgtcgcggc ttttgttccc gcggaccttc tgcctctcag acagagctta gggtgcgagc 1800 

gaactcccgg gacgtccaga crtggggaag gcgcggggtg gctatgggtc tccgacccct 1860 

cccatttctc gcaagttaca caaaagggag acgcggacat gcacaaagtt tgcctgtgga 1920 

aggcggtgcg cgaccccgac gccggctcgg gctagggtcg cacccaggct gccgccgcca 1980 

gcagccgcga aaagaggttg gagcaaggaa gggggatggg ggtgagagag gaaggtgaaa 2040 

cgaggcggag aacgcaggga aaagcgaggg gctcctagct: tcctgggctg tcccgactcc 2100 

tcctcctcct ccccagggcc caagagaaag gaaagggcaa cgattcaaga gcgaaggact 2160 

ggctttaggg acgctgcgtt rtcggttttc cccacgtgct ccttccccct tcccccctgt 2220 

ctgaatgtct ctcgcctccc ccgacccccg cttccaaggc agggactaag tcggggtctg 2280 

ggctcagggt cgccccctac ccttctcgcc ccgggctggc gcggaaccag ggagaccagc 2340 

gcctcgcccg ctccctctca gcgggccgga gcgcgactcc ctggggcagg gctgggccga 2400 

aagcggggat gcgctggacg tccgcaagcg ggggcggaga ggagaggggt cccattgacg 2460 

gacccccggc ttctctgcat taaagacttg ggttaaggtc ttggggggta tcgcgtxccc 2520 

accgtgcagc ctccccccgc cttccgaggc gcggcaccca cctccagcgc ccgagccgtc 2580 

caggcggcca gcaggagcag tgccaaaccg ggcagcatcg cgaccctgcg cggggcaccg 2640 

agtgcgctgc tgtigcgagtg ggatccgccg cgtccttgct ctgcccgcgc cgccaccgcc 2700 

gccgtctccc ggggcccccg cgcacgctcc tccgcgtgct ctcgcctacc gctgccgagg 2760 

aaactgacgg agcccgagcg cggcggcggg gctcagagcc aggcgagtca gctgatccgg 2820 

cccaccccgc tcggcacccg agagagaccc ctagcggcgc cgccggggaa ctgcgcccgc 2880 

tcgcgccggg aggggccctc gcgccccgcg cccacaggtg cacgcgccct tggcgccgcc 2940 

tgcaccccac gcgccccctc cgctccccgg ccgacggccc acctgggctt cgtgaacagt 3000 

gggagggaga gtctggggcc aggagaggga cggtgcagga tcagggaaag gtgagtccta 3060 

ggacgctgag gctctagaaa agtcgagagc gctcctgctc gtccccgtga gcttgaatca 3120 

tccgaccccg caggcctccc gggggtgtcg tataaaggac tgctgctagg gtgcgttttt 3180 

atccgcattt cgttttttct ctcggtttgg agaggtgggg caggcgtttc tggaagagaa 3240 

tgagaacgag tgaagcttaa aggaaatagg attttctttg cttgttgaga cagcaaaacc 3300 

tcatttttta acccccaacg tcaaaagcag gcacgagcaa cctggaattt ctatctttag 3360 

aacgaaccaa aggagcaagg cgcaggacct ggcaaagaag cggccaaccc atttacttct 3420 

tcttgaaagc agggttctgg gcttgggttt tgtttctttc ttatattttt gtttgttttt 3480 

ggttttgttt tgttaaaggg ggtggtcagt gtggattttg cggggcgggg ggtctttttt 3540 

tgttgttaag cagggatgta acatccatct gagttgggga agaaggaaaa atrtaggagag 3600 

atcattcgta ttcgaccccc gtaaatgagg acttctgacc tcaaacgctg ccctrgttct 3660 

tcattgtgtc tgtcctgaat tatagaaatg aaccttctgc catgccactt tctcctggat 3720 

taaacacaaa ccgtccactg tccagttagt gtccagarag tttagaatgc tctagaactg 3780 

cccagatata tcccctgctc ttgacctgaa gtagcattta gttaccaagc cacagcccac 3840 

tecacacagg gcttggagcg aaggactgaa gccagggagt gctctggccc ttctgagggc 3900 

tgcactgcag cctgccttct ctcccttgct cattgcgctg acaagggtgc ctaggcccgg 3960 

gaiaggatggc tcagecagcg gggtacagac ttccttcctg ggaagtcatt tctgtaccag 4020 

.ctcpcqtaaa /tgttgtcttg tgtgtctttt cccgcactag gcattcagtc ttgactcttg 4080 

gattalctttt gcccctgaat gtccrttgga tgtcctcaac aaataaggac aa^tatttat 4140 

Seite 53 



seq_prot.txt 

tgtcatgcag tatttgttta tacgcttcat gtcatttact tcctacaaaa gccaaagagg 4200 

ctgttactat tattcctatt tatacatcag gaaaatgaag ctaatgaact cccaaatcta 4260 

atagcaagta agtgacatga agctgggaca gcagggaaaa gcctaaagtg acaggacaaa 4320 

cccagttgtt cactttctgc accacattta cccaccgcaa ttccgtattt gttccgggtc 4380 

tgcatgagca aataaggcaa aaaagaaggt atctgaatgc aaagagaaac gtggtccaaa 4440 

gctacaaatc ttgcagagtc cactgcaaaa tgcaaacgtg agaccctttg ttac 4494 

<210> 41 

<211> 4489 

<212> DNA 

<213> Homo Sapiens 

<400> 41 

caaaaagatt taaaaagaca atgcacaaaa gatttaagaa ttaaaaagac aattctagaa 60 

agaggaaatt caaatagccg gtaagttttt gaaaaagtgc tcaaactctt tagtgtcctg 120 

ggcagaataa attaaaacaa caataagata aagccacatt catcaagttg ggaaaaagga 180 

aaaataactc ttggtgagac agtaaaagta aagggatcta ccatatactg ctgtcggggt 240 

atacattggt gtatctactt tggaaaaggc ataactagtg aagttgaaga taaacatacg 300 

tactctacaa aaacccatgg taatgtacat taagaataca tgtacaagaa cattcataat 360 

agcattgttc ataactacta aaaacggaac acaatccaaa tgtttattaa tattagacta 420 

aactgtggct catttccatt aagacttttt atactatact tattatacta tagattacta 480 

caatccaata taagaactag agctaaatgt actzaatagag ttatatccca gctgcatgag 540 

agaaaaagat acataatgat atgtaatata taatccactt acatacaatt aagaaacagg 600 

caaaacgaac ctaaattgct taggaatgct gacataggtg gttaagaaat actaagaaag 660 

aaagcagtaa atgattgcca taaaaancag tatagtgtta accactgaaa agaattgagg 720 

aaagggatga tcaggaatag aatggagagg ggagcttctc aggtgccaca gacgttctag 780 

ttcttgactg tggcagcgct tacatagagt tcacacttta tagtttttca ttatactatg 840 

tatcgatgtt tgtatgctta aatactaaat gtatggaaaa agacacaaaa natttaaaat 900 

tgtagctaag gcacattaac tgccttgctg gtattttttt ttttcattac aaagtggcag 960 

acaattggtt acactataaa gacagacaat tggttacact acaaagaaaa ttttaaagca 1020 

caggagtcta gttttgattc catcggacat gantaccgta cacaatacag aaganactaa 1080 

gggctaagca atgatgcata aacggggtcc actgcccaaa cttacagaaa gcaaaacgtg 1140 

gaaaattagt aactttaact gatragtgtac acctaaagaa cctacaccgg ccgggcgcgg 1200 

tggctcacgc ctgtaatccc agcactttgg gaggccgagg cgggaggatc atttgaggtc 1260 

aagagatcga gaccagcctg accaacatgg tgaaaccccg tccctactga aaacacaaaa 1320 

aaattagcca ggcgtggtgg cacatgcctg tagtttcagc tactcgggag ggtgaggcag 1380 

gagaactgct tgaacccagg agacagaggt tgcagtgagc cgagatcgcg ccactgcact 1440 

cgtctgggag acagagcgag actccgtctc aaaacaaaaa acaaacaaaa agaacccacc 1500 

caacagaatt aagtaccaac ataataaata cgaagaattt tagattettg gtttttaaaa 1560 

aacatacaaa gatgatattc cttcaaaata tctttacaaa acatattgag actgtgatgc 1620 

tttatattga ttgtatgaaa acaatgaaaa agaaccagca ctgtttcact ataaaagctt 1680 

tactaatgta aatttataaa tcctttctta aatattttga gttaattcta attttatgat 1740 

agaaatitcat tattttcagc aaaaacagct ggcatttggg aaaccaaagg ctcaaaaact 1800 

aagaatagta accaaagaaa cttgacaaaa cagtcctttt aaaactctca tctacactat 1860 

aaggggaaac tttgatcacg ticccttctcc ttcatcaatc gtagaactca acattraagga 1920 

ctacacaatc ccacatccct ctccgagaaa aagcaaaggc tttgtgttgt agcaacaacg 1980 

caagacatgg agggaagctc cactcaagac ttccctgcct gctccttccc caaagccact 2040 

ccagaatacc agggagggtt gagaggtaag gcatgaaggg cgcaatatcc aatatgagca 2100 

acgcgtgtga tgcatctggt caaaatgcat acagaggact tgtctctgtc cctagataga 2160 

agtcctccgt cctgcagtca tgagggtcaa ttgctgaggc ttcacagttc ccttctctct 2220 

tacactcgga ccgtcacgct cctcacctac taccccgatg cagaggtaga ctcaggatcc 2280 

ctgcacttgt caaggattcc tcggcaagct cacggggcgg gagtggccac aagacggagc 2340 

tcgcctggtc ctggccttcc cggcctatac aagcctgccc ccttcccaat tcccaatctc 2400 

cacagccttc catcctccca ctttcgattc accttgcgcc accgacgccc ctggccttcg 2460 

tttgcagcaa gtttaccccc accattacct ctcgcataaa agcctgcatt taccaggtca 2520 

aagaggggaa ccaacgcctg caggaatcgc ttcaccgaat cgcctggccg cgtcctctgc 2580 

tagactitcac ctgccgctgc ggaccgtaca caaccactcc cggcatgccc cgcgcacgca 2640 

ctacctctcc caccccgccc ctctcccgcc caaacacgtg acctcctttc gtctccgtcc 2700 

acgcccactt ccgttcctcc actrttccctt aggaaggagg agggagctgg gggtgttaaa 2760 

agcgtagcga cttcctcctc ctccccgccc ccgctcctgt acctcccgct acaatgtctt 2820 

ccgggtcgct agcgcctcga cgccttctgg gaaaatagct catttcctcc cctccccctc 2880 

ctcctgcctt caaccaacca gccacccgtc agagagggac atgcgcagtg agtgcctccc 2940 

gtctcttcta cccgaacccc ccctcccccc caagcagaga gaccccagca gcagcagcag 3000 

ctgatgatga agagagaggc agtggcagag ggggggcacc ttttatttct atttttaaag 3060 

ggacaggaca ctaattctac cccactrcaa ccttgaattc aggggggtgg ggggaaggcg 3120 

gctgagttcc ttcccccacc ctccagccct gagccctgag agggggattg agcctgagag 3180 

aggagaagga gtttcttctt cttcgaaaac ccccatccac gactcctacc ccctcacccc 3240 
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seq_prot.txt 

tccaactcgc ctccctccct ccaccctcct cctctttggc cgtgagagga ggagagaaag 3300 

aaaccaaaag cctcttagca acacagaccc tttgctgctg ctgttgctgc tgctgctgct 3360 

gttgctgctg ctgctgctac tgctgctgct gctactgctg ctgcttggcc ctggctggag 3420 

acatctcact acacccagga gcagccactt ccccagctct cctcctcctc cttcgcctcc 3480 

tcctcctcca ctccccccct ttattaccct ttgtgtcatc ctccacagct ccagggaagg 3540 

cactcaaaag tggggggcag gaaaggtaag tgtgtctgtg ggggcttcat tgccttcctc 3600 

tggctctgac tttcactccg gggcaacagt agcaccaaac cacatacgca gtggagctcc 3bb0 

ggggtgagga gggggtggtg ctgggggggg tgaaggaggg gttggagtag ggaggggtgt 3720 

gtgaggtggg gtgcccatcc tgtcagggga ggaagggcac ttttattttt atttgctgtt 3780 

gtcaaaagtg gtttctctcc tccatctaac atctgattgt gtcttgctcc agtgggggag 3840 

aatacaaaaa caaccccctc cttcctccaa tgaggcgcca gggaaagaga cagaaagagg 3900 

cacactttct agatgtcact taaaaaaaat tcattggaga gctctttttc tccaggaaaa 3960 

gctccactgc atttgtttct gagtgggaaa atgtcgggat ctggattgta ttggaactgc 4020 

ttatccattt gtagacctga gtgttttccc ctttttggtc ctgtatgaga tttggatatt 4080 

gacttcagtg ttggaagact tgattggttt ttgctttaag ggtttcattt tccatttttt 4140 

cgttttgtct gacttccatg gaaaatttca aatttttagt tggtagaggc ctttggttag 4200 

gttttgactt gcattattgc tttttttttg gttaaatgtc tgtttcatat atcagtatat 4260 

aaagtccaaa gctctaaaat gcttaaagtt taaatgccac tgctattgtc tgttcccaca 4320 

tacgttcctg atacttattt tccatatgat gagaatatta gctattttat aattaatctg 4380 

ggtatatttg taatctttgt aacttgtctg aacaagtaat ccccattgaa aactttatgt 4440 

tgttctggct gcatggaagg ccaaacctct ccatcaatag tttgctgga 4489 

<210> 42 
<211> 4395 
<212> DNA 

<213> Homo Sapiens 



<400> 42 



60 



ctccctctgc atgttttgtc caagttccac actgcactga cttctgtgtt tctgaggtgg 

tttgcttttc tgttaaaaaa aaaaaaaagt tttaaaaaga gccatcatcc aattttctat 120 

tgtactaaaa atagaggaca ttcttataat atccattagt aaattaattt ttaaatgttg 180 

tcttagccaa accaatcaat tatacatatt ataacaggtt cagatattct ggaatattaa 240 

cagtaggagg aaaataggaa aatacaaaat aattcgaggt cttccactag catttttcta 300 

agtaaacaca cctgttcctt tcttggttgc cccagacaaa tggaggatga ttaaacttgc 360 

tacaactgaa acagaaaaga aaaattagag ataaaataca ctgaaatgat caaatcccat 420 

aaaaacataa cataatttca agataaaata atatcaaagc agattaattt tttcattgtt 480 

aactgttaga agtcgattca gaacttgcaa aaaccaattt gaaatgtaca aatcaggaaa 540 

ccttgggact cccttaagga actaatgcaa gttaagaatg gatgagaccc aaactcctaa 600 

ggcaaataca tcattcactc taaaggcaac atgtcaattt actcatgtca cacacagtac 660 

agtgggacta caggagtcac actgtatcta aactaccctt agggaatggc tctgccagca 720 

tctctgaatg aagagagggg ccagaaggta aaaaggagga aaaggagagg caaacaatga 780 

gaacagctcc atgactgttc ctgacatact gcagtgtggc tgtattttag aaaataaata 840 

cttagacaaa tcctacagac tgtggaggag gagaataagc aagaaacaga ttaatgtgga 900 

ttgtgggcat taagcaccat tattggaatg ttgcctacaa aaatgttctg aacctccaag 960 

taaccccgtc cctcagcact ttctaagctg tgaaattaat ctcttctaac cttaacttga 1020 

tttaatacca aatcaatata tagaaattga ttattaattt gagaaacgtg atttccgaaa 1080 

gcattatatt gggtcaggtt tccagtgtca aagagactgt gatctgtaac ttaacagtaa 1140 

atctgcaatg ccattataat ctaaacttgt tttacttccg tattcttgat ctaatcaaca 1200 

tttttacaag acaaaacact agcaaaatag taagtaagct agtcttcata atttgatttt 1260 

tcttctattt tttcttattc attttgtcat ccaacgacat aaaaggtcta tgttttcaca 1320 
ttttcactta aaatgaattc gggggagaga acaattcaca aatatttcct gaattaaagc 

tcaccacagg cttatacatt tatacttaaa catcactcta agaaacgtat ctaggtcttg 1440 

atattaatat ttaaaatatt gtcaattgtg tgttacttta tgctctagcc agttcatttg 1500 

cattttaaat taaggcactg gctgctgttg ttactgtgtg tttccacagg aaaaaaaaaa 1560 

tcaatttaat ttgtcatgat tctgctcaca cccagaacac acaggtatgc acatccttca 1620 

gcatcaggat gtgtatttgt ttactccgct cttcctgttt tgatgaaatc acattataga 1680 

ccacgtactg tgtttaactc agcagcaaac ccctctcaac tgcgtggccc gcgatcaaca 1740 

ctctcatctt tgcttagtaa tgatacattt ccaccacaca ttttcttata ctgcctttta 1800 

ctcgatgact cgtgatacta ttttcctctc attccttccc acctctacac tgcccgcgtt 1860 

accacacacc cataaagcga catcagcggt ctccagggcg gaaaggggtg gaagctgacc 1920 

ctcgcccttc cctccagcgc tggcttcagg tgtgccttct gctacctcct gtactgcgaa 1980 

caggggcccg ccgagctccg ggagccccta gaagaggaag actcctctgg ccccactagg 2040 

tatcatccgc gctctcccgc tttccacctg cgccctcgct tgggccaatc tctgccgcac 2100 

gtgtccatcc ctgaactgca cgctatcctc cacccccggg gggttcctgc gcactgaaag 2160 

accgttctcc ggcaggtttt gggatccggc gacggctgac cgcgcgccgc ccccacgccc 2220 

gqttccacga tgctgeaata cagaaagttt acgtcggccc cgacccgcgc gggactgcag 2280 

ggtccgccgg agcgcggcgc agaggctttt cctgcgcgtt cggccccggg aaaggggcgg 2340 
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gagggctggc 
gcgcccatgg 
agagacggga 
aagcaagaag 
cccacgctgc 
gctggccctc 
ggcgccggcc 
cgccgcgggg 
acgccccgag 
ccgacggcag 
ccccagtctc 
ggcagcgctc 
aggggtccgt 
gcgggtgggt 

gggggggggc 

taccaaatgg 
ctgggtggga 
gggccgcggc 

gggggtgagg 

agccgcaaat 
ttcgtgtgaa 
gctctctgcc 
ctccctccct 
ggggcgcgcc 
tccagtcccc 
acctcgaggt 
tcctcctaca 
caccggaaga 
gaggcagcgc 
gcaacggcac 
gacccacgca 
ggccacccac 
cggggtagga 
acgctgagct 
agaagagtct 



tccgggagcg 
atcatgtctg 
agacttccag 
gaaataaagg 
agccagatcc 
agcgccgcgc 
gccaccatcc 
tcgggaggac 
ccggggccgg 
cgacggagcg 
agacgcgccg 
ctcgctccag 
catcggctcg 
gggcgtctgg 

cggggggcgg 
gctccttagt 
tagcgaggct 
ggcggcagca 
acccggctgg 
cccgagccgg 
aatcccttgt 
tcccacccct 
ggggcgcagg 
ccagggaaag 
ccaccaaatt 
atttaaaacc 
ggctcgcgcg 
gcccgggctt 
cgggaagggc 
ctcctccctt 
gaccacagcc 
acccgcgccg 
agagctaatg 
gtgaagacta 

ggcga 



cacgggcgcg 
tgcgttacac 
tggcgggggg 

ggggccgaga 

ggatgagtcc 
agccagcagc 
gcgccgccgc 
gcggcgcgcg 
ggccggggag 
ggcggcggcg 
cgcagcaggt 
cccttgggga 
ccgggctgcg 
gaggcgggct 
ggaagggagg 
catggcttgg 
gcgcgcggcc 
aggacatccc 
gatccgcggc 
atcctcgtgc 

ggggtttggt 

tcttttccat 
tttcgccgcg 
gcggccgtaa 
actcctgcaa 
accccaaggc 
gcgcgttaaa 
gtaaaatgcg 
tggagcgcgc 
tcccggctcc 
gagcccgccg 
cggctggggc 
gcaatggcgg 
gggtggcccc 



seq_prot. 
gcggggaggg 
cagaggctcc 
aggacagggt 
gggagaccga 
gtcctccgcc 
atccccaccg 
cgtcaggacc 
ggaggcggcg 
agggcgcagc 
gcggcgccgg 
cggagcagcc 
tcttccgctg 
cgccacctct 
ggagggcggt 
gaggagaaag 
ggctccacga 
ggcgccccgg 
agccgcgcgg 
tcggcccgcc 
cggacggaag 
gtttcacttt 
ccgcgtaaag 
gactccgcgc 
aagtttcgcg 
agacgcgggc 
acacacggac 
gtctgggaga 
aatcaatgaa 

gggggtgcag 

gtcccgccct 
gaccccggga 
ttgggttggg 
ggtcgccggg 
gaaaggccga 



txt 

tactcactgt 

gggctccact 

cgagaggtgt 

gaggaagggg 

ccgggcgggc 

tgacgctcgc 

ctcctcccga 

gtcgcagggc 

gaggtggggg 

cggcggcggg 

tccccgggag 

aggcattgaa 

gctatcttgc 

gcaggggagc 

gagccggaag 

ccctcctgga 

ggctggtgcg 

atctggggga 

agggcgcaga 

gcgtggaagc 

ttaaaggtta 

gaactgggcg 

tcagcttggg 

gttgagcact 

ttcttgcaat 

ccccgttccc 

cacgagttgc 

tacgaaataa 

ggagtccctc 

ccctcggcct 

ctacggaagc 

gactgtggcg 

gggcgggggg 
ggaaaggaga 



gaagcacgct 2400 

aattccattt 2460 

taaagacgca 2520 

gagctccgag 2580 

tctcgctctc 2640 

atcacacccg 2700 

gcatcgtcgc 2760 

gagccccggg 2820 

ccagtccaga 2880 

gtggctcagt 2940 

gatgtccagc 3000 

ggcaggaaga 3060 

ggaaagagga 3120 

ggggcggccg 3180 

agggcagagt 3240 

agcccggagc 3300 

cggcagaatg 3360 

ggggcgggga 3420 

gagaggatgc 3480 

gggaggggcc 3540 

gaccttgcgg 3600 

ccccctctcc 3660 

agacacggca 3720 

gggcctgatg 3780 

tgagcccccc 3840 

ccgcgccact 3900 

ggggaaacag 3960 

gggcaaccgc 4020 

cctgggcgcc 4080 

ctgcggacgc 4140 

catcccgtcc 4200 

gcgaagaagc 4260 

tcagcagcag 4320 

aagggccatg 4380 

4395 



<210> 43 

<211> 10490 

<212> DNA 

<213> Homo Sapiens 

<400> 43 



agaaggtgcc 
cgattaaacc 
aaaacaaact 
gctcctcctc 
ccagccccag 
gagaggccgt 
gaaactgacc 
tttgagccag 

gggtggtgcc 

gtcctgaacc 
ccttggtatc 
tcaagtctct 
tacgttagac 
ggtcgtgata 
agaaacataa 
agatgcagag 
gctaatgagt 
gtcctgcctc 
agccgtcccc 
cccctaggcc 
tgagccccta 
ggtcatggag 
cactaaaaag 
gtcacttccc 
tagggagcag 



tgcttcccct 

tctttccttt 

aatacccctt 

tgccccctgt 

tcccagcccc 

gttcctttcc 

ccctattctc 

tcttttttct 

aatggatgaa 

gggcctttcc 

catggaggat 

gatataacgt 

cgttactaga 

ctgtattctt 

ccgtccaatt 

ctcctggata 

acaggggagc 

aggacttgag 

ttgaagtttc 

ctgcgtggaa 

ctgtgtgccc 

agaaatatcc 

accattaggt 

tccaccgcta 

ccatggatgc 



tcgccttctg 

ataaattacc 

ccctgaggcg 

tctttctttt 

agccccagcc 

tctccccgag 

atactggtgc 

tatctccact 

tggatggatg 

tccaatgaga 

tagttctagg 

ggcctagtat 

ttatttatga 

tagggaatga 

tattttctga 

cgagagccaa 

aggtgttgat 

actccagttg 

tttttatccc 

ggacagatga 

ggcactttcc 

agttaaaata 

aatagtatta 

tgtgtatccc 

tctgatgctg 



ccatgactgt 

taggctcaag 

ccttctcctt 

cccctcatga 

ccatcctact 

cctgcctgat 

aacatcttcc 

tgctagggct 

gacagtagtc 

agccttcctg 

gtccccggga 

ttacgtataa 

tgtgtaatac 

tgacaagaac 

atattttcca 

gtgtgctttg 

caggaggacc 

gatggcacag 

aagctctttc 

accaggtttt 

ccacaggatc 

tcagaaaaga 

gcttttgtat 

aggaccaccc 

gccctgggcc 
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aagttccctg 

tatttcttta 

aggcaacccg 

ggcccaagtg 

gcaggcctgt 

atgctttctg 

aagacctcaa 

gtcattggga 

cagggatgat 

agtgagtata 

atgccaaaat 

gctatgcaca 

aatgcagatg 

aaagtctgca 

tctgctgttg 

agagtagggt 

ctgcactggg 

gtagactcag 

tggaccctgg 

agataacatg 

ctctagctag 

aaaagtgata 

tctgagatcc 

tgggcgggga 

tcgggggtga 



aactgggagt 

tagcagtgtg 

ctgcccccat 

ataaacgggg 

gtggctgctg 

gatcctggag 

agctgtacca 

cagtcctaga 

gtccctgtct 

tacagtcatc 

ccatggatgc 

tcctcccgta 

ctacataaat 

catgttcaat 

ctgaatctac 

gggtgaggtt 

gcatctggac 

cccaggtcaa 

aattcggcat 

tctagaagag 

aatatccaag 

ccattagaga 

aatagcagca 

gggctgcggt 

ca£tg.atgag.„ 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



seq_prot.txt 

gaactgggtg cacacatgag tggggcagcc gggcctggcc agagaagcag cacacacgtg 1560 

cacagacgtg tttacccaca tacacatgtg cacgcacgtg cacaaacaca ttgcaggcag 1620 

gcatgttgac gcctcaggca gcggaggacc ctgactctgg gcgctgctga cccgggcaag 1680 

gccccactgt gattcgtgcc atgacctcag aatgtcactg gtgcttagca cctgtctgct 1740 

ctctggcctg cctcagtggt ctacagcagt tacacacagg cagtggtatc tgtgagcagc 1800 

tctgtggact caaaggtttt ctccctgaga ggcatgaccc aggccagctg attcatcaga 1860 

atcaggtgag cgtgacctgc trctcttccct ccaggcggac ttggggacag tggctacggt 1920 

gcgggcggtg ttggcctctg tggggcagct accgaggagg gtcatccctg agcactcacc 1980 

aggcgcccgt tctacactgc ccgtgtagac gattggctct ttcgtctcca tggtggcttc 2040 

gtagagtggg tgctgttccc aaatgtcccc attcgacaga tgagacgtct ggggtcagag 2100 

aggcagtaac cggcctggga atccggacat gaccctgagt trtgctctca gccctgccgt 2160 

gtgctgtgct ggaattcagg cctgaaccct gtgacctccc tgccctagat cccaaatctg 2220 

cccaggtttc ccatcccgat ggggcagagc ctggtcctgg cagagccact ggtatagagc 2280 

cactggtaca gatccactga cggtcctcag aacacctctg tgccctaagc tgggtcctga 2340 

tggtcgctgt gggccccact gaacacacat ggtcccttgt ccaggggagc ctgctgccct 2400 

tgggcagctg tggaaaatga aggagccctg gagggctggc tgaggggaga ctatcttccc 2460 

ttgtgttcaa aggggtccgg gcactagggt tctccccagg tatttcttgc tctgcgtggt 2520 

cctcttgagg cctcgccctx cttttgcctc gagtattccc aggagggacg gtccatccag 2580 

ctgttctcca ggaccaagga cccactgttc ttcctcagtg acccaggaaa atgaagcctc 2640 

ctcctgttgg gacggctcag aatggtggac rccacagtcc ctccgcgaga gacgtggttt 2700 

ccatgcgtac aatagatctt tctcatcccc caaacccaac accctcctgc tcaacaggcg 2760 

ttattcctaa agtggcttca ctgttcagac tgaagagcca cggtagccaa agtgatgagc 2820 

ggagtagaac cgagcagtcg ggagagatct tgttccctgt aggaaactgg gcatcgctga 2880 

ggccctgagc atcccaggag gccgattgca cagagacctc tggtcgctga ccccagtctg 2940 

cctccacatc cctggaatag cccatcatgg gcccttcacc cttggcaggt ggaaaccatt 3000 

caacctgctg gggccggtgt gtccccattt catggcattg ggggacaaca ggattctctg 3060 

tctaggtccc actgtactca agtccttggg aagatgccca cccctgcttg ggacttgaga 3120 

ctccagagac tggagcagct gtgggccact gggtctggcc cctttttccc tgggggcggc 3180 

ggtggaatgg gggttacgca gccagccagc atctgggagc ccggcgagag cggttcaggt 3240 

gttctccgaa gccgccgcgt acagtgtgac ctttagacaa ttctgtctca caggatggac 3300 

gtggtagagg tcgcgggcag ttggtgggca caagagcgag aggacatcat tatgaaatac 3360 

gaaaaggtac aagtcggtct gcttcttgga gggaggcctc ttccagtgtg ccctggtcaa 3420 

agggtcctgg gctccctagg agcacagggc agggacgggt ggccaatgcc cccaggccct 3480 

tgcacccttt accttggacc cctcaccaag gctccctctg ggctacaggg acaccgagct 3540 

gggctgccag aggacaaggg gcctaagcct tttcgaagct acaacaacaa cgtcgatcat 3600 

ttggggattg tacagtgagt cctctgcact cccctcaccc ctaaagcacc tgtctcagct 3660 

cagggatggg tttgctttta gaaaggcctt tctgacgcag gacatgtctc accaggtcgg 3720 

gtcaacctcc tttccaggga cagaactcct ccctgactcc cctgcaggtic cagcccgagg 3780 

ttgttaggcc agaggtgtgg ggcccatcta gggagccggt gggaatggag actgggctag 3840 

gtcaggcccc tgggcgctca gcagttctgt cggcaagtga gcacaagagg agcggggcag 3900 

cctgagggtc tggccctgtc tacttggaga caaccccggt gagatgcaag ggttatggcc 3960 

acagggtgag gggacgcctg gcccagcctc agggctgttg txcagcaggt ctctgagggc 4020 

ccacctgccc ctgttctccc ccattcccct agagctacag ccctcactgt cccgtgaggg 4080 

gaaaaggcat ggtgacaatg ggggctgtag ccctaggaga acgggggaga agatgggcag 4140 

ggccccgttc tgggcatctc acggtgaggc cagggaggca gcagggctcg cggctaaaga 4200 

cctgggtctg gtgctgggaa gggatctggg gccgggtaag aggagcccag ccaggagccc 4260 

atccctcagg gatcacagga tggagagaca gaggatccct ggggaggtag ggcgggaggg 4320 

agctgacgag ccgtgccact tctgaaacgc agggtgtgtg gctcgggtgc agggagaggc 4380 

aggtggatgc tgggaggtca gaacctgcaa gggccttggg gctgtcaagt ggggtgggcc 4440 

cctggtgcag ccagagtaca ccgggcaggt ctcagggcag gctcccttga ccctggcggg 4500 

gggatgtggt cactccctga gggactcctg tcagggcccg gtcgcccacc ctgggcggcc 4560 

cccatcccat ctcagggcta acctttctca gctccagcag aaagcaccac ctcgagtcca 4620 

ggacgggcag ccccactggg cagcctgacc gccccccacg ccaggggccc cagtaacccc 4680 

ggccaggctg tccctacact ccttcttctc ccaggtcctg cccctcctgg gagtcagccc 4740 

cacaggaagg cccttgtcct cccttccctg tgccttctcc tgggctgagc cctgagctgg 4800 

aaagggacag agccagtcct ttctgggggt cggcacccag gctggggccg ctccaggccc 4860 

cgtgcagttc ctcagctctg cctgggttgc cttacagtga gacggagctg cctcctctga 4920 

ctgcgcggga ggcgaaggta agagcctgat gcgtggaggg gctggtccag ggacgtaggg 4980 

actgggcggg tggtcagtga ggcagaggaa gcagctggcc tgagcggtgg cgggtgaggg 5040 

caacacgctg tcactgggag gggcagcagt ccctgctgga cctgacccca ggttgctgtt 5100 

cactttggca gtttgataaa attccaaaag gagaaccaca gtcctggctt gggggtggct 5160 

gcgcgcttgt gtcaggaccc cacctagagg ctgggaccta agactggtgt gtctgtggcc 5220 

tgaggatggt acatcccggg gtcccaaagc cagcccactg gtgctcattt gctcaaaggc 5280 

tctcagccct tgaggtctgc ccttccctgg ctccttccag ctggctccca ccagggctcc 5340 

agagcccaag acccagcatc cgcgggcggc tctgggaagc ctggcagctc cgctaactcc 5400 

aacatgcctc atttgacagc aaattcggcg ggagatcagc cgaaagagca agtgggtgga 5460 

tatgctggga gactgggaga aatacaaaag cagcagaaag gtaacgtgtg gagggaggaa 5520 

gcactctctg cagagacagg ggacaggcac ccatggctgt ggcctggcac catcagcctc 5580 
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tcagagggtg ggcggcacac tgtcctcgcc cagaggactg caggcctggt cgccagattt 5640 

cctgcctatt cgtgcaagcg tcaccttgca gggagggaat ctgaatctag ggctgggact 5700 

acccggagct caaggctagg gatgccctgg tgacctgaag gaaggaaaag gttcagatca 5760 

gagtttcgac tctgagtgtc catccactct ttcagtcctg ggaagggaga ccctgtccca 5820 

gctrtgatctc acctctactg aggaatcatg gggccaaaac cgacaatttc cagaatcccc 5880 

gggctctggt cctcactggg gtcaccccgt ggcctgtgac accagatcgt ttitctgccca 5940 

cagctcatag atcgagcgta caagggaatg cccatgaaca tccggggccc gatgtggtca 6000 

gtcctcctga acattgagga aatgaagttg aaaaaccccg gaagatacca ggtacgctca 6060 

gccagagcac aacaaacagg acaggccgtg tcggggccca ggtctccagc tggagggaac 6120 

gtcaagacca ccctggggag ctgggggtga aggtcagatg aacaccctgg gcacagatgg 6180 

tgacacagtc accacagaca aactcagctc tggtgaccct ccctggcttc agtaacaagc 6240 

caaaatgcag ctttctgcag aaggaaacct tccttctgtc cttccttccc gaagtgctga 6300 

ctgtgggctg actgccactg ggggcaggga gtcttccatc tgttctgaga ctgcttcctc 6360 

ctcttggccc tgccctacag atcatgaagg agaagggcaa gaggtcatct gagcacatcc 6420 

agcgcatcga ccgggacgta agcgggacat taaggaagca tatattcttc agggatcgat 6480 

acggaaccaa gtaagcctac gggagccaca gggtcccagc agagatgggg tgaatgagag 6540 

ggatgggggc ttccccggag cagaagccag ggtcacccag gagggatgac acagctgcca 6600 

agagctctcc cggcccaggg agcagccggc accatgaacc gagcacctcc ctggttccaa 6660 

gccctgggcc agactggaac atgtggggcc agaacccagg aggatcctga ggagatggaa 6720 

ggcagcaaac aaaatcatgc acaatggtga agggtgctct ccctgaccca tggggaccca 6780 

tggtaggacc cacgggaggg tggcaggata gagggcccat gagccccccc caggcaacag 6840 

tgacagcacc aaatgctggg agaattaggg gtcctggaaa ctctcatcca ggtccgctgg 6900 

gaacatgaca tggcacagcc acgttggcag cccgttgggc agtggctcac aaagctcgat 6960 

ggacttgaac cacacatccc caaagtgtca cagatattga acccactgat ttgcaaactg 7020 

acatccacat gaaaccagca tgccaggttc actgcttgac tcctcgtcac tcacacacgg 7080 

agccttcggg gacggccttc aacacgggga tggggagagc aaggctggtc ctcccttcaa 7140 

acggaagacc cagtgagaaa agggaacgag ccggtgatgc ccgcacgaac gtgggtggat 7200 

cctagatgca ttttgctgag ggacagaagc cagacccaat aagctaccac agtaggattc 7260 

ccattcctag gccattctgg aaaaggccaa accacaggga ctgagaagca gtctgggtgg 7320 

ccaggggctg acggatcggg gagaggctgg gtgcataggg gccaccctgg agacttggag 7380 

gatgaaggag tcgccccagg aggggctgga gcggtggccg ggagactctg cacattggtt 7440 

tggaaccgtg gaggaactgt acacccacag actgaactgg cgtgtgtgca aactgaaaaa 7500 

aaaaaaaaaa aatcattcag agtgaaaagg atcaggcaag tcactgtaca actgggctat 7560 

ttgcatgtca cagatgtgga ttttactgaa acatttcttc aagagtctca ggccctgaag 7620 

agctcactgc ttatctggtg aaacatctga acctgaaatg ggatttgctg utaggctttg 7680 

tagacaaagt gaaattaaca acatzctgcac aaaacaaacc aaagccccct ttctctgttt 7740 

cctaggcagc gggaactact ccacatcctc ctggcatatg aggagtacaa cccggtgagt 7800 

attcccggca gtgaggttcc cgggccatat ttccatattg acaggagtgg gtgtctggtg 7860 

OggQtgtcgt tgcttctrttt aaagttagta tttgtgaccc accaggatat aggaggtagg 7920 

atgtcagctc accgctggca taaacctcca aggaaggggg tggtxtcaag gggtcaagct 7980 

gagacacaga ggagtcaggg cctggactcc tggtgtcacc tgggcctgac caccacttct 8040 

cagaacaaga aatgacgccc tcctcctggg gctgccccaa agcccaggag cttggcagca 8100 

tcgcacacag gatggtgcta tcagcagaca ttttggacaa ggtgctgaag tgcctgatgg 8160 

acttggctct tgtcatgaaa tgaatgtgca tcctgaggaa gcctcttttt cagaggaagc 8220 

cuctccttca gaggaagcct ctccagtcac ctctgccctc tccaatgaca tgagtcctcc 8280 

caggtgacct cagcccrccc aggi:gat:gtc cttccatggt gactctggct cttgcaggag 8340 

gtgggctact gcagggacct gagccacatc gccgccttgt tcctcctcta tcttcctgag 8400 

gaggatgcat tctgggcact ggtgcagctg ctggccagtg agaggcactc cctgcagggt 8460 

aagtgaacag ctgccccggg gacctcctgc agccagacct ggggatggcc accctggcca 8520 

ggtgatcaca gctttcagcc aaggcaccct ccttgtgtcg ccagcttgut gggagacttt 8580 

aggatgtctc tgctgagggt cccacaggag tccacggctg acccccaaag cccaaatcag 8640 

acgcctctca tccccatcag cagagggcat ctcatcctcc ccgtggccac cctctgtgtc 8700 

ctggagccac gccctccggc tctgattctg tgcagctgac tctcccctcc ctgagagtcc 8760 

tcctgccctc cagctgcccg ggctcctgct gccatcggtg cccacgaatg ggccgaccaa 8820 

gcccaggtgg cagcatctcc ccatrcccctg ttccctggcc cgaccccact accaggagat 8880 

gaccgggaag cccagcgccc acccagrtcc ggccaccctg tcgtggcctg aaagtcaggc 8940 

ttgccctttt tgcaccctgg cccaggaggc ctccagggga acctccagcc aggctccagg 9000 

gaatgttccc gccccacctc cccagggtaa aggccgcatg ttggggtcac cagatgggag 9060 

ggtgggaggc cttggggttt gggggcctct ccagctgccc agctcttgca gctgatggct 9120 

ccacatcttg ggggaaggct ctgatttcat gatgggctgg gggcttctca ggatttcaca 9180 

gcccaaatgg cgggaccgtc caggggctcc aagaccaaca ggagcatgtg gtagccacgt 9240 

cacaacccaa gaccatgggg catxaggtga gtttatggtc ccctcagctc ttcccagagg 9300 

ccctgcctcc cgtggggctg taggagcagg ggggctggag cccctcgtgg ggctggtgac 9360 

tggctgagtc ccagccaggg cctgacctgg gacgtcgggt tctccatggg ctgggagttg 9420 

gttrtcctttc ctgccctgga ggagacagag gcacagggat gggggcccag ctcccgcaga 9480 

gcagggcaaa gggcagtgtg tccaccggga gtgtgggaag gtgacagtgt ngtggggagc 9540 

tctggacacc gcccagtgtt ctgcactagg ggaagggtct tcagaggccc tggaagaggg 9600 

aggtttttag ggcagcccag tggcctrgagc acctctgttg cttccatcag gacaagaaag 9660 
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atxtatgtgg gcagtgttcc ccgttaggct gcctcatccg gatattgatt gacggggtaa 9720 

ggaggcatag ggagaccctg gctcagggac cttccttgcc ctgcagtgcc ctgcttcccc 9780 

agcccggggg tctggctcac tcccagccca caggaggctc aggcgggtcc ccaaaggaca 9840 

cacaagcaaa accctctgcc caaggggggt catcccaggg ccatggctgg ggctcaggcc 9900 

cagcctcatg ggcagactgg gccaggaccc gacttgagag ggctcaggga agcctcaagc 9960 

cctgggcaag cccctctctc caggagccac atccccactc aaatgagtgc cccccatgag 10020 

gagcttcaag accttgtctg acccagcgtc ctggagggct caggcgaccc tcatggggaa 10080 

ggtcactgac tctggagact gaagccccag tgtgcgcagc tcgagccacc agccccagcc 10140 

tggaaggacc aggttctttc acacctgctg tccccacaga tctctctcgg gctcaccctg 10200 

cgcctgtggg acgtgtatct ggtagaaggc gaacaggcgt tgatgccgat aacaagaatc 10260 

gcctttaagg ttcagcagag tzaagtctacg tgtgcccagc ggggcctggg gagccctggg 1032O 

gtcagacccc gactggcccg agggcagctt cctcacactg tcctcatgat cctctgttct 10380 

ggcccagagg gaggtctggc caggtgggct gggcaggaca ctgtgacacc gagcccatcc 10440 

cccacatgac ccagatgaaa gtcgagagtg tggtgagcac ttccctgtcc 10490 

<210> 44 

<211> 6416 

<212> DNA 

<213> Homo Sapiens 

<400> 44 

ctcecgactc acctgtgact gagggcaaga atgctttctt ctcagagact cggtgtcccc 60 

ttttgtaaaa tgaggctcat tctgtccttc cccaagggct tcagggaaac tccagttaga 120 

gcagtgaggt gctaggtaaa ctctgagggg caccctaacc gccgtgtgag gtxaggaggt 180 

ctgaattctc tcatcccctc tccccaggac aagggcacac aactggttcc gttaagcccc 240 

tctcttgctc agacgccatg gagctggatc tgtctccacc tcatcttagc agctctccgg 300 

aagacctttg cccagcccct gggacccctc etgggactcc ccggccccct gatacccctc 360 

tgcctgagga ggtaaagagg tcccagcctc tcctcatccc aaccaccggc aggtacgatg 420 

gggcgtgggg cttgggggag gtcagtgctg gataatacac agaggcttgc aggccactgc 480 

tcccttcccc acacctctct ccctttttct tcttgccaca ggaaacttcg agaggaggag 540 

aggcgtgcca cctccctxcc ctctatcccc aaccccttcc ctgagctctg cagtcctccc 600 

tcacagagcc caattctcgg gggcccctcc agtgcaaggg ggctgctccc ccgcgatgcc 660 

agccgccccc atgtgagttg tccctcagaa gggaagggag ggatgcacgg gttctgggtc 720 

tgtgggagat acacagccgc ttccatggag gcaggggatc ttggttagga gtccctgagg 780 

gtctagcagg tgcggaaagg gaatgaatca ccctttgcct cccctcaagt cgtgtgcaat 840 

tcctggggcg caggtagtaa aggtgtacag tgaggatggg gcctgcaggt ctgtggaggt 900 

ggcagcaggt gccacagctc gccacgtgtg tgaaatgctg gtgcagcgag ctcacgcctt 960 

gagcgacgag acctgggggc tggtggagtg ccacccccac ctagcactgg gtaagtcagg 1020 

tgcatggaac tatccgggct gggagatgat gccttgcatc ttgggctagg catgtggctc 1080 

atccaggaga tctggttgat ctccaataac ccctgctttt tgcccttgtc cccagagcgg 1140 

ggtttggagg accacgagtc cgtggtggaa gtgcaggctg cctggcccgt gggcggagat 1200 

agccgcttcg tcttccggaa aaacttcgcc aagtacgaac tgttcaagag ctccccagtg 1260 

agtgcatgag ggctgtctgg gcgctgggat gccctgatcc tcaacctgga tgctggagcc 1320 

ctgatccctg acacttgtct acccacagca ctccctgttc ccagaaaaaa tggtctccag 1380 

ctgtctcgat gcacacactg gtatatccca tgaagacctc atccaggtgg ggggaccccc 1440 

catttcactg cagattcacg actccccagc attggccagt gcttctccac ccttaagtcc 1500 

tgtgcctccc ctctatgttg tagaaagagc caaatcacag tcctgtgtga ctggggacag 1560 

tcactttccc tgcctgagca tcagtttctt ctattaaatg ggggcgagaa atgcatgtgg 1620 

agcatttcct tgtaaaaacc tgagggtggg ctgggcacgg tggctcatgc ctataatccc 1680 

agcactttgg gaggctgagg cgggaggatt acttaagcct agaagtttga gagtttgaga 1740 

ccagcctggg caacataatg agacctcgtc tctccaaaaa aaaaaaaaaa aaaaaaaaaa 1800 

aggccaggaa tggtggcatg agcctgtagt cccaggtgct tgggaggctg aggtgggagg 1860 

atcacttgag ctcgggaggt cgaggttgca gtgagctgtg atcgtgccac cgcactagca 1920 

tgagaccctg tctcaaaaag aaaaagaaaa agagaaacat cctggtggta gaggggaagg 1980 

gaggaggtca gacctgtcac tctcctctgc tctcctctgg ctcagaactt cctgaatgct 2040 

ggcagctttc ctgagatcca gggctttctg cagctgcggg gttcaggacg gaagctttgg 2100 

aaacgctttt tctgcrtctt gcgccgatct ggcctctatt actccaccaa gggcacctct 2160 

aaggtaaggt cttgagggta ccagccccag cccctccagt ccctggtcct tttagaagtt 2220 

gccccttctc tgctggaacc tctgagccct tctccccctg ggccccccag gccagccacc 2280 

tccagtttac catctctccc tacatccttg cctagctcac ctgcccaggg aggtagcagg 2340 

agaaaagatg atcttagttt aagtcctggc txtacttcta tttgctgtgt gaccctgggt 2400 

attcccctgc ccctctctgg tcctgaaatc tccccacctg gcttgtgggg ggaggtaata 2460 

gtgggcggga tctacctgta ccaagtcctg ctcactcatg ctgcttagga tccgaggcac 2520 

ctgcagtacg tggcagatgt gaacgagtcc aacgtgtacg nggtgacgca gggccgcaag 2580 

ctctacggga tgcccactga cttcggtttc tgtgtcaagg tgaagacctg gccaggcctg 2640 

gcccctggcc tggggaagca ggaactgctc aggcccctgg atcctgcccg gggcttctga 2700 

gccccaattc agaeacctag actcctctcc ctgcaccctg gcctgctgga aactcctgga 2760 
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ttcagctctg ctatgtggag caggggcaga catgtggtcc taaaggcaga tatgggacca 2820 

grcaarttcc tctctctgca gcccaacaag cttcgaaatg gccacaaggg gcttcggatc 2880 

ttctgcagtg aagatgagca gagccgcacc tgctggctgg ctgccttccg cctcttcaag 2940 

gtgagaccct gggagtggca tggggggctg gcctggccag agggatcccc agctctgccc 3000 

tcaggaagtc tcaggaatga ggagggcatc acagccttgc tctctgataa cccccagtcc 3060 

aagcctgaag ttataggaag tgcccatcga aggcagaaac acagccctgg tctgggcaag 3120 

tcctggtctg aggggggcgt cacagccacg cccccaggac ctctcgacct caagctctct 3180 

ttctctcccc acccccagta cggggtgcag ctgtacaaga attaccagca ggcacagtct 3240 

cgccatctgc atccatcttg tttgggctcc ccacccttgg tgagtgtgcc caaggggatg 3300 

9gagggtggg tatgcaggcc ctgtcttacg ggtacctggg ccctgttctg acctctctcc 3360 

tctccttcca tctccagaga agtgcctcag ataataccct ggtggccatg gacttctctg 3420 

gccatgctgg gcgtgtcatt gagaaccccc gggaggctct gagtgtggcc ctggaggagg 3480 

cccaggcctg gagggtgagg cctgctgtgt gtgtgtgtgt ttgtgctggg gacccactct 3540 

ctgggtggga tccctgaaat aggagggagg aagagagggc ggggggaggc ccctggctgg 3600 

gaagaagtgc tctaccttcc tgaggtgctg ggtaatgccc ccaagcacgc cccgactctc 3660 

ccgtatctcc cactgctcca cagaagaaga caaaccaccg cctcagcctg cccatgccag 3720 

cctccggcac gagcctcagt gcaggtgggt gacggccccg agtcctgggg cgggggctgc 3780 

ctcaacttct ctntgtattc ccagtgggga gtagatggta taggggtccc ctccccaaag 3840 

tgaccgccca tgtccttccc caccacagcc atccaccgca cccaactctg gttccacggg 3900 

cgcatttccc gtgaggagag ccagcggctt attggacagc agggcttggt agacgggtaa 3960 

ggggcagggc cgggcaacag acccagggat aagagagact ggggtccagg tggcagccat 4020 

ggtccttggg tagtaatgct gccccatctc ctgtcttctg gcagcctgtt cctggtccgg 4080 

gagagncagc ggaaccccca gggctttgtc ctctctttgt gccacctgca gaaagtgaag 4140 

cattatctca tcctgccggt gagcttccct gcgtccccgg agtcctgcaa tgagacacag 4200 

gactcccagc aacctgtcct cctcaccagg cccctccaga ggctccctgg cccccagtgc 4260 

gtctcccttt tccctgcaca agaagtggga ggctgagtgc ggtggctcac acctgtaatc 4320 

ccagcactta ggacggccaa ggtgggagaa tggcttgagc ccaggagttc gagaccagcc 4380 

tgggcaacac agggagaccc catctctaca aataatttaa aaatgagcca ggcatggtgg 4440 

tgcacacctg tagtccagct actcaggagg ctgaggtggg agcattgctn gagcccagag 4500 

ggtcaaggct gcagtgagcc atggtggcac caccacactc cagcctggat gacacagtga 4560 

gaaaactgtc tcaaaaaaaa aaaaagaaag aaaaagaaaa agaaaaaaga aaagaataaa 4620 

aggaaatggt ggggccctgg cccaggaggg agcttcccaa gcctcgggcc cctccctgaa 4680 

cttccacccc ctttactgta ccccagagcg aggaggaggg ccgcctgtrac ttcagcatgg 4740 

atgatggcca gacccgcttc actgacctgc tgcagctcgt ggagttccac cagctgaacc 4800 

gcggcatcct gccgtgcttg ctgcgccatt gctgcacgcg ggtggccctc tgaccaggcc 4860 

gtggactggc tcatgcctca gcccgccttc aggctgcccg ccgcccctcc acccatccag 4920 

tggactctgg ggcgcggcca caggggacgg gatgaggagc gggagggttc cgccactcca 4980 

gttttctcct ctgcttcttt gcctccctca gatagaaaac agcccccact ccagtccact 5040 

cctgacccct ctcctcaagg gaaggccttg ggtggccccc tctccttctc ctagctctgg 5100 

aggtgctgct ctagggcagg gaattatggg agaagtgggg gcagcccagg cggtttcacg 5160 

ccccacactt tgtacagacc gagaggccag ttgatxtgct ctgttttata ctagtgacaa 5220 

taaagattat tttttgatac acctatgagt: tctgtctggc aaggcctggc tggctgaatc 5280 

aagaagggaa ccagagctgg acgtggtggc tcatgcctgt aatcccagca ctttgggagg 5340 

ccaaggtagg agaattgctt gagtccagga gtttgagacc agcctgggca acatggcaag 5400 

accctgtccc tacaaaaaat aaaaaaatga gccgggcatg gtggtgtgca cctgtagtcc 5460 

cagcttctca ggaggctgag gtgggaggat cccttgttcc ttgagcctgg gatgtcaagg 5520 

ttgcagtgaa ctgagattgt gccgctgcac tcagcctgag tgacagagtg agaccctgtc 5580 

tggaaaaaaa aagaggggag acctggagag gtgggcacct gtggaggcct tggtggaagt 5640 

gtcaaatgga tcgaggccat gtctrcagcct gcctggatgt tcctcaaggg caggagtggt 5700 

tatctgagag tctccagtgc ccaccatgca gctngacaca tagtaggcgc ccagtcattg 5760 

ctaattaagt aagtgaatag acaagagacc atcatcccag agagattttc tgacagtcta 5820 

agtctagaga ggtaattaac agggcctggg agttggagat gagtccgaca gcatgcttgt 5880 

cctccgctag tcctggacta gctgacggat agttggcccc agcatgcaca cagttatgca 5940 

tccagcccca ccaccaagac acagaatggc ctcatcaccc ccaacaagtc cctgtgcccc 6000 

ctactatcag ccatgctccc tcctacccaa actgaaacct cctttctgtc ctagttctgc 6060 

cccttccaga aagccatata aatggagcct gctagcattc agccctctgc atctggcttc 6120 

catxtccaac acatccttca gcacccagag tgatcttttc tcaaatanaa attcaccact 6180 

cccggcaggg cactgtggct cacacctgta atcccagcac tttgggaggc cgaggcgggc 6240 

ggatcacgag gtcaggagtt tgagaccagc ctggccaaca tggcaaaacc ccatctctac 6300 

taaaaataca aaaattagct gggcatggtg gcgggcacct gtaatcccag ctactaggga 6360 

ggctgaggca ggagaatagc ttgaaccggg gaggcagagg ttgcagtgag ccaaga 6416 

<210> 45 

<211> 6001 

<212> DNA 

<213> Homo Sapiens 
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tcttaacatc agaaggaaca aactctgaac acaccatctt taagaactgt aacactcacc 60 

gtgagggtcc acggcttcat tcttgaagtc agtgagacca agaacccacc aattttggac 120 

acaaggtgac aggctgaggg cggtggctcg gtcctgggtt ttcctggggc cttcccaggg 180 

aatgttctgg cacctgccga ctgagccctg ggaggtagcc ctggcatata gctccctgac 240 

atgatttgtc ttccattttg gggtgtcata tatgaaggga ggtgactgtt gtgatggtgc 300 

tggcaggact gctgtccctg atgtggggtg ggctgagtta ggcctgaaat atgggcctcc 360 

aggctgagtc ctgccctctc caccacatcc agggctgact gacacctcta gtcagcccat 420 

tctggcccct tccccacatg ccaggacaat gtagtccttg tcaccaatct gggcagtcag 480 

agttgggtca gtgggggaca tgggattatg ggcaagggta actgacatct gctcagcctc 540 

aacgtacccg tctcaaatgc ggccaggcgg tggggtaagc aggaatgagg caggggtggg 600 

gttgccctga ggaggatgat cccaacgagg gcgtgagcag gggacccaag ttggaactac 660 

cacattgctt tattgtacat tagagcctct ggctagggag caggctgggg actaggtacc 720 

ccattctagc ggggcacagc acaaagctcg tagggggatg gggtcaccag aaagctgacg 780 

acacgagagt ggctgggccg gggctgtccg gcggccacgg agaagctgaa gtgctgcagc 840 

agggaggtga agaagaggaa gagctccatg cgggccaggg gctccccgag gcatgcacgg 900 

cggcctgtgg ggaggggagg ggcgtcagtg agcctggctc ctgggtgata cccctgcaag 960 

actccacgga aggggacagg gagccgggct ccccacaggc acctgctgag aaaggcagga 1020 

aggcctccgg cttcacaaag tggccctggg catccaggaa gtgttcgggg tggaagcgga 1080 

agggcttctt ccagacggcc tcatcettca gcaccgatga caggttggtg atgagtgtcg 1140 

ttccctgggc aggagatgca gggtgagagt ggggactgga ctctaggatg ctgggacccc 1200 

tgccaccaaa cacacggggg acacacactg cctggcacac agctggactc tgtcaactag 1260 

tcctgcgccc gagaagctcc acagtaccct ctccgacccc acagcagggc gcagtcacac 1320 

ctctcagagg cacccacact gccccctctc cctgcaggcg ctgggtcctc caacattctg 1380 

gcaggtcctg arttgtcttc cccactagac tggggctctg gatggacagg ccagccctgc 1440 

ctatactctg gaccccccat ccaagcgggg acagtcagtg tggtggcatt gaggactagg 1500 

tggccagggt tcctagagtg ggcccacctg gcagtagcca tgctggggct atcaccaggg 1560 

gctggtgctg agctggggtg aggagggcgc caggcctacc ttagggatgc ggaagccctg 1620 

tacttcgatg tcacgggatg tcatatgggt cacactcagg gggatgatgt ccccaaagcg 1680 

ctgcacctcg tgaatcacgg cagrggtgca gggcatgtga gcctggtcac ccatctctgg 1740 

tcgccgcacc tgccctatca cgtcgtcgat ctcctgttgg acacggactg gacagacatg 1800 

cgtccccaca atgggtcagc acccagggga cactctcctt cctcctgtgt tggaggaagt 1860 

taggcttaca ggagcctggc cacgcctgtg ctggaagccc cgggtgtccc agctaagccc 1920 

aggggccccc agctgtaccc ttcctccctc agtccctgcc ttgggcccca gctgggctca 1980 

cgctgcacat ccaggtgtag gatcatgagc aggaggcccc aggccagcgt ggtcgaggtg 2040 

gtcaccatcc cggcaaggaa caggttaccc accactatgc gcaggttctc atcattgaag 2100 

ctgctctcag ggctcccctt ggcctgagca gggccgagag gatactcagg ggatagaacg 2160 

gggtagcccc caaatgacct ccaattctgc acctgtcagc ccagatgcgg ctcgccgggt 2220 

gatgcactgg tccaaccttt tgcccagcct cccctcattc ctcctgggac gttcaaccca 2280 

ccacccttgc cccccaccgt ggcagccact ctcaccttct ccttctttgc caggaaggcc 2340 

tcagtcaggt ctcggggtgg ctgggctggg tcccaggtca tcctgtgctc agttagcagc 2400 

tcatccagct gggtcaggaa agccttttgg aagcgtagga ccttgccagc cagcgctggg 2460 

atgtgcggga ggacggggac agcattcagc acctacacca gacagaacgg ggtctcaatc 2520 

cctcctgtgc tctgcgttca cetggaccag tctcaggccc cagccatctc caggaagacc 2580 

cagggcctgc ctgtccttac cactgacctc accaagtccc tccccaagtg ccagcctcca 2640 

ccctctctct ccttgcccag aggagaaacc taaaatcgaa atctccaacg tggacggggg 2700 

tacagagtcc ttggcctctc ctggtgcccc ctgacccggg cacacctctc ccacgaccat 2760 

gtctgagatg tcccctcctc ctccaggccc ttcttacagt ggggtctcct ggaatgtcct 2820 

ttcccaaacc catctatgca aatcctgccc ttcggaggcc ccagtccagc cccggcacct 2880 

ctcaggagct cgccctgcag agactcctcg gtctctcgct ccgcacctcg cgcaggaagc 2940 

ccgactcctc cttcagtccc tcctgagcta ggtccagcag cctgaggaag cgagggtcgt 3000 

cgtactcgaa gcggcgcccg caggtgaggg aggcgatcac gttgctcacg gctttgtcca 3060 

agaggccgtt ggggcgaaag gggcgtcctg ggggtgggag atgcgggtaa ggggttgcct: 3120 

tctccgtccc ccgccttxcc agtincccgct ttgtgccctt ctgcccatca cccaccggct 3180 

tggtcggcga aggcggcaca aaggcaggcg gcctcctcgg tcacccactg ctccagcgac 3240 

ttcttgccca ggcccaagtt gcgcaaggtg gacacggaga agcgcctctg ctcgcgccac 3300 

gcgggcccat agcgcgacag gatcacccct gggggcggga cgggcacgtg ggcgttgcca 3360 

tgaaggectt ggccccaccc tccgccaccc actccaaccc tggcgctcca caaggtctcc 3420 

cgcagtccct agcccggtcc agctgggcac agggcccact ctttgctcac ccacattgct 3480 

GGcctgcGtg gggcggggtt tggccccacc tcgtctctgc ccaccctgac cacctttcca 3540 

ctcaaggaag atcccgcccg tcccgcccac actgagcccg cagcataggc gcggtccccg 3600 

ccaccgcc^c ttcgacgcat cagcctcgcc caccgggctt ctggcgggtc tgggcagtag 3660 

ccccgcccfcc tcccagccca cagactcgca cctcccccgt gcaggtggtt tcctggccca 3720 

ctgtcctcag cccactcgct ggcctttatc tctgtttcac gtccaggacc ccacgccctg 3780 

tcggcgctgc ttgggctacg gtcactgtcc acccggggcc cacggaaacg cggtctctgt: 3840 

cccccaccgc cgcttgcctt gggaacgcgg cccgaagccc aggacctggt agatgggcgc 3900 

aggcgggcgg tcggcrcgtgt cctcgccgcg ggtcaccatc gcctcgcgca cggccgccag 3960 

cccattgagc acgaccaccg gcgtccaggc cagctgcagg ctgaacacgt ccccgaagcg 4020 
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gcgccgcaac tgcagaggga gggtcagggc ctcttgtcaa gccaggatca ccccagacta 4080 

caggtcctag tcctatttga accttggacg acccccgggg ctaccaggag tgagcaggtg 4140 

gaaggaggag acccagcctc ctgatcctgg ggcgggggtg ggggtcacac cttctgtgat 4200 

ggaggaactc agtttggatg cgtcacccag gtatgacctt gcaagagtca ccaaaattgc 4260 

cgagaggccc cagttagcat cccattccca gatgatggtc catgccggtg agcagtgagg 4320 

cccgaggacc cacagtgcaa aaggtttgaa ccgggtcact gcaccccctt catcctcgat 4380 

ttxgtgattt aaacggcact caggactaac tcatcttcca ttcccaaggc ctttccntct 4440 

ggtgtcagca gaagggactt tgtactccat aacatatgtt gcccaatggg cttgcatgcc 4500 

cactgccaag tccagctcca cctccaggcc cttgccctac tcrtccrtgg cctrtggaaa 4560 

atccagtcct tcatgccatg tataaatgtc cttccccagg acgtccccca aacctgcttc 4620 

cccttctcag cctggcttct gatccagcct gtggtttaac ccaccaccca tgtttgctgg 4680 

tggtggggca tcctcaggac ctctgccgcc ctccaggacc tcctccctca cctggtcgaa 4740 

gcagtatggt gtgttctgga agtccacatg cagcaaggtt gcccagcccg ggcagtggca 4800 

ggggacctgg cgggtagcgt gcagcccagc gttggtgccg gtgcatcagg tccaccagga 4860 

gcaggaagat ggccactatc atggccaggg gcaccagtgc ttctagcccc atggctgcct 4920 

cactaccaac tgggctcctic tggacacacc tggcaccccc accccaccag gcacagagga 4980 

ccaggcagga cactctcggc acaccgagcg cgtgaccctt cccttataaa gggagctgat 5040 

gatggccttc gccctctgct gtgagtgaac ctgctgtgtt gactgtgctg ccagtggcag 5100 

agtcaggcca gggcaggtat gggctgctcc agaggtcctt gccgctgctt cctgctccag 5160 

gcccttaccc agggtagggt ggtagaaagg cctggtcgga gaagtcaccc cctctcccca 5220 

ctccaagctc cccaagccca cacaggcttc tgggataacc agggtctcag tggacccggc 5280 

catccacctc ccagctaggc tcatacaccc taatgtagtc acaacccctc ctccagaaca 5340 

tgaccttgcc ctttccctac ccccacctgc ccactccaga gtgaccttca gcacccttat 5400 

ctgtcactgg cacttacctg gggccttaga gctcctgatg atgagtggca tcatgggcct 5460 

ggtcccttca cttcaccttg cactctrtgac atgcacagac gctatgcaca cacctgatgg 5520 

tgcacagatc tcttgtccac tcccagacac ttgtccactt gttcacactt gcagggacac 5580 

gattacacat gcagaaaatc acccacacaa agacaatatt cacacataca cagactcaca 5640 

ctgacactca gggcacacat tctctctcac acacaccagt cacacacaca tacagacccg 5700 

gcaccaagta ccccacttcc cagccatgcc cgaggtttxc tggatgggac ctctcctgtc 5760 

cagaggctgc tcccggtgag cctcaaagct gtcacatgga tcccagctca gcccacattc 5820 

tgggctctgg ccgggccatg acttcctgtt tgcaacaggg ctgttcccag agctcccagt 5880 

tggtagcctg aaggcccttg ccccagcctg tgacagcatc ctccagggct gcctgagggt 5940 

cgtcattctc cactgcttcc tggcctccat gtttctgatt agaaatctgg tggaaacatt 6000 

a 6001 
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caagaatttc aagtcattat ccctgtgcat gcaggcagta ccagtgccag caataacaag 60 

tcagcttttc tggcaaatcc atcaaaatta aatcttaaac ttttgaaaaa actaatatgt 120 

ttattatgct cctgttagct tagaggctta actatctgtg ggggagggcc tataattatg 180 

atcaattact tatgacttga aggtacttat aaaagcatct gcccttttaa gaaaccatta 240 

aagaaccagg cttaagactg taaaactaat cccaggggca atagtaaatc acaggtttgg 300 

ggacccaact ttgacaactc tcccttggaa agaaaagaat tgctgtcaac tgtccttatt 360 

tcctgaccct gtctgagcaa atattcgcag cctctaaaat ccagaatuct tagtctggcc 420 

aagcaagtca tggggaccaa tggttgtttc aaggcagagg cagaagagaa gaaacaggaa 480 

ggaaacaaaa agggaatgaa gatctgtagc tcatgagggt gacatcatca ataatttcct 540 

caagtatcac tgaatttcat ggctcgtgac tgagagtctt ttctgacatc ttacaggcta 600 

attatcctca aggatcttct tggttccaga gcctgtaatg tctgggtttg aatatgattt 660 

gttctacaaa aaaggtcagg tgaattgacc cctccctgtg aagctgctcc atggcaagag 720 

accatcaatc tgctgaaacg caaacaatcc tttagatatc tgaagtactt tcaaatttac 780 

aaagtgcttt cacatacgcc atctcattut attctcacga ctttaacagg taggtgccat 840 

catcacctta acacgtaaaa ataattttag aaaacaatgc ctttgtcttc agttcataag 900 

aatatcctcc tgaaattcac agtacagaat ccaagtcatg ggtcagttag aaacgtctag 960 

aaacacatca atctcaggag gataaatcaa gagaatagga tggatcagac aaaatctaca 1020 

agcacaatgg cagaaagtgt gcctctggta taagaatggc gtaagcttug taaacaatag 1080 

tgttatacag gggtgcctct gtagtacctc tacgtttgga aggacactat tcattcatat 1140 

acatgataag cacccttaac tggatactcc ataggtaaca gacaaaataa aacaagaact 1200 

tccgtcttaa aattgtagtg gtggaattca tccttcggga tgtattaggt gattccccta 1260 

ggacgacagc tgagttacag ggggctaaat ataccctccc cttttccctc caatatcaca 1320 

. aagagcccca aaaacggatt rtgttgtttt taacaaataa aatacacatg aaaaaaagtc 1380 

atctataggt ttttatatat aatctcatct tctaaaacac tcattctcct tcactagcta 1440 

ggccgaggta tata!tgcggtc ttaactttat aaccatttac tgccttattt atatatttat 1500 

gattctttga tgccttctag^cttgagtcta jadctgqgttg LSgaacaaagc accgcacaaa 1560 
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tcaccagcct gggataagag gagacaagtt aaaacctccc aagaagattc tgtagatatt 1620 

tggtctctct gagtctgtag agagtttggt atttcaaagg tcagtcttgg gtctctccac 1680 

tttgatgact aatgttaaat tgtttgactt tttacaaaaa tatttagctg tgcttattta 1740 

catcatttca tcctcagcgc acacagtagg catgtatttg ttgaacagaa gccctaaatg 1800 

gatgagaaaa aagatatttc tgctacacat tttctcaaga tcgagaaagg taaaatcaca 1860 

acccaatttg cagaagtttc taggcattcc ttttcaatgt aaagctatgc atctctgttt 1920 

aaaaaagaaa aggatgacac aaccaaacaa tgtcaaccac acaatgggga aattccagga 1980 

tcactaagaa aacaccatta tcctaagtca cacattttca tttaatactg gttaccacac 2040 

aggagagagt aagaactgct gtctccccac ccactcgcac aatccatctg ggaaaagcat 2100 

gggcagaaca tttatgatct aaccatacca catcttggac tcacatttaa ccctttggag 2160 

aattagtcta aagagtatgt ccgcccagag atataaagta aaacagacac agggataaca 2220 

gttttggaag aaatttttat gacgctgttt gagattgggc atcagcactc aaactcctta 2280 

gaccaccagg agtcattctt actacgcaac tttaataaag agaaactgag agtatgactg 2340 

gcaaaaatat tatgatgttt ctgcatcttg ttaacctgat ttacacttgg ctccacccat 2400 

aaagttccca ctgattatca cacggcatcc tcattctcca gtctcaccta ctctgtcttt 2460 

caaaaattcc ccaacggcat ggccacgaaa aatatgcggg ggctagcggg gcatgtctca 2520 

ggttggctta tcctgtccct aaaacgcacc cacttccccc agaccctcct tcagggtggg 2580 

cagggagaaa acgcttctaa gttgcacagc tgcagggaag cctgctcagg tcatgttttt 2640 

ttgacttgtc tttcggctcg ctgtctggca aggtcaccgg ctctggctcg cggctctgtc 2700 

gacaaaactt aactctcact ctgtaagacc aacaggtcgc gcaggacctc cccagtcgcc 2760 

ctctgccccg cacctttctg ggcctgagga ttcccggctc ggccctcccc gcgcgcgcga 2820 

ggccccagat ggcgagggcg cagctccctg gctcacctga ccacgttggg gtgctcgaag 2880 

gtctccaggt gcctcagcac cgccacctcg cggatggtgg agagcggcat gccctcctcg 2940 

ccggtctgca cccgcacgcg cttcaacgcc acgaaacggc ctccgttctt caagtcgcgg 3000 

gccttgaaca ccttcccata ggcgccctcc ccgatctccg ccacgcattc gtactgctgg 3060 

tcagcgcggc acaggccgtc cttctccatg ccgccnggac gccgcccgcc gcggcgccgc 3120 

tggggcgggc ggggggtgcg ctcaactagc tggcggccgc cgctcgccta ctccggggct 3180 

ccccggagat cggtcragct ttacttgctc cccgccggct caggcgctcg ggcgctgggg 3240 

cttrtcgccgc tgcagaagct ggatggagag acctccccgc ggggctggcg taaccctggt 3300 

gccgccgccg cgaaactccg cctgcagagt cgccgccgcc gccgccgccg gaggagcgag 3360 

ccgatccctc ctcttccctc ctcgaagcga agtcctcaac acagacacga ttacatagcc 3420 

tctgcccaag cgcgtctcag tccagaatca ttgcacctaa aggaggagac gggaggataa 3480 

gaagaaagtg caatcagaca gcccagaagc cttcctcggg gttcctccag actcccctcc 3540 

tcctccttta cgaagcctcc atcgctaccc tccgcgcgct cctgccctct cccaagccgc 3600 

ttaatccttc ctggttcctc cgagaaaagc gaagttactt ttctttccct gcagggctga 3660 

agccgtcttc gcgcggagag gttgcagggg cccctcgggg atgagcgagc ggcgcgggac 3720 

gcagtggaac gggagggggc gtgccgagca gcccagagtg tgccgggagc gcgggggagg 3780 

ggaggcgccg ggcacgtcaa tgtcacggct taatatttat ccctatatca ttgtgttgcg 3840 

ccgctaccct ccccgcctcc tgaggcccgg acgtgcaggg agacggggtc caggggtgcc 3900 

ggagggcgtt caggggtcgc gcacaagctg gagcggaagg actgtgggtc catccgtgtg 3960 

gggccgcaga atgtgggtgg gggctcccag gaccctgtaa cccgatgctg ggagtgtgcg 4020 

agaaggggtg gtcagacatc tgctcagaaa ttgcttcctt tctttctact ttcagttttt 4080 

ctacgaggag acatttaaaa acgactcgca tacacaaatg gtctttttgg ggtaaaggag 4140 

tctxggttgg aaacggaatt ctctctccgg ctgggaggac ctccctggtg gaaaccttgg 4200 

gaagaccagc catctggtcg gggagggttg gggcttatcg ggggtgggcg agcgagcggt 4260 

cctgggggag gggagacacc gtcccccacg ggatcccaag ggtgggagaa aggggtgttc 4320 

gccacaattt cgcrtgtcgt ctccttaaag aataacrtca ggaaggggat agcataatcg 4380 

cgcctcgtcc ttaagtttta aactgatttc gacctgagca aaactcaacc tccctttcaa 4440 

agggtggggg gtggggggtt aaatcctgcg cctccagggg tgagagagaa ggtctctgtc 4500 

ctcggggccc ggactcgcga accttttccc attcccagga cctccccgct gtaggtagca 4560 

gaggtggctg ccccattccc cctccggcta aaggcccggt ccagtctgca gccggacgcc 4620 

cccggagccc tctgcacaac aaagcgccag agtagggtac cctaactttc cctccccatt 4680 

cagcagctat ctgggaccta gggggcctcc ctcctccagg gccgccgcgg cctcggcagg 4740 

actttcacca cgacggccgc atgtccggaa ttcccggggc acctcggaat tacctgcctc 4800 

gcaaccgctg gcccgctcgc cttttgccag gaattaaaca aacggcgcga cccccacgat 4860 

gagcgggtgg agcggaccgc ggcgagagca gagcttctgg cactcactac attcagaact 4920 

ttccttaaaa gccgaggaaa gagcccccag ggactgcccg ggggtctgag ccaggtcgag 4980 

ggccacacag gtagccgggg agccgccgcc ggccgcccaa cccgcctgtc ctgcatcccc 5040 

ccaaccccct cccgctcgag tctagccccg ggctccgcag cgaaggaagc gtcggggacg 5100 

tcccaccccc gcagggaact gcgcctgctg cccccgccgg gccccgcact gccctggctc 5160 

H£S cccact cg<xcxccag acggtcggct tcggcggaga gaaacgggag cagcaatgcc 5220 

ttttcccccc ctctcctcca cttttttttg ttgttgttgt tgctttccca cgctggctga 5280 

atgtgacttg accccatttc aaaaaaagtt tgacatagtg cttcaacatt tccgcattcc 5340 

tccgcgacta caaaggctgc agccgcctcc ttctgcttte tgtctctgct ctctgtcttc 5400 

acactcaatg aaatccttca tgcgcctctc ccgaagcctg ccaagcacca caagggtcgc 5460 

caccagcaca gcgacaagag gccacaccgg ggaccgcgac tcccgc^gtgt gcgcacacgc 5520 

agatgcgccc atatgcacac ggacgggcgc gctgcgggga ccagggctgc tcactgcggg 5580 

gcgtgtgttt aactccaata ttttaaatta taaaagtaaa cttggaaact acagcaaacg 5640 
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tgcgtxttcg cggggctcca agcctgggca ctggccggct gcgtgcactt ttctgtgtat 5700 

aacacgcccg caggaagcct gcgttaacat ttatctcgtg gaggggaagg tggagcccac 5760 

acccacacct cagcgagctg gagagggagt tttcgagggg gaaatgcaag cacattctcc 5820 

agcatggagt ccctagaggc agtgcatttt aaaccctaaa tgtgaattat tatgtgagtg 5880 

tccgagtaga gttccagttc cgtttggaga aactgtatga ggcgtgttcc gtgtgggtgt 5940 

gtgtgcgtac ggatacgtgg aagcaggatc tcggtgtcgc gggtgtccct aggctgggat 6000 

ctccccctct catcagataa ggagaatcct ctagctcccc aaaactggcc cccataacca 6060 

atcccgtctc agttcgtaag gggttaacca aagccgattc caaggaaaca caggtccccc 6120 

ccaccccccg ccaccctcca cttaccccac ccacagttct gcctcggacc ccggccaatc 6180 

ccccagatct cccccgggac cccccacccc gcagcccacc cacccgagcg cacagctcct 6240 

cacctgaggg gcccagtcgc gtcgggcctc ccgagggggc tgcgagtgtc agtcggctct 6300 

ccgcacgtgt ccgcggcctc gcggagcagg taatcagact ctggggaagg agttaccagc 6360 

actctctccg gcgagggggt gggcacagcg gcggagggcg gagggacggc ggagggcgcc 6420 

gcccgcgccg ctcgctgggg gcggacgggg gccgct 6456 

<210> 47 
<211> 4499 
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tttagaggcc accattcgaa tcttggtcaa ctgtgaggaa aggctggaag aggagtttgg 60 

agatggactg tggcaattct ggaagaccca tgtcctggaa ttcgagagaa agagaggagg 120 

ccataccacc taggagagag gggctgctga ggtggggtga gctggaatct gctctcataa 180 

gttattcctg ccctttggct tccagggatt tgggggcctt tggtctttcc tttcactctg 240 

gcccaggatc aagtttctct ttaacttctt tttctgcacg tggattccca gacccagcct 300 

ggtgctccag ccagagccta ggccatagtg gggagcgcga gtgaggagca tgtcctctga 360 

tgggtttttc agcgagggtc gaaaggcaga tgctgaaaag gtgactcgct tagactcacg 420 

ggaccagtcg gggcagctct gggactccca gagttcccaa agacgatggt ccaaaaacca 480 

ggagcagcgt gagcggctgc cgagttccat tccactcgta agctctcgcc aagtactggg 540 

caggaaacat gatagccaag aggactggag cgttgacttt ttctcctggg agtgatacct 600 

gggacagcct tcaaggctga agggcccagg ttctctccgt cgctcccacc tggggacgtc 660 

tcttctggct ctccgccgcc gcgcacacgc actagctggg cagcaggaga ccagtcaatg 720 

agactgcaaa cgttgtttgc ttttatttat ttatttttga gactgagtct cgctctgtcg 780 

cccaggctgg agtgcagtgg cgcgatctcg actcactgca agctccgcct ccagggtaca 840 

cgccattctc ctgcctcagc ctcccgagta gctgggacta caggcgcccg ccaccacgcc 900 

cggctaattt tttrtctattt ttagtagaga cggggtttca ccgtgttagc gaggatggtc 960 

tcgatctcct gacctcgtta trccgccctcc tcggcctccc aaagtgctga gattacaggc 1020 

gtgagccacc gcgtccggcc cgttgtttgc ttttagttac tgattgttaa aagattaagc 1080 

tgctggtaag ccgacatcaa taatccctcc ggaaaaatct tttaactaag ttaataatag 1140 

aaaaacattg gttgcaattg tagatactag tcatttcgaa gatgtgtgag agggaggtac 1200 

agtgggtccc gggtaggctg caggggtgtg tgcgcgcgaa cgtgttgggt ccgggagact 1260 

gagagtgtgt gtgcaagatt tcgggaggga gctttttctc cggggtgagg agtgcggact 1320 

gcagctctcc caggccagct tccgcccaac actccccaca acccctaacc cccaagcccc 1380 

gcacactcgc cccgccccag gctggacagt tactcccagg gactcttcct cctgctttac 1440 

tcagaaggag gcagctgcga gccgggcgtg gtggcgggcg cctgtaaccc cagctactag 1500 

ggaggctgag gcaggagaac tgcttgaacc tgggaggcgg aggttgcagt gattcaaaat 1560 

cgcgccaatg cactccaccc tgggcgacag accgagactc cgtctcaaaa acaaacaaac 1620 

aaacaaacga agtcgctgcg agcgtcccct cctctccttc cctctgatta gctgagctgc 1680 

atcgatcaag gaccatccgg cccgggcggc gggataggga gggcctgggc gagtccctcc 1740 

cggagccttg gagtccgcag gagatttgcg ggggcgggct gcgtctccaa gagagcgagc 1800 

gtctccaaga gagcgagctc cctgtcatgg gcggtgtcca caaaatgcgc ctctgtgggc 1860 

gacagagggg tgcggtgggg cctaggaggc tcgtgggcga ttccgagcac cgggtcggtc 1920 

ccgagtgagg ggccagccct ggggctggga ggaaggcgag aggcagcgcc actgctcctg 1980 

actccccggg cgactccacg aaggaaggca gagggctgag ccccagcgcg gatcagccaa 2040 

actcgacggg cccttgccgg tcacccgtcc ctaaccctgg ttgggctggg aggggtcttc 2100 

tcgcaagtct tctcgcccgg cggaggcttc tccttcccaa cccccgcgac gggcgcatgc 2160 

cttaagctgc tctcgcccgc cgggctcgtc tgtcgctccc agggcctggt ggcccgcggt 2220 

ggcccgcgtc catcgtgtcg gtcccaccga ccccgattaa cgcaggttcg agatgcggcg 2280 

gcttccgccg agtgggcgcg gcgttcgggg tcctccggga. gcaggtgcga aggacgtgtc 2340 

tggagtctgt cgtctggacc tctggcctct gggtgcgcag caacccctcg cggcaggatt 2400 

gcaaccgcgg aggcctgcag ctgtgactaa actcgcgcct cctccgccgg cttcgcggcc 2460 

cgcagtccac ccgggctaat ggtgtcaact ttggattcct ctcaatgccg ggccctgtgg 2520 

cacccgggaa agtctccacg caggaaggcc cagggtcctg cgggtgtaca catgggaagg 2580 

cacaaaagct acagatattg tttgttcaaa ttgaagctcg gtggaaagct atgccttctt 2640 

actttttaaa aaattcaact ttgttgaagt acagtttaca tgcaataaaa tgcacccact 2700 

ttaagtggat atttagatac atattttgac acagtcacac tccactacca tgatcaagat 2760 
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acagaacatt tccatgtcct caaaaattac cttgtgctgt catcccagac taccctgtgt 2820 

ccctctactc accccatccc acacacctga gttctaggca accgccggac tgctttcttc 2880 

tactatagat tatgtttttg tattttctgg agtttcatat caatggaatc atacactatg 2940 

agctctgtgt ctgactttca cttagcttaa tgttgttgag acccctccct attgttggaa 3000 

agatcagtag tttacctttt tgtgtgtgag atggagtctc gctctgtcgc ccaggctgga 3060 

gtgcagtggc gcgatctcag ctcactgcaa cctccgcctc ctgggttcaa gcgattctcg 3120 

tgcctcagcc tcccgagtag ctgagattac aggcatccgc catcacaccc ggctaatttt 3180 

tgtatttctt taatagagac ggggtttcac catgttggcc aggctggtct cgaacttctg 3240 

acctcaggtg atccacccac ctcggcctcc ctaagtgctg ggattacagg tgtgagccac 3300 

cgcgcccagc ttttttattg ctgatattgc atrtgtctggt tgtaccgcaa tttgtttaac 3360 

ctttctcttg tccatagaca tttggacttt ttcccagttt gggctatttt gaattaagct 3420 

gttaggaata tttgttcaat tctgtgtgac tttatgattt catttctctt gggtgcatat 3480 

atagaaatgg aattggtggg tcatatcaga aaatatattt ttaatgttgt aagaaactgc 3540 

aaaactattt tctaaagtgg ctgaagcatt ttacactccc accaatagta tatgaaagct 3600 

ccagtcatct ccacatagcc accaatttrt agtattgcca gtctttttaa ttttagttat 3660 

gctagtgggt gcccagtggt accttactgt atttttaaat ttcatttctc aaataactac 3720 

gttgagcatc ttatgacgtg ttcgttttaa aaatgtgtat atcttattta gtgcaatgtc 3780 

tgttcaaata tttcactcat ttttatcagg ttatttgttt tatattactg agctgtaaaa 3840 

gttttctata tatttaggat ataactccct tatttatgtt ttgcatatat tttctcacag 3900 

tctgtggctt gcctcatcat tttctttttc tctctctttt tttttttttt tttttttttt 3960 

tgagacggag tttcactctt gttgcccagg ctggagtgca atggtgtgat ctcagctcac 4020 

cacaacctcc gcctcccagg ttcaagcgat tctcctgcct cagcctccca agtagctggg 4080 

attatggtca cacaccacca tgcccggctg attttgtatt tttagtagag atggggtttc 4140 

tccatgttgg tcaggctggt cttgaacact tgacctcagg tgatccgccc gccttggcct 4200 

accaaagtgc tgggattaca ggcatgaagc accacaccca ggcttttttg ttttgttttg 4260 

ttttgttttt gtttttgttt tttcaagata gagtcttgct ctgtcaccca ggctggaggg 4320 

cagtggcatg atctcaggtc actgcaacct ccacctcctg ggttcaagcg attctcctgc 4380 

cttggcctcc ccagtagctg agattatagg cgtgcgccac catgcctggc taatttttgt 4440 

gttttttagt agaaatgggg ttttgtcatg ttggccaggc tggtctcgaa cttxtgacc 4499 
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ccatgtgagg acacagggca acgacagcca tccacagcca aggaagagag gcctccaaca 60 

gaaccaaccc tgcccacacc tggacctcaa actcccagcc ttcagaactg cgcgaggaga 120 

aacgcctgtt gtttatgcca cctggtctgt ggcagaccaa tacggctact tactagtcgg 180 

ctggecctgg gcaggcttcc tatgccctct gtgcctcagt ttcctctact gtagaatggg 240 

atagcaatgt gcccacctct tgggctgtgg tgagccggtt aagatgtgaa gagcctggag 300 

caggctgagc aggggcactg caaacctgct cctgttctta cacttcccca cgactcagag 360 

gccggggaca ctgggcacct tgcttgcagc tctcacgcca gcttactcct aaacacccca 420 

accagagccc ctacaaacac agccaaacgg agcagaggca ggcagggtct gctcccgaca 480 

gcccttgcta aggacgcccg accaaggctg gcaaggaagg tggctagggc tcagagctca 540 

ctgttttctg ggaggagaaa ttaaaacata aatgaaaaag ggctccgggg ctgagcaggg 600 

catggaggtg gccaggacag gagcccagct gcaggtacag ccccaattta tgtgattttt 660 

agggacctgg aaagggacct ggctcagccc acgtggcagt cacttctcta acttcctaac 720 

tgcggcccca ggatggctgc tgggactccc agagatggac tctcacggct tccaggggtg 780 

caaaggaagg acagtgtgga gtgtgctigta agccccaacc gacactgctt gagcacctgc 840 

tgtatactaa gcgctttcat acagtgaagg cgaggctgca ggccatgtcc agctcacaga 900 

cagagacacc aaggccaccc aggcaccaag gtgatgcagc taccgaggcc acacgagaag 960 

caagagggca tctctgtggc aggtgggtag tgggagttag tgtgagttgg ggagcggaga 1020 

cccaaagagc tgtgctgggt ggccagggag gctctgactc aatgcccagc accaggtgag 1080 

ctgacctggg ggtgcccatg tgtccactta tgggagacca ggaggcagag gcaggttggg 1140 

atgagagtcc tgggggtaga gctggcccgg gcggttcctc ctcacagatt ctatagccgg 1200 

agatggagat ggctcagggg catgcctgcc tgcggccctg gctctccaag agctctgaga 1260 

ccaatgcccc cggcttagcc caagccgcac aggaaagcca tcctgtggac caagtctggg 1320 

cacacagaga gggctggact ccacacgcat ctgtggcagg ctcctggaga ctcctggaac 1380 

acccctgcat tctcccrtca ttctcaagca atattccttt gggaaaatgc tcacctggtc 1440 

tcagctgaat ccaggaaatt gattgccacc cactcacaca ctcagagttc aaetggggtg 1500 

gtagggacac atgcacagga ccgtgcccac gcctgggctg catgcccgca gctgggggac 1560 

acacacacag agtgacgtgc acccatcact nacacacaca cattggtcac acacatgtcc 1620 

. acactaaatc atttgctcac aaccactcac acacacacta cctccacact cacaocctac 1680 

agctgctgag gagggtgagg acaggtcgct actcccacct ccctcatagg atgagctgca 1740 

aaacggctca gaggcttcag cgagggtgca tgcgtgtccg ccgagtgttc cagcgtcccg 1800 

ggacagttct ccccagcacc gttgatacag cccccactgc cctctgaagt gcagacaaaa 1860 
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cggagggcag 

ctgccttccc 

ggattctcct 

gggaggaaaa 

cctggagcca 

gggggccatg 

cagtgtggtc 

tggggtctca 

tatgtggggc 

ccctggagca 

actxttaggc 

tcattcattc 

gcctgtaatt 

gagaccagcc 

ggcatggtga 

tgaacccagg 

caacaagagc 

ggcacagtgc 

accaggaatt 

aacaaagcct 

ggaggtggcc 

cggcacgagt 

ggccccatga 

catggctcca 

ctgctccagg 

tcgggggtgt 

gcccttgctc 

cagggtctca 

ctccgcctcc 

ggcgcatacc 

ttggccaggc 

gctgggatta 

gcctcctgcc 

ttgggggaca 

ttctgccccc 

ggtaatgttc 

ttaccttgca 

caatggccca 

gcagtcatgg 

accctgcaag 

ccagccctgc 

tgtcacatgt 

aatttttcct 

tggactgaga 



ctctgctcag 
caggaacgct 
gagatctgtg 
atccatgtgt 
gggctgacgc 
caggcactaa 
ccacctcaag 
attatcctgt 
ggtcacaagc 
ggcctgggtt 
cagaaaactc 
attcattcat 
ccagcactta 
tggtcaacat 
cgcacacctg 
aggcagaggt 
gagactccat 
taactgctag 
tggaggttag 
ggttatggtg 
cagctaaagc 
caagtcgcag 
agcccgtcct 
gcacacaccc 
aagtgacctc 
gtggaggccc 
ttccggttat 
ctctgtcacc 
cgggttcaag 
atcacacccg 
tggtctcaaa 
caggtatgag 
acaatcaacc 
gggatgggag 
agctgcccct 
atgctccttg 
gagccagaca 
cagacattgt 
ggccgcctgc 
gtgagacgtc 
ctctctcctg 
cacatggagc 
ctaattacat 
agcggggcta 



ggcctccctg 
ccgctcgagt 
agtagatgaa 
gattcatgat 
ctaggacccc 
ggcaaggcac 
ctgcagggca 
acggccaagg 
ctgcagctgt 
cccggggcac 
tctgagaagg 
ttttttctta 
gggaggccaa 
ggtgaaacct 
taatcccaac 
tgcaatgagc 
ctcaaaaaaa 
caacacccag 
aaacatagcc 
gggtcatcca 
cttgaaggat 
agcccagggg 
ctgctgccaa 
cgatcccatc 
accttttaaa 
ggaagctggc 
ggctcttccg 
caggctggag 
cgattctcct 
gctaattttt 
ctcctgacct 
ccaccgtgcc 
cacaacagtc 
gaaattcagg 
gctgagaatt 
taacctctgg 
taatggggca 
ttaatttgta 
acacacatgt 
ggacaaggcc 
cctccagcca 
acacatgcat 
attcagcact 
taataaaatg 
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gccccaggaa 
agtaagtgac 
gctggggatt 
gccctcccag 
aacatttctc 
gagcgtgtga 
gagcacgaga 
cctaagttct 
ccctcctgct 
ccccatatcc 
gacagcagga 
ataaacatct 
ggtgagtgga 
tgtctctact 
tacttaggag 
tgagatcaca 
aaaaaaaaaa 
gcaaggaaga 
cctctgcaga 
gcctgaggca 
gcacaagagt 
acaggtcagg 
tgaacccttc 
acttccgtct 
tacaatgcgt 
ctgcctgctc 
tttttttttt 
tgcagtggcg 
gcctcagcct 
gtattttcct 
cttgatccgc 
tggccacacc 
gaaggaggct 
actagacaga 
gggagccaga 
tcccattaaa 
ttagaaggga 
atctgggagt 
txtcattcac 
gcccccagcc 
ggcccaccac 
ggagccatcc 
tctgagaaaa 
gaatttgtaa 



txt 
gggccaacag 
tgcctagctc 
agcagacata 
acgtggggcg 
tgcacccaag 
gcgtgcagca 
gacccaacgt 
aagagctggg 
gtatatccat 
cacctgccca 
cttgggattc 
tggcgagcgt 
tcatttgagg 
agaaatacaa 
gctgaggcag 
ccattgtact 
caaaaaagtc 
tgtctatctg 
taatctgtag 
ctcaggggag 
gagtgagcca 
ttggagaagg 
atggaggcct 
ccgcatacca 
cttcctgcaa 
ttgcagctgg 
gtgtgtgtgt 
cgatcttgtzt 
cctgagtaga 
agagacgggg 
ctgcctcagc 
cacactctgt 
atgaggcccc 
ttacaaaacc 
aaaaggaggc 
ctagcagcac 
attgaatttc 
taatgtggtt 
ccggacggag 
cggtgagtigg 
acagaggacg 
taataggcag 
tgagatttta 
caatcgcgca 



gcagcaaggg 
cggccccggg 
gctccaggcc 
atgcaggtgg 
gggtggacct 
gacagcatgg 
ggtctctgga 
cagtcccccg 
cccgagctca 
ctctcaaaca 
ccccattcat 
a 9tggctcat 
tcaggagttc 
agattagcca 
gagaatcact 
ccagt:ctcgg 
tcgtgggctg 
gtttttgccc 
caggacagaa 
gctccctgga 
aggaaggggc 
gatgctgaga 
ggaagcaccg 
gctggggatg 
cttaaccccc 
tggttgaagt 
tttttrtgaga 
tcactrgcaac 
tgggactaca 
tttcaccatg 
ctcccaaaat 
ggggaccagg 
agcgggactt 
gctcacagac 
caaatctcag 
aatcccagct 
cctttaatta 
ttctcctctt 
catggggccc 
agtgccgcgc 
tcagtgtggc 
gcaggagcat 
gtaattxaat 
gttaaattag 
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tactgcatat 
attttaacac 
caacaaataa 
tgattacagg 
tttcctataa 
aggcagaagt 
gaaactctgt 
acattttttc 
ttctatgctt 
tgtgtgaaac 
gaatttccaa 
ttcataatac 
acagttacca 
gaactgagat 
tcccatcatt 
gtagggcagg 



tctcacttat 
attartatrc 
aagcctttag 
agtcatttca 
gaaaaactta 
tgcagtgagc 
ctcaaaaaaa 
tttaaaaaca 
tttgacaatg 
tatgtcgagc 
tctcaacctc 
ataagagtag 
tggaaagacc 
ttgaataggc 
gatcatctgg 
catactaaat 



aactggcagc 
ccacttagat 
aaattgatgt 
tttacactaa 
attccgggag 
cgagattgca 
taaataaata 
ttacactatt 
taaagattat 
trgatgtatcc 
cttcttgttg 
ataatcagat 
ataatatgtt 
tttagataga 
gagaatcaaa 
aaaaatgggg 



taatcattga atacacatag 
tctgtgtaat atttaaccat 
atctagcaca attaaggggt 
tttgatgtga ggagacaatc 
gcttaggcag gagaatctct 
ccattgcact ccagcctggg 
aaataaaata aaaagtaaaa 
gaaatagaat agaattttgg 
atcaagaata acattgactc 
agaaaagaaa ttgcagcgat 
tggaatagaa atttagttcc 
atgaggtgat taaatatata 
tggacgaggt gggggaaaga 
ctaaaaaccc atggcctgtc 
atggcctgga ttataaagtg 
tgagtgtgct tggatgggaa 
Sette-€rfr — 



acataaatgt 
ttaattacat 
aatttgttta 
aaattccttt 
tgaacccagg 
caacaagagt 
cttaaccctc 
acctcaatgt 
cttgtgcaca 
attgggtata 
accttctctg 
aactgatttc 
gtctccctct 
ttcttcaact 
attacaatta 
aaactggggt 



1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
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gcagagaaat atttcctgtt ttggaggctt cgttcatagc gagcctgagg atgcagatga 1020 
tatggaacat gagccccacc aggatataca tcctcctgca cagtcttccc tgatcagtgc 1080 
tgtctattgc cctaggcacc atcctcagcc actctgtgtt aggcacttag gacagttctg 1140 
ttcaaagcat gattgctttc tggtcagtga agaaatagta aagaaatgag agtctgcttt 1200 
tgaagctttt gtagcaactt aacatttctg caatgtcgtg gtacatgttt tatatttttt 1260 
aaaagtattc atctgcagac tagaaataaa agagtccttc agcacagata gtttgagaga 1320 
tgctactgta gagtgccctg ggaggatggg attcttctag acattgacgg ctccctggag 1380 
tggtgctgga gcttgctgag atagggacgc ggggattgct gtggaacagc atgttgtgaa 1440 
aacaggcaca gatgggctgt tgccacgtac tcagtatttg gcaggcccag gcaccctgcc 1500 
ccggggagac agagcaacag ctgtcatttc caaaaatgtg acacaaatat gactttcgtt 1560 
ttttattgta tgtgtatgca tgtgtgcatg cgcatgtgtg gttggtgtgt gggtaaataa 1620 
tattggtgag ggagttagca gtgcagagac gcagagatga atctaatgaa tttatatttg 1680 
gaggcccttg aaaggagccc tgtgcatgca taatcaggaa gcagaagtgc agtcgtaggc 1740 
cagtcccatc ctctggtgga aacagcaaat tacagcctta gcacttctca cctgtagcaa 1800 
aagaaatgta tgctgcacct gggaggggta ggagaacttc ttagaatttg gtgggtttat 1860 
tagtcagggt tgtgtagaga aacagaacca atgagataaa tatgttagat agatgataga 1920 

tagatagata gatagataga tagatagata gatagataga gatgatagat gatatagata 1980 

gatgatagat gatagataat agatgataca tagatagata tagcaaatag atctatagat 2040 

atgacagaga gagaatgaga gagatttaca ttaaagaatt ggctcacatg attgtggggg 2100 

ctgcaaatcc tacatctgta gaatttgctg gcaggatgga aattcaggta agagtggata 2160 

ttgtagcctt gagtcgaaag tccacagggc agcaagctgg aaactcaggc aaggtttcta 2220 

ggttgcagtc ttaagaagaa tttcatattc tttaggaaac ctcagtctgt gttcttaggg 2280 

tcttcaactg attgaatgaa gcacaaacat tatggaagga aatccggttt actcaaagtc 2340 

tgctggatta agcattaatc ccaactaaaa aacgccttca cagcaacatt tagactactg 2400 

ttcagccaag tatctgggca ccatagctca gataagttga cgtataaaat tcaccatcac 2460 

aagggaggaa tttatatata cactttattg cagcaaaatc tttacagttt aacgttttgt 2520 

acttttccca gaaggaaacg tttcagtgca gagttgaata tcgcatcttt ccgtagctca 2580 

tacaaatccc atacaacttt gatttgctga gcctctcctg aaaactggac aacttaagtg 2640 

ttcatgaaag gttttaattg cttaagaaaa tagactgttt gtgtgaatta caaagaaaaa 2700 

gggattttag aaggaacatt ggtactccgg gaagcaggtt ggggcaaggc ttgcataagt 2760 

gaatcagaag ttttaggcac gaagtcagca cctctgtcat ggcctatgtc agttgagtct 2820 

ttacctttgc ctcgtgcctg gccttcagac cctgtgagcc cgctccggga cagggttacg 2880 

gccaatccag cagagattct ggcaaagcat cccgggaatg agagtgagaa aggcccaaag 2940 

cagtcaaggc ccaaagacaa ccctcgaccc catctctcca aatgtcagcc ctcaagatct 3000 

caggctctct ggacctcaat ctccatgcca cttcaaaggc cccaaatttc agccccagca 3060 

gctccagctc atagccctag gtcttcagag atcactccag taactcccaa atatcacccc 3120 

agggatccag ttctgacagc caagaccact cctctcataa aataatcgca gatattaggt 3180 

gaagatcccc aaagccccaa ggaaacccta aatctcatcc tgaggacacc gatcccacca 3240 

aagccccgag gaaaccccaa atttcatctc gaggacacca gtcccaccgt taggaaccgg 3300 

gacctcaacc cacagcaccc ggattccgag aacagaggct tcggggccaa atgggctgaa 3360 

tccagtacct cactcccacg cccccgggtg gacagcaacc ctcctccacc gcgtcccctc 3420 

gtgggtttta ggtcctatat ttgagggatg tggcctcctc cttttatatc atgttggcca 3480 

agaatgatct acacagtcat catggaatat tgcatggaga taaggagtgg ctgtgcccct 3540 

gcctgaaggg ctcatcccat tcatgaggca ggatgaccct caaggcagtg ctactcacta 3600 

gagtcaaagc tcaggtaagg tgtttagggg ccagttgcaa catgaagcac attagccaag 3660 

gcagtctcaa agagtattgc cagaggaaga atctacacct ggctgtttaa tgaggttgaa 3720 

aggcaaatct aggtcaagca ggccaagtaa agaggtcagc ccagcacttt gggaggcaga 3780 

ggcaggtgga tcacctgagg tcaggagttc gagaccagcc tggccaacat ggtgaaacct 3840 

tgtctttact aaaaatacaa aaattagctg ggtgtggtgg cacgcgcctg taatcccagc 3900 

tattcaggag gctgaggcag gagaattgct tgaaccaggg aggcggaggt tgtggtgagc 3960 

caagatcagg ccactgcact ccagcctggg tgacagagtg aaactccgcc tcaaaaaata 4020 

aataaataaa taaataaaaa taaaaaagaa gtcagaaagc ccatgaaaaa catttggagg 4080 

gaagaagtta gtccaagaat aaaagcatta ggtcagggcg tctaagatcc ccaaggattg 4140 

gtcctggctg tgtagacact taaagagatt gtgcaaggct ggacaggagg gtgagctggc 4200 

caaaaaggtg atctgctaac tgtggaatca accaccgccc aatgtcctgg ataactgagt 4260 

ctgactcgca atggttaggc ttaccattgt aaatcagtct tgaagtataa acaagttcat 4320 

taaactaccc ctcctgttta cattttcttt tattccctga ttcatgtaat ctctgtagct 4380 

catatctaaa taaaaaataa taacatggac ttgttattat gcatgtcgtg gtttacttct 4440 

aactgtaaag tttctgatta ttctcctgtc atgctgatct tcatgtcaga agaaaacgct 4500 

ttgctattca ccatgagtgc agaaaactgc ttactagaaa cgattctaaa cagcaaagga 4560 

tgtatatttg aaataatagg gttgtaatga aaaagtacat atttctcaat ctcattcaat 4620 

acattgtacc cgtagagtaa aagatataga aaagcatctt atgttttaaa aatgtaagtc 4680 

tgagagtgaa aaacagagac aataaaaacc tttctctaaa tattattggt ggaaaagtct 4740 

caaagcaata gaaagtatat ataaaatgcc tgtgaatttt tgtgcaataa aatcatatat 4800 

atatatgtgt gtgtgtatgt atatttgttt tttttgtttt gttttgtttt tttttttttt 4860 

fcttgagacgg agtctcgctc tgtcgcccag gctggagtgc agtggcggga tctcggctca 4920 

ctgcaagctc cgcctcccgg gttcacgcca ttctcctgcc tcagcctccc aagtagctgg 4980 

gactacaggc gcccgccact acgcccggct aattttttgt atttttagta gagacggggt 5040 
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ttcaccgttt tagcggggtt ggtctcgatc tcctgacctc gtgatctgcc cgcctcggcc 5100 

tcccaaagtg ctgggattac aggcgtgagc caccgcgccc ggccaatgta tgtatatttg 5160 

taactcagat ttgnaaatgc agccacattc atatccacac atcaaatgca ccaaaaagat 5220 

tcactaagtt cctcatagtc agtccataaa ttxccagctt cttcacatat tcgcattttt 5280 

tgtcttaaca tgatataata gttaaaaaaa rttaaaaatt aataattaac caagtttttt 5340 

tttcttgtag actatatcag ccattattca gaaacattaa ttcattgttt taaaaataat 5400 

ttctctagag atgtttggca cttagtatat gaaatcattt ataattgaat aatacaaaca 5460 

aaacagaaaa aagaacttac ttttgggaca tctgaagaaa gtgacctaca aactcagttt 5520 

atccctgagt ctcattattt ttaggaaaat ggtctctatg txatctctat gttatgtttg 5 580 

acttgtcttg cttgagtctg ttttccagaa agattgtgat ccttggcctg ttgaatatga 5640 

aatttacatg atcactcttg ggaaaaattc cttttttagt gactgcaagc caggtcctgt 5700 

ggccgagaag catcagctgt acttggttga ctgactgtat tactctgttt tcatgctgct 5760 

cataaagaca tacctgagac ttggcaautt acaaaagagg tttaacagac tcacagtgtc 5820 

aagtgggtgg gggtgcctca caatcatggc ggaaggtgaa aggcatgtct cacatggtgg 5880 

cagacaagag aagagagctt gcgcaggaaa atttcctctt ataaaaccat cagacctcat 5940 

gagacttact cactatcaca agaatagcac gggaaggacc cacccccatg attcaattac 6000 

ctccaaccag gaccctccac aacatgtggg aattatggga actacaattc aagataagat 6060 

ttgggtgagg tcacagccaa ccatatcact gactttcatc atattataaa gaactatctt 6120 

gcagagagaa tctgctttcc atctctaata atttgtttat tctggatatt tcacataaat 6180 

agagttatat aatatttgtt gctggctttt tcatttagtg taatgttttc aaggttcatc 6240 

tgtgttagag tatgtacctt tgcttcattc ctttacatca ctaaataata ttcgattgta 6300 

tggatgtgcc acattttatt tgatctctta tcagttgaca gactgttggg tttttctact 6360 

ttttggctat tacgaatgat ggtacratat gaacattcac acacaagrtt tcatgggtat 6420 

atgtttttat ctttcttggg tatataccta ggtgggggat tgctgtcata tggtaacttt 6480 

atgtttaaca tcctgagga ~ 6499 

<210> 50 

<211> 4441 

<212> DNA 

<213> Homo Sapiens 

<400> 50 

ctagtacaag gtctttgtta agttcatggc ctgaaaattc tgtttccttc cggatttgaa 60 

agtaccttgg tttccggata gtatagaaaa atcaaacagt tagaaattga ataattacag 120 

cacaatccag gaggtttgcc agcttgactg agttctatca attttttaac cgtaagaagg 180 

aataaaatat acttgctggc ttttatgccg tggtctgttt cagagtagtc acctaagtcc 240 

acacctcatg tcagaatgag aagtgaatca tctggggaca aacgaggtgt gtatctgcct 300 

aacagcrtag tgactttctg gaatctaaat attaagtgaa aggacagggt cacagggcag 360 

ctgatctttg gctcgtctct ctttcgtctg cccatggctt tgatcatgct acatcctcct 420 

ttgtggccag ggctccacag agaatctcag ggtxatggcc tgttgcctag tcaagaggta 480 

taaacctgct gcttaaatat gtgaagccca cagtttgggg gttgggagtt aaaggggagg 540 

tagcctcata cgttccactt ttcccacaaa acccaaagca gactaaacaa aaccggtgtg 600 

aaatctcatg acttgatgct gccgtggtta tatttttcaa gctgtaatcc attcatttgt 660 

atgtgtgttt attcccagga tttacagctc gctcagtctt cggggggaaa gagaatattt 720 

gaaacctttt gcaaagggtt tgaagtatgc agtggcctgt gattaaagcc caggatagag 780 

tgtgtgtgtg tgtgtgtgta tgtgtgtgtg tgtgcattca tgtgaatgtg tgagtgccca 840 

ttcttgtggg gtttttttct cccatttgct gcttttacgc aaaagcatca cacatggtag 900 

catttaggag ccccttcaca gctgaagtgt ttcaaacact aatgaatgga agtttggtca 960 

nataatgcag acaagctrta aggcatgctt ractgaatgg tgtact:gt;ag tgacctggga 1020 

aaggaagaga ggtataaatt gtgtcggtta gccacccaga caaaatgaag ccatgtggaa 1080 

aatcaaatct attaaagtat gaagactgtt ataagctgtt ttaagctggc catrtcttaaa 1140 

ggggctcggt ctgctgcagg agtaaatgtt tacctcttcc cccagatctt tttcacatgc 1200 

cgtcttatga gacaatgtgc agcatttgat aaaaatcatc ctctcactaa cgacgctttt 1260 

atcggaaatg tttattgtct ccgctttacc acccatgtcc tgaaaggtac tgcacatttg 1320 

ttaaataagc aaaaggaaag agaactttgc ggctcaagcc ggctcgcaga aaccccaaca 1380 

aattccagag ccataaggaa tcagaaaaag gaaaagggaa ggaggagtta gatttatatg 1440 

aagagggggc cttgcaaaaa catatatatt agattttctc tgcttggcta gcccaccatc 1500 

aaatgtttgg tcgatgcgga ttagtattga gattgagcag ttaacgccga acttgggtca 1560 

ctgtctcaaa atgtgcttta ataaatacaa gggggaggaa atggatttgg gaaggctcgt 1620 

tgtcagctct gcccctgctc tcacactgcc tctgcacagt ggtgtagggg tcacacacag 1680 

gtgttggcac cagtaccaac ccgacccaaa tcctggtccc acacaccctg taattttggg 1740 

cacattactt gtctgtgcct tggttttctc agatgtaaaa caggaatatt aacagaaggt 1800 

gcctaagcat ctggtaagca ctctaaaaat accagctatt attaccagta tctggagggt 1860 

gggttcattt ataccttaag aaaggatccc cctaactcta tttttcttgt gtgccggttt 1920 

ttaaaactga tgaatggcat gctgtcggga aaaatttatc cattcctact ttttttctaa 1980 

ttggtgagta agcgtgcatt gccctgacat tctctggaca gcaaacaatt gaatttgctg 2040 

accagccgcc atgatgtcaa gccttaagtc aacagtggct aatgaccgct ctgggaaaaa 2100 
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acaacacctt 
atcgtcatcc 
ttggcaagag 
tccaggagga 
caacgttcga 
gaccacttat 
gtgggggagg 
tcaccttctt 
gggacacagc 
agtttgttga 
atcactgcac 
ggaagaacgc 
agtgaatcct 
ataatcttgt 
gctgangtat 
gggaagtaag 
ggagtaataa 
tacttcctac 
tttaaggaaa 
aataatgcat 
tgctanagtt 
tgtgtctttc 
gaaagacctg 
ttgacataag 
caaactgcac 
gttacaacga 
aaaaaactta 
agttaaagaa 
ttaattaaaa 
gaaaattttt 
cacatcacac 
tgaattcttt 
gaatgaactt 
gcagctaacc 
ttacatactt 
cttgaagctt 
tgtgttttta 
cacaaatgtg 
agtttttatg 

9 



gattccucaa 
tcttttgagg 
cagcccacac 
aactgaagag 
tttctacctc 
tctgtattgc 

ggggggtggg 

tgtaatccct 
ttaacaatgc 
tctgataagc 
tgcatgcaaa 
ggtgtgtaac 
ctgtttagaa 
agacacttac 
cctttcattc 
caaacaaata 
atttactcac 
agagccaaaa 
ctagacaagt 
gatgaactca 
atacatcaat 
tctctctctc 
aataccactt 
atggttgacc 
ttttagccag 
atttacagtc 
ctgggtaggt 
gcaatctcct 
atttttaaaa 
taaccaagtc 
aagatttgac 
aaggagatgg 
gttacttgca 
ctagtcaaaa 
ttgcaaaata 
tgtaggtgag 
agattagttg 
atttttgtaa 
cctttctctc 



ttacggttta 
caagcataat 
agtataacga 
acatcctcac 
agcagcagtt 
catggtcttt 
gtggggagaa 
tcacagtccc 
aacttttaat 
aagagtgggc 
caagaaacgt 
actgtgtaca 
caccaaagat 
ttgatgattt 
agctaacaaa 
ttgccaactc 
agcacttgtt 
tgccatttag 
aaaattatcc 
cctaattatg 
ttaaaaagca 
ttcctctccc 
acctcaaatt 
aaggtgcttt 
agatgcaata 
tagtacttat 
gattctaatc 
tactgtgttt 
tacttgrttc 
gctcctaggt 
tgtaatattt 
actaattgac 
gcattcatitt 
ctattttrtgt 
agtaaataat 
atgcaacaag 
aatataagaa 
tatgtctcaa 
ctagtgagtg 



seq_prot. 
agaagccctg 
gtgctgtgga 
ttgagcgtca 
aagagtctgc 
ggatcttttg 
ccactttcat 
atcacataac 
aggtttagtg 
tactgttttc 
gggtgagaaa 
gtcacacttg 
cctcaaatac 



txt 
ggaatgaggg 
aacaggttac 
tggctgtgcc 
cgaagaggac 
aagggagaag 
ctggggtggg 
cttaaaaagg 
aaaaactgct 
ttttttctta 
aaccgaattg 
tgacgtcggg 
caccccaacc 



aaggactaga tactactttc 
ttaacttttt atttctaaat 



ctagaaaagg 
ttctatttat 
ttcaggacaa 
caataaataa 
tctttgtaat 
aggtgggagg 
aaaaaaaaaa 
tctctctrtt 
aagcatatgt 
tcttcggctt 
tatccccact 
tacatgctgc 
atctgcagtt 
cagcatgact 
agcttctctg 
tcttaaggat 
aaatattacc 
ttgcaaagac 
gttttttcaa 
aaaagacatt 
aaataaaata 
ccctgctttt 
aatgcttgac 
ccagatttat 
tgctgacttt 



ttatgttcat 
ggatatcaca 
cacttcattt 
cacttgtcag 
ttaatgaaaa 
agcgaaatct 
aggggggggc 
cattgtgtat 
gttacttcaa 
gagttcacca 
actcaatact 
tatacacaag 
ctttttgtac 
atgtattttt 
ctagatttct 
aatrttcctc 
ctccaagtct 
ctacctccag 
tgtttgaaat 
tgatagaaag 
aaagccaacc 
gcataatgca 
aaatattttc 
tttaaacgct 
ttaacatggt 



ctcgccagtc 
ctctgcactg 
ctttgtgtgt 
tagggggcgc 
acactgcagt 
gtggggtggg 
actatattaa 
gtaaacacag 
acctactaat 
ggtttagtca 
cattcatata 
cactccctgt 
tctttttcgt 
gagacgaaat 
ttttcaaaaa 
catatcagca 
tcaggaaatc 
cctcagagca 
ggtacaacag 
aaatttcttt 
aatctctctc 
cagtttccat 
gtaatacgtt 
tctcttcatt 
acctctgaat 
caatgcaaga 
acttaattac 
ctatgttttt 
acattaactt 
aatcacacta 
gtacctcaaa 
acttcaaaag 
agttcaaact 
gaacacgttt 
ttcaaagaaa 
atcaaaaata 
atgtatttta 
tcttatgtag 
attatcaact 



<210> 51 

<211> 4343 

<212> DNA 

<213> Homo Sapiens 



<400> 51 

ctcaaagaac 
gctgctaact 
cagagaaccg 
agctatccaa 
tctggttttt 
agctttacat 
ccccttaaca 
tgaggtcaat 
gtcccttgaa 
ggcagacttt 

cgtgggtaac 

gatccaaaga 
ctttgacctg 
accgttactt 
aaacttgata 
ctcccctccc 
gt:gcct:tggg 
gcacgggcag 
cggtgttaga 
tgcatcacag 



taccaaatta 
gagtgaaggg 
aaaagacggc 
aggcacaaat 
tgttttattt 
atttttcatg 
gctggacaca 
tatttcccca 
caacactgct 
ccttttgcaa 
atatggtaag 
aggtcattcc 
gatacaacag 
ccaacaatcc 
aaagtatcac 
cctgcgccag 
cctgcgggcg 
agtgcgcatt 
ttattttctg 
cttcctaaga 



ttccctgggc 
tgttggggaa 
agctgtatga 
caagaaccgc 
taaagtgaaa 
tcagccccct 
tcaccgttcc 
gcaattgaca 
ccctcctccc 
ggagcccagc 
tggagaaaat 
acagagattt 
agaatttaaa 
gaattattgg 
actggccctt 
gtccccaaat 
gtgtttgttg 

tggggcgact 

atgagacaca 
aaggtaaaag 



ccctgaatca 
aagtgccatt 
aagggcttct 
aaccttggag 
ttccactgtt 
tccttgccac 
atttttctat 
aagaatcttg 
cctgccctcc 
agggcggtgg 
atttatattc 
caagcaagag 
gaaagagttt 
caatttaaaa 
ccccgggccc 
ccagaattcg 
ttgtgactgg 
ggrtcttggg 
tgaaacctcc 
gccctctttc 



gatggagctg 
gtgtaagtta 
gtgctgggat 
atatgacaaa 
tcccaagaat 
ctttgttttc 
tagttgtgac 
atgaagagga 
cggctgagag 
cggtggggtc 
atggtgaaag 
acactgactt 
txttttgtigg 
aagaaaagat 
tggccaggct 
gagccgcggt 
gcggctaggt 
ggaagatata 
agtgctttca 
ggtcccccag 



aatcattctg 
aggaaagggt 
tctcctttta 
tgaacacaat 
ttgcacatga 
cagcctgcct 
tgaagtagct 
aagtgacatt 
gagagagcag 
tgtggaagtg 
agcaggagaa 
gatattgtat 
ctttcctgcc 
tccgcccact 
gccacgcctt 
cactgagcct 
gagacggagc 
aataatgaat 
gcaggcattt 
tggcaaaatt 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4441 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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cgcagctgtc gcccagcacc acaatcaaat gcgtgcgggg ctgcgggaca gaggcgtcct 1260 

cgcgtaccct cggacagacc cacagctgtc tggcggatac acacgtgcgc gtctggtcga 1320 

ccggaccagc agctgcttgc gtcccctctc ccgcgcacgg tggcggcccc gcaccaagac 1380 

ccgctgagga aacgaagagt agggtcattt cccatcttgc cgtcgaaagt ttatcccatt 1440 

tcccgggtgt gcaggaagag atggcgcgag ggcaggaagg tgtgaaactg gggttctgcg 1500 

ggagacccac cgtggctcta tattcactct cgtttgaaaa ctgccagcgt tccactcccc 1560 

ctaaattcca gttcccgagg gggaggtagt gagatggqat acaaatacat ctaaatcctc 1620 

gcttgggagg ggttctccca ctttgggacc cctcgtggag cagcccataa ccgaccggcc 1680 

tgctgtacag aggccatgct taatggattc tcgcagataa aagggcctgg ctacctttct 1740 

ttcctataat ggcccctgag gctaatgtct taaagaggaa acaaagggat tgttttacag 1800 

cacacaaggg gggtggcggg aggaacaggg agaggaggag gaaggggcat tggatacgcg 1860 

gagcgggagg aggcctcccg cttcttaacc cttcacctcc tgcgaccgcc caggtctccc 1920 

gagctccgca ccctcatcca ggacccgaaa cagcggggag gaggcgacag tgcgtggagg 1980 

gccccgccct gtgactcgac ctccagctct cctcacctcc ttccccgtct ttttaaagaa 2040 

cttctgaaag ggagaacgga aaagatgagg ggacccatat ctcgtgagct gcagttctga 2100 

aaagtcagtt ctagaggggt cgtctctcgc gaggcccggg tgcccacaca ggatcgacgc 2160 

gtggcagtgc cacctgggcc cggactctgg gactgcggga ggcgggagcg cctggggcca 2220 

ggacctgggc gtacgctgag ggcacccggt gaaggccggg gaaagtgggg cagctccgaa 2280 

atgcggcctg cacgtcgccc tgggagtcgn tcccgggccc ctctgcagac acagccttca 2340 

ggtgtgagcg gtgtgtgcgc cggccggggg cgtctcttac aggaagttta tatttgcaaa 2400 

ctgggcggtg gggggggggg ggggcggggc gcggcgggag agagagagtc tcagtggctt 2460 

ctgctttctg gcttttctgt tctggctcca ggaaaagatzc aaaacaatiag ttaaatagca 2520 

attgaagaca agtgatgtaa gaaataaaca actcgacaac aaaaatgccc atagggacgg 2580 

ttttttaaaa ccctttatta tggaaatcag agcacatgca caaaaggaga gagaaaggta 2640 

ttgtgcagtg ggtctcaaat ttgatcaagc atccgaaccg cggagcgaaa tcctctggag 2700 

ggcttgttaa aacacagatt gctgggccct agccctagag ttttttgttg tgtattttta 2760 

agcagatctg gggtcagctc cctcctgaga atttgcaact tgttagagcc ctcaggtggt 2820 

gctgcttctg gtttggaaag gacactgcta tccactggtc caataaatct ccgcgtaccc 2880 

tctcccacct tcaatatgta caacacctcc ccactctgtt cattctatcc cctcccacat 2940 

tttgtttggt ttgctggaga attataaagc aattcccagc catcctatca tttcacctat 3000 

atagatgatt tttgattcgg tggttactta agccgagagg actgcaggct tccttgtggt 3060 

catttcacat tggggcagaa tctggtacct ctctctggga aatccatgcc catatcttgg 3120 

ttctgagacc acccttcggg agttcctttc cagcagattg tcacatgccc aaacatcacc 3180 

actgggtctg ctggcaagct ggcctggcct ggtcagggca ccacctggtc cccatctttc 3240 

attttctgct cattgtcccc aatctgtcat catgtactcc cacaaccatc gtttcccctg 3300 

aggtattttc ctaacaaggc gtctcccctc cagggtggag ttctgagtgc ctagggatgg 3360 

ggtggtaccc acaacagtgc aagcccatag gattatttgc atcagttagg tccaggactc 3420 

acaccccaag gggagcttcrt ctgggctact tgaccctccg cataattcct gattggaaaa 3480 

ggggacattc taggcatagc ttctgggaat ggccatcact gacatctgtc gtaacctccc 3540 

ttgattgtcg ggccctagct cagtctggct atctgtgttg cttctgttcc agagaggtgc 3600 

caagtgctct cttagactcc acctggctag tcctaagtgt gcagcctcta cctatctcct 3660 

tgcctctcct ttgggccgct ccgaaacaag cttcccttct ttccctcccc tcccctcccc 3720 

atcctggttt aggataccat ctgctccttc cacagctttc tcatcccagc ccctcccagt 3780 

cctatcccca cctcaaccct cagaggtctc ttgggctgtt acaaagttrta gacatttttt 3840 

tttatttttt aaacttttag agatggggtg tcacttactg tgtcacctag gctggagtgc 3900 

agtggtgcga attcggctca ctgcagcctc aacctcccag acgcaagcga tcctcccacc 3960 

tcagccccta agtcactggg actacaggca cgcaccacat aggcctgtta attrgttgtat 4020 

tttttttttt ttttgtagag acagggtttt gccctgttgc ccaggctggt cttgaactcc 4080 

tgagctcagg caatccgcct gccttggcct ctcaaagtgc tgggattaca ggcatgagcc 4140 

actgtgcccg acctagactc ttttattgtt ctctttcttt tcctgacuct tcatgaagca 4200 

agcatactat atgaattctt catttttcca tctgctgtta aagaaagccc cggggatgga 4260 

gtaggttcct ccaccccaga gctctgttcc tattttcata ttgcctcatc ttcctatgtt 4320 

rttcttcctt caccagcctg gaa ~ 4343 

<210> 52 
<211> 4476 
<212> DNA 

<213> Homo sapiens 
<400> 52 

cctctaattc ttactctaag aagactcaga gtaacaacgc agaaaataag cggcctgagg 60 

aggaccggga gtrcagggcgg aaagccagca gcacagccaa ggtgcctgcc agccccctgc 120 

ccggtctgga gaggaagaag accaccccaa ccccctccac ggtgagccgc accccccgct 180 

ctctccttcc ttcctgcggt ggggcctgcc ctctccaggc agctcttctc ttaattcaga 240 

ctctgttccc tttggctact acttctgctt atagcaggaa gcctcgctcc cagcagtaaa ; 300 

tgcagaatcc tttccttaac ctaccactgt ctgcttcagg tggaagggac aggaagcctg 360 

ttccatgaac ctggggggag aacctggctg tagaccactt tggctttctg atagaacgct .420 
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tgccctttat tccccacaga acagcgtcct ctccaccagc acaaatcgaa gcaggaattc 480 

cccacttttg gagcgggcca gcctcggcca ggcctccatc cagaatggca aagacaggtg 540 

agagacccgg gccctgcctg cctcactccc taggagccat gtctcacagg gtgatgtctg 600 

tcagcagcac cgtctcctgt ccctgccagc gcattgctcc ctgctccctg gagttccatc 660 

ctggctgtgt ccagtccagc tttcccctcc cctattccac gccattgcct cctccccatc 720 

ttcctctgac tgctacttgc agtttgccaa gtgtggggct gaccgtggcc atctcagcta 780 

catgctcgct tcttgaccac ggccagggca tggcagctgc cctcctctag acatgagcag 840 

ctaaggcctt gtgttggggg tcccagctca gggcagaacc aagagatgcc caccttgagg 900 

ggtgtacaca tagagggcga ctccagccat ccccatgaga ccagagctcc ccagccttca 960 

ccggccgcat ttcttggtgt tgcattcctg gctctatctc ttctgagttt atgaaagttt 1020 

cccctcagca acaccccact ctttctgtag aagaaactct cctgttctta aaattcttag 1080 

gaggccagtg cagcctggag gcagcggccc cttgtctgct ctccttcatt tctgattcct 1140 

cttcccaggc actgacccac ctcgctgctt cccgacctca ctcacctcca cttctcagcc 1200 

ccgcattcct cagttctgac ttgcatcccg ctgctgccca ggcctgactt ctaccctgcc 1260 

agagctcccc agctctggcc cttcccctgc ccttgcttcc taatccaggc ctcccgccct 1320 

cactcacccc taacacgggc ctctccgctg cttttgtttc ctagcctaac catgccaggg 1380 

tcccgggcct ccacggcttc tgcttctgcc gcagtctctg cggcccggcc ccgccagcac 1440 

cagaaatcca tgtcggcctc cgtgcacccc aacaaggcct ctgggctgcc ccccacggag 1500 

agtaactgtg aggtgccgcg gcccaggcaa gtgtgctggg gcagctggtg cacctgctgc 1560 

cctcagccca ccctaccccc ttgccccaac aatttcttct tcccacttgg gggtcctgct 1620 

gtgttcttgt catcttagcc acaagaaatg ggtctgtccc ctgcggccag gaagtggagg 1680 

gaacaaaaaa gagcattaat gcccctcttt tccagttctc cctctcagaa caggtatgca 1740 

ggaagctgtc ctaaggctcc aaagggaaac ctttttgttc tgaaccttcc agggtttcct 1800 

tagggacccc ggggatagtc ggcatcacag ggactcaatc ctcaagggtt ggtccccatt 1860 

gccgccttga gggtccagtc tgcccggctc ccagggagcc cgctgtctcc agcctaaacc 1920 

acactccaca caggggtcct tccttgcctc cctccctccc ttcccaaacc atctccttcc 1980 

acttccacga gacttcctrc tcaccactgt cctcagtagt cacacccttc cttctgtgtc 2040 

ctcgtgatgg ctgcctctgc cctagcatcc ccctccctgt ccccaccaca gggtgtccag 2100 

gtgcccagtg atggctgtcc tgtaccctaa ttcgtccccc tcaaccccac ttctcttccc 2160 

acagcacagc cccccagcgt gtccctgttg cctccccatc cgcccacaac atcagcagca 2220 

gtggtggagc cccagaccga actaacttcc cccggggtgt gtccagccga agcaccttcc 2280 

atgctgggca gctccgacag gtgcgggacc agcagaattt gccctacggt gtgaccccag 2340 

cctctccctc tggccacagc cagggccggc ggggggcctc tgggagcatc ttcagcaagt 2400 

tcacctccaa gtttgtacgc aggtaagcaa ggagctttgg gtggcagaga ggctcaggcc 2460 

aggccttcct gctttactcg gggtgggttg ggggttgggg gttggggttt gggacactct 2520 

gtaccggtat tgggtcctgg ggttagaaga ggcttcagga agcacaagaa attaggtctt 2580 

tgtcaacacc ttatgtgccc aggcccaccc ctcttaggcc tctccccaac tcctcacagg 2640 

cacccctcat tctctggccc caagcagatg gccgatgccg cctcctctct aggagagtgt 2700 

gaactcagat gctaaaataa aagccccccc ttctctcctg ggttcccatg gaaacttata 2760 

tttggtgacg cagctgcaaa gtcatgaggc atgagccagg ctggggccag caaggaaaat 2820 

tttgtcctgg tctcttgccc ctttgactgc ctctcccact agttggttct gtttctggct 2880 

gcaggcgcag ccatgccctt ctgcccgggg gttttagggt gaaacctata aatgaaatca 2940 

ctggcgaggg cctacagtgg cctcttccct aacctaactc cgatgtgcca aaggtttcct '3000 

gtgttggacc cagggtgggg atctcttcac ggggtttctc acacctgagc ccccagccac 3060 

cacagaggtg cagcttgaag tgcatccagc caactggctg gcctcctggg atgctccgca 3120 

tccccatcct gccattcctc tccctgcctt ggagtagcag ctcaggaagc agcaggggct 3180 

ttgagagaac aggctcgcct gcccttcctc tacgtttcac tccactctgc tggaggagcc 3240 

aaagccactg ccccatccga gccccagaat gcaagtgtga ggcctgcaga gagtgtgggc 3300 

aggtctgaaa gcctgggact ctagtctcgc tgagcggctc ttccgaaaat gggatgaccc 3360 

ttgaacctgt aaagccacct cccccacctg cttatccaca taccgtcttg ttggtttttt 3420 

ttttatttct ttttttattt tgtctttttt ttgtttgttt ttttagaaat ctgtctttca 3480 

ggtttgccag aaggtaggcg ttgagcccgc tgtgtgtgtg tgtgtgtgtg tgtgtctgtg 3540 

tcctgtgtcc tgcctccatc actaactccc cttttctggc tcttactctc ctccatctgc 3600 

ttaaccaagt ctgtgtggcc ctcctctctc tgccatctta aagggatgaa gactgcctct 3660 

gattgggcat cagcacaagg cctgccctcc gtgcccccag tacaaacagg cagggctaag 3720 

aggccacatt ggcccactca gggcaaatgg ctttaaaaat gagggccttc cttgggccca 3780 

cagttaacgc ctgtcctcaa gtaaggggag actgtctcag ggaagcctcc ctttaagatt 3840 

gtctcctctc acccacccca ccccaccccc actccctctc accccaggtt ttggtcacaa 3900 

gtgttgggat cctttcctgc cctttccctt gtcatgtgca tgcgctatga ggaagctcca 3960 

gggttacaag tgcatctggg atggtatctt gtttgttgtc tcctgggttt cctgaacttc 4020 

agagctatgt gacctcttcc ccgtggccta tgggatcgca ggactttgga gacactacgg 4080 

ggaccctggg gccccaaggt ttcagtctgg ctcccccaga ccttaggagc ttttgtctca 4140 

caaatggagc acagcacccc ctcctggcag ctcctgcaga actagcccca cccacccgca 4200 

cccctgcccc agcacccctg ccaccagcag catctggata aatcaagcct cttctcctct 4260 

aggctgtttt ctccagatat ggcctgtctc ttccaaagtg cggggagctg ggacattcta 4320 

gggcaacggc cattctaccc caagccgtag caaaacaaca ggagatctct gcacccttac 4380 

tcaggggtct cccttcacag tccccttcct ggctctttca cccctggcct tatgctcatc 4440 

ctctctgcag gcctcgggga ccaaatacca agctga 4476 
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<211> 6435 
<212> DNA 

<213> Homo Sapiens 
<400> 53 

attttctctc ctactttctc acagtccctg aacctgtggg atagactgtg gtagtaatgg 60 

taaatcgatt tgcaactagt ttcaggtgta cacacacaca cacacagcct gtcacactct 120 

cccagtcaca gccattcgtt tttctcagtc tttgctgcag tctctcacgc ctgctttcgt 180 

acacacaatt gcaccttctt gtatctttgg tctcggtcac tggtttggtc tcccagccat 240 

tctctcccgt acaatcaagt tcgcacccat acagtcttac agctttagct gacactacag 300 

cctcaagtaa acagttttgc acatcatcta tttctcaaac gttgagtcgc ctacagtgag 360 

gtttgtcacc tacacaatct ttgccacaat ccctgacaac ctctactgta gtttcatgcc 420 

cacgtagtct ctcacatgtg gttttggtca cccagtctgt cttgctcgcc atcaaacagt 480 

gacagggctg cagtccccgt tcccttctcg tagacattat tttgtgcata ccatcacagg 540 

ttcttgtaca gtaccttgac tattcatcca ggctcagtta ttttctgaaa cacagccaca 600 

atttcccaca tagacacacc atcaaaagtc tgtggtcaca acttctgcac actctcacac 660 

acaatcacag ctttccacgt ctctgacaca cacatggtta ccgatatgca gtccctacta 720 

ttgtttctca cacacgccat cacaccgttt ctctcacaat cgacctctgc ctcacaccct 780 

ccacacacag tcatgcgtgt cacgctcagc tagtcctgac aacatcagag aaaaagtgcc 840 

agcagcctcc agcatcactt tgaaaaatat atatttacag gaacaattcc ccattctctg 900 

ggactcttta gaaaaaaaaa aggtccattt tggggaagta aaacaaacag tggagacgag 960 

tgtagcactg tcccccaaat caccaacccc caggtcccaa ggcctgggct gggccagcgc 1020 

taacaggtga gcccaggagt cttttgaacc cactctctct tgcctagaat agagacagga 1080 

caggctttat gtcccccatt cctccctccc aactccaggg acattgaaag ggtcctttgt 1140 

acccgcccgc aggaaattgg ggggctggga gggagggaac tgaaaattca tgtttggtat 1200 

caaaaataaa cacrttggagt gggggaggcg gcaggaagat: tcccccccaa atcccttcct 1260 

tcccacccac cccaccaaat ataggaagag atgactccct ctcccctatt gaaaagcccc 1320 

atttaaaaat agattatact atcaaaatgg cagcggggga gagacaggga gacctggagt 1380 

actggctgga ggggcccccc agacggggac caccccccaa aaaaaccctc cattgggagg 1440 

aaacaggcag gaccccaggg aggttggcag acaaaggaat ggcttctcag ggggaagaac 1500 

aaaagggaca ttcctccctg gccaaaaagt tggttgaaaa ggataagctg tctgagagaa 1560 

aggttgggga ggtggaaatt tctattccaa gggtgtgatt cttccttggc caagactccc 1620 

aacccattaa atggtacaaa ttcttcctgg acccctgagt atggccagga ataataaaac 1680 

aaaacaaact aactcttcta ctcacatcca aaaacgcaga cagggctccc tcatctcccc 1740 

aagggcaggc tgagaaataa agaaagaaag gtcgttctca agccagacct cgattgtctt 1800 

gtcttgggga aaaaagtggg gaggtggggg agatgtccta accaccctaa aaggttaggt 1860 

caggctcctc ccagtggcct tcaaaaaata aataaataaa ccgcccctac cattgaagta 1920 

ggagacgtgt aagggcggcc actgggggac tgacagaaaa cagagaagtt gtggaagctg 1980 

tgtgttttct gtggaggaga taatgagagt tactcttggg agtaattctt cccaaggatc 2040 

cccaccccac cacaatcaga gcatgagtct ttcagttgaa ctatgtttct ccttgagatg 2100 

gctggctgga gagagcatgg atggagtgac cccagaaggg gtggtggtgg tggtggtagt 2160 

agtcatggct tctggggccc tggggagagc accacggggg tcgagaggcc atccacgctg 2220 

atagaaggga tgtgcacctg ggcgctgcca ctggatggaa actgggggca aaaagaggga 2280 

gaggtcacag atgggggcca ttcttgggtzc tctcaccctg tccaaaacgg ggaaagagga 2340 

tcctacctgg aaggagagct tggccgggct acggggcgca atgggactca gggtgctcca 2400 

gaagtgaatg ctaggaggca gcgagctggg tgtcagcagc accggggtct gtggggagag 2460 

ggtggtcgga gctcaagtga gtagaaagta gatgtgggat gctggaaggg acttggtccc 2520 

ggccccacca cagatrtccc actcacactc ccagggtcac tccccaaagt gcacacacct 2580 

agatggtaac catggaggcc atgcctatga tctctcaaac tctgcagaca ccccccactc 2640 

cccactcccg agagcccacc ctcagttgct gagacctcat tggtcaagac cacccacaag 2700 

accctatcca aatgctgctc tcaaacctct caagtcacca tcccaacagc tcagggtcaa 2760 

cttccaactc atgtcccacc catgaaaccc catatgcaca tcccagacca aaaagccctg 2820 

ctcaggttcc ttaagccaca ccccaaccta gaaacagtcc cacagaaaga ccctcggtcc 2880 

cacccacaaa agtccccagc cctgcccttc actaatcaaa ccccatccct ccagccagca 2940 

tcagcccagt caatcacact caaaggctcc accctcttca ggaacaacca cactcaccct 3000 

cctctgattc ctgtcagtca cccgccacat gccaatccca ttggtgctgg cgcccacaca 3060 

ccccacccct ccgcaggcac tcaccaatgt atgcgtagga agcagggatg gggtcagcgc 3120 

cggccccgga gcctggaggc cggagccact tcccgatcct ggggtccgtt cgggtcccgg 3180 

cccacctagc aggctcgggc tgagtggaag ctctaggtcc cggggcttcc ggccctitctg 3240 

cggctgggag atctcagggc tggaagccgc atggccgccc gcctgccctg cggtgtccat 3300 

aacaaccgcg ggcagccggg ctggcacgcc ctcctgtgga gggacttctg gctcggcctt 3360 

ggtggtttct ggcacaaacc ctctctcccc cgcaacttcc aactcttcct tgggcccttc 3420 

tactttcact tctgggggca aagcccggcc caaacccggc tccacattaa gctcttccga 3480 

tttcaggttt ggggcctcgg gcggggtcag gatgacctgg tagaggagca ggcacaaaga 3^40 

gggtgtgtaa ggattgttgg gaccacagga ttctcctgtg ggccacccag tgatttcccc 3600 
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gggaaggtgg cacttggtac tcagggcatg actgcacatc agagcaagtc ctccaacagg 3660 

acataggact tcgtcaagga agtagggctt cctctggggc aggtcctcct atgggacagg 3720 

ggacttgggg tgagacaacc tgaagcgggt aggaatctga gggataccat cctcattcat 3780 

taaaacagtg gttctcaaaa atttaccttc ctctggacca gcaatatctg cctcatttgg 3840 

taacaagtta aaaatgcaaa ttattgggcc cctccccagc cccacaaatt cagaaactct 3900 

gggggtgtgc ctggcaatct gtggtttaac aagacctaca ggtgattctg atgagcacta 3960 

gagtgcgaga accagtgcat taaacatatt ccaggcacct acatgttcgg cagctactaa 4020 

ctgtagcaca gtattctccc ctttctgtct caggatttgt gtagctctct gccccacccc 4080 

tccttctaag aggatgtctt ccttccacca ttatctcctt taacgggaca gctatgtctc 4140 

ttctaccagc ctccctcgtc atccggggat tacacatccc tcctctttac tgcctcctct 4200 

tactgaattt ctatattcca gtccttacct gcagaggcaa gccggcctct tcagcctcca 4260 

gacaggcctc caaggggctt ggactggtgc tcctgctccc cgaggggggc gctgctgccc 4320 

ctgcaggagc tgcactgggg agcaccacag caggccgagg atgagggggt ggctgcggct 4380 

gcagagactg gatggtgaag gtggaataga ggcccgagcg catgtactcg ttccggctgc 4440 

tgcgtgccaa accgcctggg cctgccattc ctgcaccctt gggtgtgcct ggctttccag 4500 

agacagtgtc ccctggggcg gcatgtatag cagcaggggc cacatttggc atggtggagg 4560 

taacagacac ctctggctgg ggcgggcagt cctcagtgga gcaccctgcg acctcagggt 4620 

aggacacaaa cttgtagacg aacttctggc cgctcacctt gcggatgatg ttctaggaag 4680 

ggaggagagg aacagagggc cttagaggaa aggggcaggc tggaggcagg gtgtcagggg 4740 

ggaggagggg ggtggagtgg catttctctt ctcctagtgg ctttcttccc acttgctcct 4800 

ctccggcttt cccgtccctc ctctgtaagt ttctctctca gcccatttzcc tctcccattg 4860 

gtrtctacct ctttcctctc ccgccagart ccccttatct atatccctcc ccagttcctc 4920 

ccctttctct cttctctcca caataatttt ttctttaagc cccttcccat ccaccccttt 4980 

ccaagcctcc ctttgtcact ctccaggtcc gtctatctga ttttcccact ccctccacac 5040 

tgtagctcct cttcacaatg gggcccatga agggcagtag gtgtgggagc aggtcttctt 5100 

cctggctcca agaaccagag gttcacaggg ccaggccttg tgggtgactg Hcaggacaca 5160 

gttttggaga ctagcaaatg caataagaca agaaaatgaa gtaactgata taaatgttgg 5220 

aagaaagtaa attaatgtag atggatgggg tgatgaatta ttaatacaga ggattatttg 5280 

aagctaatga tacagaagtg gaacagggca aggtcactgc agaggagagg gtgaggaggc 5340 

ctcattatta ataccactgt ggaggttgaa gctaaaatgg taagaaaatg aagtaactaa 5400 

cagcaattag acaagaatat taaaaatact ggtataaata ttgggagggg gcctatatca 5460 

aggctggctt tgctgggtca cagagcatgt ttggggtttt tcaggtctgg tgggtctcag 5520 

agatgaccta gtgttcccag agggcagagt gagatagcct cattactaat attgttctgg 5580 

a gggttgagc tcaaggaata aaatatgaaa atggaataaa tgccataaac tatgaagggt 5640 

gtgcagcata tgaccagggc agttctggga gtgtggaggc actagggcat tctgtggaag 5700 

tcttagtgct ggtcagatat gtcacagggt gggcctgtgt ttctcggaag aggaataaaa 5760 

agggatattc atgaatattg tttgggaaat ttcagtagaa ggattaagaa aatgaaataa 5820 

ctggtataaa tattgggggg aggagaaacc cagcagtggc ctgggggtct gagagggcag 5880 

gatgggggaa attttaaatt ttagctgtag gaataagata atgaaaacaa ttgatataaa 5940 

tattggggta gaccagtgct cctctgggaa tttaagaaga cagtgtgtgg aagttggcat 6000 

taaacatgtt gtttcagagg ttttagctta aggaataaga gaagaaaata attagttaaa 6060 

aatgttggga gtcccagagg acgaagtaag caaaagttgg tataatactg tcagggaggt 6120 

tttgattcta ggaatatgaa atattctgga ataagagaag actatgaaat aactagggaa 6180 

gcccagcact ggtccagggg tttcagagga cagaatgagg accagccatt attaaccgtg 6240 

ttttcaagat gttggctata ggaatacggc aggaaaatga aacgttgtgc atggcagggc 6300 

agcatcactt tggggatctc agagtgggag tagaaagagt cagtattaat aatgcagtgg 6360 

tggttttcgc caaagcagta agacaagaaa gtaggtgaga tagtgtttdg gggcccaata 6420 

etggtcttag ggtac 6435 

<210> 54 

<211> 4406 

<212> DNA 

<213> Homo Sapiens 

<400> 54 

ccagcctgga ctccccacgc ttggggaaaa aggctgcatt tggagcccga ggcacccatc 60 

acctgcccac cccaggcaag ggtcgttcac actcaggggc tgaggacatc cccgatgcag 120 

ccacctcagt ccacgggagc tggaggtcct ggccttggcc cctcagacat ggcaccgcag 180 

ggcctcgtgc tttgcaagtt tctaacaaat gttggcttta tgggtatttt aagacagaag 240 

agagaccgga atccatcccc cccacaccca tgcaggccac cctctccagc tgcctcctgc 300 

ccccagggca gcggccacca ccccacagcc tggtctgtgg tgctgttgtg gacggctgtg 360 

gcacctgtcc ccatgcccac catcggctgg ggtctcatcc caggccacac agtcattgca 420 

cagcagggcg ggttgtccag gtggcagagc ccgggatccc caagcccttc cacggccaag 480 

ggccccacac tcaccagctt cacgcagatg acgaagggtg gccgggcggc tcggaagtgc 540 

aggatgggga ttcggggctt gggtctttcc tigagagtgac agtggccatc agtcctgccc 600 

gagcctggag cttgtccccg gagagacgcc ctcagggtgc acgcagccac caagttggct 660 

cacacaggag ccagggcttg ggcagcagat aacgtgccag ggtcacagac tgcggcaggt 720 
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gttcagagca ggaacccggc tcccgccacc cccgcctgac acccgactgc agaactgacc 780 

agggtgggct accccaaggc ctagacccac gagggacagc ggggttctcc ccccaagaat 840 

gggggctggg ctccctggag gccccgagga gccagcacgg cccaggtgga ctctgccaag 900 

ggggaaggct cggagcagct gtccctcagc cccggagcag gtggttgacc ccaagggtga 960 

caccatgcgg ccctgcacac ctgagagtcc tcgtggcaat agaagccttt cctgatcttg 1020 

ttctcgtcca cgtcacagaa tgccaccacc tggggacaag cacagaaggg cagctctgga 1080 

caggccgggg ccagggcctg tgctgtggga cagtacgggg gcagggcagg cagtgttgga 1140 

gggtggcagg atgctcacct tgcgctggct gccggcagtc aagctgcggt acagccggcg 1200 

cccctgcttg ccagcgttcc agatggtgaa ggccgcccag cggggcaggg cctgctcttc 1260 

caggaagcgg acgcggtggg tccagatggt cgtcctgcgg tggagaagag ggtgataggc 1320 

aggccggagg ccccacagat gcacacagat gctgacatac agggacatga tgtgtgctca 1380 

gctcacgttg ccagtggagt ggacacctgg cccaatgcct aagagagact ggctgtccta 1440 

cagtcaggcc acctgcccca gctggacgta gccacatgcc taggtgggga tccacccctt 1500 

ccattagacc ccaacaggtg gggacagatg agaagctgtc acttaagaat ctagaaggtt 1560 

ctggaaggca gaatttctgt aggacgcaga ggactggact tgacccaagt ccagtttccc 1620 

agagaaacct ttgtccataa ggacacatgc aaccatccca taaaatccca gcgttcacac 1680 

ttggacacga ccttctgcct cagggtcgct gccccgaacc cttccactcc cctgggaacc 1740 

tggtctctgt tctccctctg ctgtgacaag txgaccccag gctgctgtcc cggggggagc 1800 

aggctccttc ccatgacctg ctgctctgcc cagcgcccag ccccatcgct gcccgggccc 1860 

ttcccagccc cgccctctct cctcccaccg cagcccctcc ccacgtgccc catcgctgtc 1920 

tgcctgtgtg tgcctctccc tcaaccctgc tcctggagct gcccgggctg agcccgtctc 1980 

aaagccccag gagctggacg ggccagaagc tagaggggca ggggatgcag aggccaacgg 2040 

ggccagaacg gcgtgggatg tgtcccagat gctgtcctcc ccaatggccg cgatgtagcc 2100 

acgctgaagc ccggtacccc acagaccctg ccctggccga gtcctgggat gccctctgct 2160 

gtgtcaaggg agctgtgtcc gtgggagtct cctccggggt ggggtctggg ccagacacaa 2220 

ccaacactgg ctggtgacgg gcctcccctg agcagggccc tcgtgggttc agcctcggga 2280 

tcggggcagg atttcgctgt tctacgattc tgctgctgcc tccgcctccc gggataaact 2340 

tgtgactccc ccccaaccct gtggcacccc ctgtttgtag cagaaccccc acaggcaatg 2400 

ggatgggggg gatgggggga cacggggtcc cctctgtcag ctcagggagc tgcagggctt 2460 

aggcactgtc agggaggtcc tgcctggagt ctaccttttc ctccccaacc ctctgcagcc 2520 

acctcaatag tccggacccc acctgggaca cagggagatg cttatcccag tcagtgactt 2580 

ggggacactg aagcatgtgt gtgtgtgtgt atactgtgtg ggggtgtatg gtgtgtgtgt 2640 

gtgtacactg ggggtgtatg gtgtgtgtgt gtgtgtacac tgggggtgta tggtgtgtgt 2700 

gtgtatactg tgtgggggtg tatggtgtgt gtgtgtgtat actgtgtggg ggtgtatgtg 2760 

tgtgtgtgta cactgtgggg gtgtatggtg tgtgtgtgtg tatactgtgt gggggtgtat 2820 

ggtgtgtgtg tatactgtgg gggtgtatgg tgtgtgtgtg tgtacactgg gggtgtatgg 2880 

tgtgtatgtg tgggggtgta tggtgtgtgt gtacactgtg tgggggtgta tgttgtgtgt 2940 

gcacagtgtg catgtgtggt gagtgcatgt gtacatgagc acgtgtacag tgtgtgcagg 3000 

tgttcggagt acatgcctgt atgtactgtg tggatgtgtg tacacgcatg tgtacacgtg 3060 

tgtgcagtga gggcctgtgc atgcagtatg agtacatgta tgcagtgagt gcacatgcct 3120 

gcatgtaccg tgcagatgtg tgtacatgca gtgtgtgtgc acgtgagtgc gtgtgtgcag 3180 

tgagcgtgtg cagtgagtac gtgtgtatac tttcgtgcac gtatgtgtgt gcatgctgta 3240 

catgtggtct gtgtgcacgt gcagtgtgtg tgtgtgtgtg agaacctggg tgaggacagc 3300 

tcctgcagac ggctgcgggg acttccccac caggtgacgc agcctcgccg ggttttggca 3360 

tcttgtcctt agcagcccgg gtgtgttggg tgtctcgggg cggggggacg agcctgatcc 3420 

tcctcatcca ctgacatcag cttagggacc ccagggggtt cacgtgacag gaaagcagtc 3480 

agggctccag aagaaagcgg ggggcctctg tggccgtggg tggcacgcga ggtttggggg 3540 

gcctctgaca ccggcctcct cgacggcccc ttgtctcctg ggctttccta ggggtctctg 3600 

gtctgaaaat gttggcccat cgtaggaagg taaaatgtag aagcagccat gatgtctgta 3660 

tttggttcat cttcattttt cacattttaa aaagttctca tttaatcagg gagaacaggc 3720 

tccccagggc gccctatgcc tctcttttta catcctcaga ggaagccgtg gccccgcctt 3780 

agcaccggga aaggtggaaa cccagttttt cccaaaagca caagacagca tttctggaaa 3840 

tgttgcaaag gagctgaagt gtcagggaca cgtcaccgca tgaggaggcg gccaaggtgc 3900 

cggtgggcat aagggagagg agcgcggacg cgggaggccc cagcatgcag gagtgaggat 3960 

gatgccccca atggggaagc tcgtggccct cctgccaacc ccccagcccc accccgtgac 4020 

cttggacctt agaccccctg gggagctggg atcttgccga gagtctcaac cccatcacgg 4080 

caggcctggg accatcgcgg ctcccacacc cacacatgac cctagttggg gtcttgagac 4140 

tccgggggcc ctgagtaccc gtaggggccg agacgcaagc ctcggacggc ggcctgcttt 4200 

ccaacttaac gctgcggcac gcggcaggca ggtggccgag acctctgtgt cctcacagtt 4260 

ctaaggaggg aactgtctgc ccggatctaa ggaaggaatt ctacatgcag tttcaccccg 4320 

atttgcctca cccctgccac ggctccaacg cagccacgcg gcacccccag acccgggccc 4380 

acgtggccgc tcctcacagt gacacg 4406 

<210> 55 

<211> 4417 

<212> DNA . x 

<213> Homo Sapiens : i. ■ . ' "„ 
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<400> 55 

gaaaccccgt ctctactaaa aatacgaaaa gttatccggg cttggtggcg ggcgcttgca 60 

ggagaatgca gtgaacctgg gagcgggagg ttgcagtgag cagagatcgc gccactgcac 120 

tccagcctgc acgacagagc gagactccat ctcaaaaaaa aaaaaaaaag aaagaaagaa 180 

agaaaaagaa aagagggtgg agatggggga tgacatccag ctcaggaggt gtccatggtc 240 

tggccttccg tggggagaag gaaggccaca cgattggtgt ggcccagggg gcagggcctc 300 

agcattctaa gccgagctcc ctcttccacc ctctgaggtt ggcactggca gtccaggtgg 360 

gggcttgggg acctgaatgg gatgaatggg ccaaggggga tgtcatctct tccattctcc 420 

ttcatccact gcctctccct tcccctcctc ccccccctcc cacccccggt ccatctacct 480 

cccatcccag ccaggagccg tcacctaagt agaaaagggg cctctggaaa gggggcgggg 540 

ccttgactct tgggtaccct gcgcttgaag aggaactctg ggaaggggtt gttcagggat 600 

ttggcccctt ccccacctgg ctgtataacc ttcctgttct tttcttcccc tagccggttg 660 

cccctccctg tcctgagatg tcaggaaaga gggggccacc tgcgtcctcc acagtggcct 720 

ccgaagcctg gggttcccag ccccagagct cagaggtgaa ggaggtgcta cagctctggt 780 

gaccactggt tgtttcccag tcttctccat gccacccttt cccaaaacaa caaaacaaac 840 

aaacaaaatt gggcctggcg cagtggttca tgcctgtaat cccagcactt tgggaagccg 900 

agacgggcgg atcacaaagt caggagatca agactatcct ggctaacatg gggaaactcc 960 

gtctctacta aaaaatacaa caaattagcc aggcgtcctg gtgggcgcct gtagtcccag 1020 

ctacttggga ggccgaggca ggagaatggc aggaacccgg gaggcggatc ttgcagtgag 1080 

ccgagatcgc gccactgcag tccagtctca acaacagagc gagactccgt ctcaaaaaaa 1140 

aaaaaaaaat tgattggaac atcctccaag atgcaagact ctcagttcct cagagttcta 1200 

caggaaggat ggcagagtgc agttgcccag agttgaagtc ccatctctgc catttgttgg 1260 

ctgtgtgacc aggcacaaat cattaatttc tctgagcctg tattttacca tctgttgcta 1320 

ttgagtaata gtagtggact attttctatt tttattttta ttttatttat tttatttttt 1380 

tttgagacgg agtctcgccc tgtcacccag gctggagtgc agtggcgtga tctcggctca 1440 

ctgcacctat gcttcccggg ttcaagcgat tctcctgcct cagcttcctg agtagctgga 1500 

actacaggcg cgcaccacca cacccagcta attttttttt ttttttttag tagagacagg 1560 

gtttcaccat gttggccagg ctggtctcga agtcctgaac tcaggtgatg cacctgcctc 1620 

ggcctcccaa agtgctggga ttgcaggcat gagccactgc gcctggacga acgtttttaa 1680 

ttatattatt attattatta ttattattat tattactaat ttttgagatg gagctttgtt 1740 

tttrgttgccc aggctgaagt gcaatggcgg gatcttggct cactgcaacc tccgcctcct 1800 

aggatcaaga gattctcctg cctaagcctc ccaagtacct gggagtatag gcatgcacca 1860 

ccacgcccga cgatattttg taattttagt agagatgggg tttctccatg ttggtcatgt 1920 

tggtctggaa cccctcacct cagctgatcc acccaccgca gcctccgaca attacaggcg 1980 

tgagccacca cgcccagccg tccaccctgt tttctaccag agttctgtac tgtgactctg 2040 

tataaaatag tttggaagct ggacccaccc tgtgtgtgtg attgccctga gcccacagaa 2100 

agacacctcc agagtgcgga ttgagaagcc tttattgtgg gaggatcggg gtgtcctagg 2160 

gccccgggag acgggatgga cttggaaggc tggggggagg ggcctttgag gaagaggagt 2220 

cctggaagcg ggggtcatca caggtcaagg ggtggtcctt gggacccccg cagtcagtgg 2280 

tgctgcggcg gcagagtgca cattgacagc tgagagccac ggcgtaggag accacggggt 2340 

tcacgccgcg cgggcagcca gggagccgga tggactcgaa gcgcacatcg cggtagttgc 2400 

acaccacctg aggcagggcc ggcaggaccc cctgcagcac gcgggtctgg aagccgtgtg 2460 

agtgggggaa tgagcatgtg cctggggcca gcgcctcagc tgagctcccc agctgccccc 2520 

acaggtctca gactcagggg tccaggaagc cctctgttcc tgccctccca caccccattc 2580 

cgcagcccct gaccagagag gcagaccacc cttcctcccc cgctgcctct gtgggtctgg 2640 

ccctgaggtg gcagcacctg ccctggcccc gggcagctca ccatggtggg gcagtagccg 2700 

gcacagatgg tggtgttgac ggtgatgcac acggggcagc cctccttctc cacagccagg 2760 

gtggcattga tggggcggca ccgtggccga agcggctcct tggatgccca tgtcccgccc 2820 

atgctcagca gcagcaacag cagcagcctc tggggcaagg acactgcttc acccgggtct 2880 

gagactgcag cccccagtcc tggccttccc atcccgcatg gtacaccacc cacaaagacc 2940 

cagagaccct tcccggcatc ttctattcag gacccaccac ccggacacct gcctttcaga 3000 

gcccacccca cagcccagag gacctgagat accccaacat ttcagatccg caccctcagg 3060 

aactgaccca cctgaagctt actgggggtc acgctcctcc agaaagaggc ctccttccac 3120 

agctcacacg ggtctgcccc ttctcatgcc agtgatggcc tggaaggagg tggaaggtgc 3180 

ccaggggccc tgcagtcttt cctggaacat ctccatcttt ggtgcccgga aatgtggatc 3240 

taccctacct ttgacatgtc tctctcttag cgggatatct tccgcaagca ctgggaatgt 3300 

ggacatggaa agtaaattga gtctccgtgg gggagtgaga cagggagtga ggggtgttgg 3360 

acgcggcacg ggaacctggc cagagtcagc ggacccaatt ggctgctctc tctcagatgc 3420 

agttcccctt cctccctcca gggggcgcca cggaacgcag ggccctcact ggccctgggg 3480 

actgggngac gtcagggatg agcctcttgt gattggctcc atcaccctgc gtaagatcaa 3540 

agggaagaaa ggatgggccc gacaaccgga gccattgtgg ctccggccga ttgagcctgc 3600 

cctcgggctc tcaaggctag gcgagggtgt gagggactgg agttcgaggt caacctgggc 3660 

cccattggta ggaaaaaaaa aaaaaatttt tttatcgttc cctatataac aacaaaacat 3720 

aaagggagga cgccttgata ggaagaaatg acatcttcct aagtgttttt aaattacttc 3780 

aatgtatctt uttttttttt tttgggatac cgagccttgc tcggtcctcg ggggacctcc 3840 

ctgcctgtca gccatggcgg tcgtcacctg tggtaaacag aatggcggga gcaagtggga 3900 

tgticccacta ctcgggctga ggcaggagaa tcacaggaac ctaggaggtg gaacctcctc 3960 
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gcccaggagg ggcgcggctt cggacttagt ttctgcccaa tgagagaggg cctccccgtg 4020 

actgtgctgc caggtrtagcc acttgaccct ggtgccgcca acgagggatt cagcccgagc 4080 

cccacctctc ccttagggac ctctgcccac tctaccctca agccaggatg cccggagcgg 4140 

tccccggaaa tgcgtgtgct tcgggtgatt taactgatta ttgaataggc ccgcaggagg 4200 

tgtgtcctgc ccccgaggcc gcagcctcgg aggacattat ctggacttag tcccttcccc 4260 

gcgatgccat caagctggac aattttaagg izctgtttctt tcccaatgta gggtatagga 4320 

tggcacaggg aagagggcta gaaacctgac rtgagctccc cgcccagggc tgaactctcc 4380 

agcatcctga tcacttctca ttgaaccrtg cttatac 4417 

<210> 56 
<211> 4420 
<212> DNA 

<213> Homo Sapiens 
<400> 56 

tccttggcag ctctgggggc cttcccttgc anttcgaagg aaggggaggc gtgtggctgg 60 
aggtgagagg ccctgaggcc ctgcctggct cccagcttcc caaatccgcc tctctctggg 120 
tcgccgccct cagagattgc tgaggacacg agctttccac ccacgggctg tcctaacgga 180 
gccgggctag gggcttctgg acagaggccg cctgcccctt tgggctttgg ggtttgtttt 240 
tctcagcagc tcacccgggt ctctggggtc agggtatcgg gcagaagtca gaggacggtg 300 
cgcttgggac ctgagctctg ggcagccctc ggggctcagg actcctggct cttctgccca 360 
gtttcatgtg ttttrgtgttt tgaaaccacg tgtggtccca cagttgtgcc ctcagaagcc 420 
gccgtggctg ccgtgaggta gcagtggacg ggacgcccac ggtctttccc atgccatgct 480 
gtttccagcc gggccccatc actctcccac aagctgaaag gaagtggctt ctccaccagc 540 
cctggctggg agtcctgtgg gcgaggcctt ccccaggagc tgccctccca agtgccccag 600 
gcactaacaa gaccccacgc gtgctctccc cacagccggc ttcgtccagg cgccgctgtc 660 
ccagcagagg tgggtggggg gcagtgtgga gctgcactgc gaggccgtgg gcagcccggt 720 
gcccgagatc cagtggtggt ttgaagggca gggtcccaac gacacctgct cccagctctg 780 
ggacggcgcc cggctggacc gcgtccacat ccacgccacc taccaccagc acgcggccag 840 
caccatctcc atcgacacgc tcgtggagga ggacacgggc acttacgagt gccgggccag 900 
caacgacccg gatcgcaacc acctgacccg ggcgcccagg gtcaagtggg tccgcgccca 960 
ggcagtcgtg ctagtcctgg aacgtgagtg gcgggcacct ccctccccgc ctccctcagt 1020 
ttccctcctg tgccgctcgc ctcccggctc ctgctcagta gaacccagac gcctcctccc 1080 
ctctccgtcc cgctgtgccc cgttgggccc accgcctgga cggaggggcc cggacctgaa 1140 
gggggtgggc tcgccgcctg accaccagcg ctgggcggcc tggctcgggg caggcagggc 1200 
tctgccctcc agcaggtggg tcctcgtcct cccctcgacc ctcgtgccat ccgttccgtc 1260 
gtttctgctt ccccgggcgc ctgcccggct ccctccctga gcgtccatac tgcaagcctg 1320 
aggggccctc caagctcagc ccaggcttgc aggtctcaga accttctgtg tccttcgcag 1380 
catctcgttc taagggattt tcaaatttca attcctagcg ggaatattgt aaactaacca 1440 
agaaccaaga gatttgcttg gttggggatt ggcagaagaa ccggggtgct cccagcactg 1500 
ctggagtgcc gtcagcctgg ggctgggggg atgtggggcc acctgagtgc ctcctagaga 1560 
agggcacggg agccttggcc gggggtgctg tggctatttt tttttttttt cctggagatg 1620 
gagtcttgct gttcccaggt ggagtgtggt ggcacaatct cggctcactg caacttccgc 1680 
ctcccgggtt caagcgattc ttctgcctca gcctcctgag tagctgggat tacaggcatg 1740 
agccaacatg cccagctaat tttttgtatt tttagaagag acggggtttc accgtgttag 1800 
ccaggatggt ctcgatctcc cgacctcgag atccacccgc cttggcctcc caaagtgctg 1860 
ggattacagg cgtgagcctc cgcgcccggc caccaccgcg cccagctggc tacgttttta 1920 
gaaaagaatt tccagtttgt gctggggctg ctacaagagc tcgatgccct ggtccctgct 1980 
cagggacgtg ggtgggtgcc ttgtctgtcc ccacaccctg gtcccagcct cgtgtgtctc 2040 
tgggcaggaa catcgccagg cggggcctct gggttcccag cgccttagcc cagggcccgc 2100 
cagctgccag cgaagggtgc cttcccgaag ggggtgcctt cccgaagggg cctcctcagc 2160 
aggcgctggg ctcttccctt cccggaggcg tggccagaag ttccccttgg gcctccggtc 2220 
ctcggggcgt aagggccacg gtgtattttt gggatgaaaa ccagtccttc ccggggagga 2280 
gccgcaggtt cctgggggtc aggcaggcag cgcgggccgt gctcctccac tctgctgacc 2340 
gcgtctcgcc gggccttgca gccggcacag tcttcactac cgtagaagac cttggctcca 2400 
agatactcct cacctgctcc ttgaatgaca gcgccacaga ggtcacaggg caccgctggc 2460 
tgaagggggg cgtggtgctg aaggaggacg cgctgcccgg ccagaaaacg gagttcaagt 2520 
gagtgcctga ccacgccatg ccgccacctg ccccttctca cggctctctt gccgccagcg 2580 
gcctgtggtc cgtgagaaca aaagaccggt cggcaggctt gatccaggcg gaagttaggg 2640 
accagctccg cgcagacccc cagagggaaa ccccagggag gggtctgagg ggcgtccttg 2700 
tgaggcaggg ggctccagca gccctggcag gagtccccat tgcttgctcc tcactaggtc 2760 
ccgggcacga aagggcgcca cactgccagg tcccgggcac gtgtgcgggc ctcccaggct 2820 
ctgtgctcct ggcacaagag ggcaccacac tgaggcatgc cctgctcgga ccccagacgc 2880 
tgtcatggcc ggagctttgt cataactggg gctgggacgg ccctgggagc acagagcctt 2940 
ttgttttcag gctgactggg aagacggtca cagggtgtgg ggtgcagagc agggaccaga 3000 
ggtgtggggt tcatcagtcg gggagaaggg ggtctgttgg ttictctcacc tgtggggtct 3060 
tgpcagcctg gctxtgtcct tggggccgca cggggacccc aggagttctt ttgaggttca 3120 
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gacttgactg 
gggtccttgg 
ggtggactcc 
cacggccaac 
tgtcgggaga 
ccctggaggg 
ggctcctctc 
aacacatcaa 
tcactgactg 
gctgggggtc 
acccagccct 
actgggtgag 
ctagactggg 
ccggacccag 
cccggaccca 
tccggactgg 
ggggcctaga 
gggtcctgga 
aggttcttcg 
gaggccgacc 
atcatcacgc 
gctgaggtgc 



ggacagtttt 
aggcgtcctg 
gacgaccagt 
atccagctcc 
agttgttggc 
gaacagccct 
tctcaccctc 
cgagggggag 
ggcctggtac 
ctggacccag 
caggactggg 
gggcctagac 
ggtcccggac 
ccctctggac 
gccctcagga 
gtgaggggcc 
ctgggggtcc 
ctxagccctt 
tgagttcctc 
ccggccagta 
tccgcgtgcg 
tggtgctggt 



gctttttcac 
cagagctggt 
ggggagagta 
acggtgagtc 
ctgaggcacc 
cctgcgggag 
ctgtcagggc 
acggccatgc 
aagatcactg 
ccctcaggac 
tgaggggcct 
tgggggtccc 
ccagccctcc 
tgcagccctc 
ctgggtgagg 
tagactgggg 
cggactcagc 
gcctttggtc 
gcagggccgg 
ccggtgcaac 
cagccacctg 
caccatcatc 



seq_prot . 
tgtgccgtgg 
acgcggctca 
ctcctgcgtc 
ctgcagccag 
cggcacatcc 
gccggggatg 
ctcccagagt 
tggtctgcaa 
actctgagga 
tgggtgagag 
agactggggg 
ggacccagcc 
ggactgggtg 
cagacngggt 
ggcctagact 
gtcccggact 
cctcaggact 
ccctaggccc 
tcagagctac 
ggcaccagct 
gccgccctct 
ttcatctacg 



txt 
ttgggcccct 
cctgcctgct 
ttcctccccg 
gggtaccggg 
cagagccctg 

ggggcgggcc 

gaaggctgtg 
gtcagagtcc 
caaggtgaga 
gcctagactg 
tcccggaccc 
ctccggactg 
aggggcctag 
gaggggccta 
gggggtcccg 
cagccctccg 
gggtgagggg 
tcatgaacgg 
acattgagaa 
ccaagggctc 
ggcccttcct 



ggagaaccct 
gtggttgcag 
agcccatggg 
caccaccgac 
gccccctgct 
tgcggttcca 
aagtcgtcag 
gtgccacctg 
agccaaggag 

ggggtcccgg 

agccctcagg 
ggtgaggggc 
actgggggtc 
gactgggggt 
gacccagccc 
gactgggtga 
cctagactgg 
ctccgagagc 
cctgaacatg 
cgaccaggcc 
gggcatcgtg 



<210> 57 

<211> 4258 

<212> DNA 

<213> Homo Sapiens 



<400> 57 

ctctcaaagt 
cttattggtt 
ggggttctgt 
ctaggacaaa 
catatggctc 
ttcccacccc 
ctatctgagg 
tgttttctta 
gtgcagaaaa 
aatgccnaac 
aattcatgta 
agcagttaaa 
cccataaatg 
cgtgatctgt 
caggagatag 
ctgtttgggg 
cctcagacca 
gcggcggagg 
ggacccgggc 
cggaggcccc 
ggcatctacc 
tggggactcc 
gttcgctggc 
cgccccctga 
gggtctgagg 
aatagaagcc 
gtgggagtgg 
ggtgcgggtt 
agctgctctc 
gagaagagcc 
tcctagcgga 
ggcgtcggtg 
cgcactcgcg 
tcgggagcgc 
ggagaaggtg 
gttgggggtt 
accatgatgc 
ccctctattt 



gctgggatta 
tctctgcctt 
ctaaggagga 
tgcagactta 
cagccatatc 
gctcccttgc 
ctcccgcagc 
tgcagggcac 
gacacccagt 
ccttatatag 
atccacacaa 
taacttgctc 
aaccaagaac 
gcgctttaca 
aggctgggag 
tccgggagca 
aactacaact 
gcgcaggcgg 
cctggctccg 
gcggcgcacg 
atatgggggg 
gtgccgcacc 
cgcgctccca 
gaagggtacc 
ggctcctccc 
cccgggaggc 
tggggagagg 
ctgtggattc 
cgcgggctga 
cctacccacc 
tcctcagtcc 
ctgaggagcg 
ctccgggccc 
tccaagaccg 
cgcaaggggg 
ctctgctcaa 
tctacgctgg 
tcctcgtgga 



caggcctgag 
tggtctcacc 
gagcccacca 
tccttcgctc 
ctcagaccat 
aagtcctact 
ctcctctgaa 
ggaaagatct 
cagcgacttg 
tgctaattcc 
ccccaaaact 
cagatattaa 
tggaaggagg 
tctaagatct 
ggacacggag 
agggcgtggc 
cccaggaccc 
ctgggcgcct 
gccccgcggt 
tccgcagcct 
cat:ccgggcc 
ctcccaagcc 
gggcccgggt 
cgggaccccg 
ccgacagccc 
taggtccttt 
tcgcccgggt 
ggatccttac 
gcctgtcagc 
cacaccccct 
tagcggccac 
gtcccggggg 
gctcgacgtc 
tcacctcctt 
cagccagccc 
ggcctctctc 
ccgctctcag 
tcctggagct 



ccactgcgcc 
aaacgcccca 
gttaaaaccc 
tctgctgccg 
ctgggctgtt 
caggctgcac 
agtcctcatc 
attcaactca 
cattatctgc 
acgcctggca 
gatgatatct 

ggggtcgagc 

acaagagctc 
tccagctccc 
agcctcgaga 
ctggatgcgc 
agcgggcgct 
ggcgagtgga 
aagtrggggcg 
ccatcacagc 
gaaccaagtg 
ggccccaggg 
ttgaaggcgc 
gggcgctggg 
ncccaccgcc 
gggcgcggcc 
ctggggagac 
aacttcctct 
atcctcgacg 
ngccccattt 
cgggtctgaa 
cgggcttccg 
ggccaccgac 
ttacaaccag 
agggtccggg 
cctctctagc 
gacggcagcc 
ctcccagaca 
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tggcctaaag 
ctctaaatca 
ttcagtagtt 
cccctctcct 
gtctgtcctg 
ctacacagct 
acagccaatg 
ctgcccaatc 
aggcatgagt 
ctgttctagg 
cttcatctta 
tggggtttga 
cgagaaggag 
agggagcccg 
gccgtgtgga 
gggcgcccgg 
gccgcccacg 
ctgttcgagc 
accccagcct 
gcgggcgcgc 
acccgcgtgg 
gcccagggct 
tgggcaggca 
gcgaggtttt 
agtagagcct 
tgtgtgcatc 
cgatgcacag 
tccccgcccc 
caccctggtc 
tgggtcgcct 
aggagcaaga 
ctccggcccc 
acacaccacg 
tcggccatcg 
atgtaggcgg 
cctcagtccg 
accttctggt 
ctcaggctcc 



taattgtctr 
ctgcagacaa 
cctaactacc 
tcccagatcc 
ccacacaccc 
gtctcagttt 
atactgaaac 
tcctctcctg 
gaataataat 
cactcatata 
ccaagcactg 
agcctggtta 
tcaggtaggg 
tttcatagag 
ggaagcggtg 
gacggcacgt 
cgacgtcacg 
ccttccgctg 
actcagtccg 
agacggggct 
ggggtcccgc 
ggtgtcgcac 
ggggcagccc 
cgggctggaa 
cgggttgggg 
tggggagacg 
gtggagagat 
ggtagatggg 
cctgaagtcg 
gggtcctcag 
cgatgatcct 
tcctgggacc 
tggagatggc 
acgcggcagc 
gagggagagt 
cctaacgccc 
aagattcacg 
agccccgcct 



3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4420 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
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tcccttctca ttttctccca gaaaagtgct cggtacctgc agcaagaact tccagtgagg 2340 

attgctcacc gcatcaaggg cttccgctgc cttcctttca tcattggctg caaccccacc 2400 

atactgcacg tggtaaggta gagaggacct taggtcagcg ggccaccctg ccccgggggc 2460 

aagtggggag tctggggccc agagtggcag acgattgctt gcctaaaggt gtcagggcca 2520 

cacaggattc aaccccaggc cttcagaagc caaaggtgtg tattcacgga gcctggaagg 2580 

gtcgaagtgg gggtttgatc acgtggtcga ccagctgggt ggtgatcccc atgggtaggt 2640 

gggggtggct gttctctgct cagtgcccat gcggctttgt gaattcccac acctcttcct 2700 

tgcagcatga gctatatatc cgtgccttcc agaagctgac agacttccct ccggtgagtg 2760 

ctgggccaga gcagggtgag gggctgagag gttgggcttg gaccaccctt cctcatgact 2820 

ctgtgacctg cagatcaagg accaggcgga cgaggcccag tactgccagc tggtgcgaca 2880 

gctgctggat gaccacaagg atgtggtgac cctcttggca gagggcctac gtgagagccg 2940 

gaagcacata gaggttgggg cagcaaagga gaggccgggc ctgctggggg tgggaagggc 3000 

acgggattct gagacctcac tctttacagg atgaaaagct cgtccgctac ttcttggaca 3060 

agacgctgac ttcgaggctt ggaatccgca tgttggccac gcatcacctg gcgctgcatg 3120 

aggacaaggt ggggctctgg gacctgagac ccacctggga acattaagtg agacagagga 3180 

gactgggctg gggatccggg tcaagggcct gggggctgag gctgtggggc tggtgctttg 3240 

gggcagttcc gaagttgcca gcatcttggg gtggggctag gggcgtgggt agtcctgacc 3300 

tcctttctcc ggccagcctg actttgtcgg catcatctgt actcgtctct caccaaagaa 3360 

gattattgag aagtgggtgg actttgccag gtgaggcaag aatggctcag ggggtgggca 3420 

gacatctggg gcagggaagg cttgggtctg agcccttgcc cggggcatga tctgcgggga 3480 

gcagggtttc tcaaccatgg cactattgac atttccagcc agataattct ttgtcacagg 3540 

ggctgccccg tgcacgttag gaagttcagc agcatccctg gcgccagcag tactgcctag 3600 

ttgtgacaaa caaaaatgtc tctgcacatt gccatatgtt acttaggggg gcagaattgt 3660 

ttccagttgc aaaccactgg tggaggggcc cctgactgaa ccctcgctcc tatccgcaga 3720 

cgcctgtgtg agcacaagta tggcaatgcg ccccgtgtcc gcatcaatgg ccatgtggct 3780 

gcccggttcc ccttcatccc tatgccactg gactacatcc tgccggagct gctcaagaat 3840 

gccatgaggt ggggtggctt gatgtgctgg cttgggggcg gacaggaacc ggggtgcttg 3900 

tacctactgg tctttcccct ctgcatagag ccacaatgga gagtcaccta gacactccct 3960 

acaatgtccc agatgtggtc atcaccatcg ccaacaatga tgtcgatctg atcatcaggt 4020 

ttgccctgag tgggagttga gctgaggtgg atgggatggg ggtctaggca ctgtttctga 4080 

cttgatttag gaccttgagc cccttcctgc cccattctgg gacttggtcc ctgaccagac 4140 

aaactattct ctgaatcctg agatggccat gagctgctta ttaatggatc tggggccagc 4200 

tgcaggccta ggtatcctgc ctctgtcagc agctgaggag cttgaaattg agaaatag 4258 

<210> 58 
<211> 4435 
<212> ON A 

<213> Homo Sapiens 
<400> 58 

cgcaggcgcc cagggctcac aggctcggag cggcctccgc ctctgccgga ccacccaggg 60 

ccgccccgcc ggaggggcct ccccgccccc gagccctagg tcacgcccgg aaggcccggg 120 

gcccccgcgt ccggccacgc catccgcccg gcggccccag ggactcagcg ccggggtcag 180 

cgcaggcccc gcccagcgct cccgccccct ccagctggcc ggcccccggg accgcgcccc 240 

cgcccgcgct ctccacgctc gcactcgtat gggcagggcg gcccggggtc ccccagctcc 300 

gagccgcggc cccttgaagc gcgttacctg gacgtgcgct gtccgcagac tctggacgag 360 

ctgggcctgg gcgctgcgcg gcggaaccga ccctgtcctg gaccgcaggc agcagctgac 420 

cccgcgccac cggcctgcgg gcgagacctc aggcgcgaga cgggctgtgt gtgtgtgtgt 480 

gtgtgtgaga gagagagaga gagagagaga gagagagaga gagagagaga gtgtgtgtgt 540 

gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtcggggg aggggcccga ggacccgctt 600 

cctagctgag cgtcaggacg gggggcggac tctcaggagc cgcgcggcgc ccgcagagga 660 

gaggaagcgc cggagacccc cctgctccca cccggcctcc ctgccccgct tcctttcccg 720 

gggctccaga gaggggagct ccaggcgtgg ggcgcctggc ccacgtggtg cggcgccggg 780 

cggggcaggc gggggcgcgg cgagagccgc ggagctttct ttatgggtgg gggggcgggg 840 

gcggccgggc tccgcctccg ggggatcggg gcgcgctccg ccaatgggta gccgggcgcg 900 

gggcggggcc gcggggccgc cgcttaaaga aacttgttgc gggtcccgca gcgggacccg 960 

agcctcggcg gcggcggcgg cggcggcagg ggcgagggtc ggggccaccg cgcggcgacc 1020 

tcgggtcccg gagcgaccgc agggcagccc cgggcgccgg ccccggtgcg cgtctcctgt 1080 

gcgcgcccct ccgcgcgcgg ccccgatgct ggacatgagc gaggcccgct cccagccgcc 1140 

ctgcagcccg tccggcaccg ccagctccat gtcgcacgtg gaggactcgg actcggacgc 1200 

gccgccgtct cccgccggct ccgagggcct gggccgcgcg ggggtcgcgg tggggggcgc 1260 

ccggggcgac ccggcggagg cggcggacga gcgcttcccg gcctgcatcc gcgacgccgt 1320 

gtcgcaggtg ctcaagggct acgactggag tctggtgccc atgccggtgc gcggcggcgg 1380 

cggcggcgcg ctcaaagcca agccgcatgt gaagcggccc atgaacgcat tcatggtgtg 1440 

ggcgcaggcg gcgcgccgca agctggccga ccagtacccg cacctgcaca acgccgagct 1500 

cagcaagacg ctgggcaagc tgtggcggtg agtgccggcg ccccgggggc ggggttcggg 1560 

acc^ggcc^cccctggtct cggactgcga gcgggggcct ggagggcgca gagctcgcgg 1620 
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aggtggtggc cacacgcagg ctccgggctc tgcaccccgg gcgggtgtca gggcgggtcc 1680 

cagtggaaaa acatcctctg gccagccggg gtccggcgcc agctatggga gtcgggatcc 1740 

gcggagggca gcgcctgggg tgtgcaccct cgnggcgggt gcgctctgga gccagttctc 1800 

ctggcgggga acaccctgcg gagggccatg gctggcaccc cgacgtgccc tccagccgtt 1860 

gcgagtggcc ccagccccgg gaggcacgcc cggcacgaca gcaagttgca tigcggctctc 1920 

"tggaggtgct ccctccttag aaagggctca gctcctctag ctcctggacc acccatgggg 1980 

agggagtgcg gcaggcggat gggtgggcag caggtttgcc tgcgagtacc ctccccgcag 2040 

tttctccctc tcctggacac cctcggtccg aagcagggag ggcaggagtg gagatgacgg 2100 

ggtgcgccgt gtgcacgcct tggcgtggtg gggccggtca gttttccaga aattaggact 2160 

gcagagtgaa gaacttgttg ctcaggacag gccgccccca ccgagtgtcc txtggttggg 2220 

tttcaaatgg ttctxttggt ttggggctct ggttcccgta tcctgctcct cacctggaag 2280 

aagggagctt ggagcttgcg tatttgcacc tcctgccttc tgtctggatc tagggcgcag 2340 

ccacggggcg ctgtggacgg caggcccctg ctcccccaag tcctgccgca caggcggttg 2400 

gaggtgagcc tcgggaagag tcccaaggag tgtgtxctgt ggctcagtga ggacgggcgc 2460 

ctgggccctg ccagggcttc tgggtgtctg ggaatggagg gacagcagcc cagctgcccg 2520 

ccgaaggccg gcaccgttcc cgggaagcct cgcttttgtc caccgccggc tgggtcctgc 2580 

ccgcctcccc cgcctccctt agaaagcccc agagggaggg aacagcaggc ccggaatagg 2640 

gggaaataga aggaggaaga tgggaggcgt ggggcttgtt cttctttggc tgtcgtaagc 2700 

ataccaacct tcccaaacag gcgccggccg gttctcccag aggagtgtac tgcctggtgc 2760 

cacgggaggc tccggagcgc acggcaggcg ggtttccctg gtcagagcct cctggccccg 2820 

gcctctgctg ccggttcccc ctgtggtgtg tgcctgcgcc gagggcacag tgggcgccct 2880 

ggccatccct gcctctgccc tgtgctgcag cttgctgagc gagagcgaga agcggccctt 2940 

cgtggaggag gcagagcgcc ttcgcgtgca gcacaagaag gaccaccccg actacaagta 3000 

ccagccacgg cgcaggaaga gcgccaaagc cggccacagc gactccgact cgggcgcgga 3060 

gctgggaccc caccctggcg gcggtgccgt gtacaaggct gaagcagggc ttggagatgg 3120 

gcaccaccat ggcgaccaca caggtgggct ccaggccccc cgcatatctg agggnccctg 3180 

tatgggaggc agctgtgggg tttctggctg agaagggtgc atgatggtgg aggggggcag 3240 

gcgccctcga gagaaccggg ccttccccgg gttccctgaa aaagctccca aggccccggc 3300 

cgcccctgtt tttggtgcac gagggacctg cutagccacc ttgagtgctg aggggctgac 3360 

agctttgctt ggcagggacc ccgagaggtg gtgttttcag caagactcag tccaatgcgg 3420 

gctcctcttc ccacaccgag ctgctgccca cacggcactc agcagcccca gcttcagggc 3480 

cagactgaca gggtggagtc ctgccggggc ctcggtcagg ctggctgacg cccgccagcc 3540 

txagagtgct nggctggccc ggggaggaac agggtctgtg tggtccccca gcgccctgta 3600 

aaccccagct ctggccaggc tgggcctgtc ggcttcttcc ccccaccttg gtctctggga 3660 

ggggcctcgg gtcttctcat agcacagcgc ttcatggaat tttctcggcc gaggagtgag 3720 

gtctaggtct gactttgctg gaatttctgg gaaatgtaaa gaaacaaaaa gccgttgatc 3780 

cccctggcag ttagcgcggg cgtccctccc cttccagcct ggccggcccc acccgcccca 3840 

cagaaggggc attcatccct gaagcctgct ctcctgtccc cagggcagac ccacgggccg 3900 

cccaccccgc ccaccacccc caagacggag ctgcagcagg cgggcgccaa gccggagctg 3960 

aagctggagg gacgccggcc ggtggacagc gggcgccaga acatcgactt cagcaatgtg 4020 

gacatctcgg agctcagcag cgaggtcatg ggcaccatgg acgccttcga cgtccacgag 4080 

ttcgaccagt acctgcccct gggcggcccc gccccacccg agccgggcca ggcctatggg 4140 

ggcgcctact txcacgccgg ggcgtccccc gtgtgggccc acaagagtgc cccgtcggcc 4200 

txcgcgtcgc ccaccgagac gggtccccca cggccgcaca tcaagacgga gcagccgagc 4260 

cccggccact acggcgacca gccccgaggc tcgcccgact acggttcctg cagcggccag 4320 

tccagcgcca ccccggccgc ccccgccggc cccttcgccg gctcacaggg cgactatggc 4380 

gacctgcagg cctccagcta ctatggtgcc taccctggct acgcacccgg cctct 4435 

<210> 59 

<211> 4784 

<212> DNA 

<213> Homo Sapiens 

<400> 59 

cttggctgca ctgggactca cagagccaca ccctggaggg cctccccctt gacactgata 60 

agggtgtgca ttacatttca gtgagcgcta cacggctggg ggccaacggg agccacatcc 120 

cccagacctc cagtgtgttc tccatcgagg tctaccctga agaccacagr gagctgcagt 180 

cggtgaggac agcctcccca gaccctggtg aggtggtatc atctgcctgt gctgcggatg 240 

aacctgtgac tgttttgacg gtgattttgg atgccgacct caccaagatg accccaaagc 300 

aaaggattga cctcctgcac aggatgcgga gcttctcaga agtagagctt cacaacatga 360 

aattagtgcc ggtggtgaat aacagactat ttgacatgtc ggccttcatg gctggcccgg 420 

gaaatgcaaa aaaggtggtg gagaatgggg cccttctctc ctggaagctg ggctgctccc 480 

tigaaccagaa cagtgtgcct gacattcatg gtgtagaggc ccctgccagg gagggcgcaa 540 

tgtctgctca gcttggctac cctgtggtgg gttggcacat cgccaataag aagccccctc 600 

ttcccaaacg cgtccggagg cagatccatg ctacacccac acctgtcact gccattgggc 660 

ccccaaccac ggctatccag gagcccccat ccaggatcgt gccaaccccc acatctccag 720 

ccattgctcc tccaacagag accatggctc ctccagtcag ggatcctgtt cctgggaaac 780 
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ccacggtcac catccggact cgaggcgcca ttattcaaac cccaacccta ggccccatcc 840 

agcctactcg ggtgtcagaa gctggcacca cagttcctgg ccagattcgc ccaacgatga 900 

ccattcctgg ctatgtggag cctactgcag ttgctacccc tcccacaacc accaccaaga 960 

agccacgagt atccacacca aaaccagcaa cgccttcaac tgactccacc accaccacga 1020 

ctcgcaggcc aaccaagaaa ccacggacac cccggccagt gccccgggtc accaccaaag 1080 

tttccatcac cagattggaa actgcctcac cgcctactcg tattcgcacc accaccagtg 1140 

gagtgccccg tggcggagaa cccaaccagc gcccagagct caagaaccat attgacaggg 1200 

tagatgcctg ggttggcacc tactttgagg tgaagatccc gtcagacact ttctatgacc 1260 

atgaggacac caccactgac aagctgaagc tgaccctgaa actgcgggag cagcagctgg 1320 

tgggcgagaa gtcctgggta cagttcaaca gcaacagcca gctcatgtat ggccttcccg 1380 

acagcagcca cgtgggcaaa cacgagtatt tcatgcatgc cacagacaag gggggcctgt 1440 

cggctgtgga tgccttcgag atccacgtcc acaggcgccc ccaaggggat agggctcctg 1500 

caaggttcaa ggccaagttt gtgggtgacc cggcactggt gttgaatgac atccacaaga 1560 

agattgcctt ggtaaagaaa ctggccttcg cctttggaga ccgaaactgt agcaccatca 1620 

ccctgcagaa tatcacccgg ggctccatcg tggtggaatg gaccaacaac acactgccct 1680 

tggagccctg ccccaaggag cagatcgctg ggctgagccg ccggatcgct gaggatgatg 1740 

gaaaacctcg gcctgccttc tccaacgccc tagagcctga ctttaaggcc acaagcatca 1800 

ctgtgacggg ctctggcagt tgtcggcacc tacagtttat ccctgtggta ccacccagga 1860 

gagtgccctc agaggcgccg cccacagaag tgcctgacag ggaccctgag aagagcagtg 1920 

aggatgatgt ctacctgcac acagtcattc cggccgtggt ggtcgcagcc atcctgctca 1980 

ttgctggcat cattgccatg atctgctacc gcaagaagcg gaagggcaag cttacccttg 2040 

aggaccaggc caccttcatc aagaaggggg tgcctatcat ctttgcagac gaactggacg 2100 

actccaagcc cccaccctcc tccagcatgc cactcattct gcaggaggag aaggctcccc 2160 

taccccctcc tgagtacccc aaccagagtg tgcccgagac cactcctctg aaccaggaca 2220 

ccatgggaga gtacacgccc ctgcgggatg aggatcccaa tgcgcctccc taccagcccc 2280 

caccgccctt cacagcaccc atggagggca agggctcccg tcccaagaac atgaccccat 2340 

accggtcacc tcctccctat gtcccacctt aacccgcaag cgcctgggtg gaggcagggt 2400 

agggcagggg cctggagacg acatggtgtt gtctgtggag accggtggcc tgcagaccat 2460 

tgcccaccgg gagccgacac ctgacctagc acacactgac acaggggcct ggacaagccc 2520 

gccctctctg gtxctcccaa accccaaagc agctggagag actttgggga cttttttatt 2580 

tttatttttt gcctaacagc ttttggtttg ttcatagaga attcttcgct tcatttittga 2640 

tggctggctc tgaaagcacc atgtggagtg gaggtggagg gagcgaggaa ccatgaatga 2700 

actcgcaggc agtgccgggc ggccccctgg ctctctgcgt tttgccttta acactaactg 2760 

tactgtrttt tctattcacg tgtgtctagc tgcaggatgt aacatggaaa acagtaacta 2820 

aagattaaat tcaaaggact ttcagaagtt aaggttaagt ttttacgttt aatctgctgt 2880 

ttacctaaac ttgtatgtat aatttttggg tgggtatggg gaattgcttt gctaaaaata 2940 

agctcccagg gtgtttcaaa cttagagaag accaagggac agtatttttt atcaaaggaa 3000 

tactattttt tcacactacg tcaacttggt tgctctgata ccccagagcc tgattggggg 3060 

cctcccggcc ctggctcacg ccaagtccct ggtgctgggt ttgctctccc gctgttgcca 3120 

ggggctggaa gctggagggg tctcttgggc catggacatc cccacttcca gcccatgtac 3180 

actagtggcc cacgaccaag gggtcttcat tticcatgaaa aagggactcc aagaggcagt 3240 

ggtggctgtg gcccccaact ttggtgctcc agggtgggcc agctgcttgt gggggcacct 3300 

gggaggtcaa aggtctccac cacatcaacc tattttgttt tacccttttt ctgtgcattg 3360 

tttttttttt tcctcctaaa aggaatatca cggttttttg aaacactcag tgggggacat 3420 

tttggtgaag atgcaatatt tttatgtcat gtgatgctct ttcctcactt gaccttggcc 3480 

gctttgtcct aacagtccac agtcctgccc cgacccaccc catccctttt ctctggcact 3540 

ccagtcccag gccttgggcc tgaactactg gaaaaggtrct ggcggctggg gaggagtgcc 3600 

agcaatagtt cataataaaa atctgttagc tctcaaagct aattttttac taaagttttt 3660 

atacagcctc aaattgtttt attaaaaaaa agatttaaaa tggtgatgct tacagcagtt 3720 

tgtacgagct cttaagtgtt gattccatgg aactgacggc tttgcttgtt ttgattcttt 3780 

tccccctact tttcctaatg gtttaaattc tggaattaca ctggggttct tttgcctttt 3840 

ttagcagaac atccgtccgt ccatctgcat ctctgtccca tgactcaggg gcgcccactc 3900 

tgcttcgatt ctcctcctgt ggaagaaacc attttgagca tgacttttct tgatgtctga 3960 

agcgttattt tgggtacttt ttagggagga atgcctttcg caataatgta tccattccct 4020 

gattgagggt gggtgggtgg acccaggctc cctttgcaca cagagcagct acttctaagc 4080 

catatcgact gttttgcaga ggatttgtgt gtgctgcctc aggaggggag ggctggtagg 4140 

agggggggag aggtctctgt cctactgctc tccagagggc atttcccctt gcgccttctc 4200 

ccacagggcc cagcccctct cccctgcccc agtccccagg gggtactctg gagtgagcag 4260 

tgcccctgtg ggggagcctg taaatgcggg ctcagtggac cactggtgac tgggctcatg 4320 

cctccaagtc agagtttccc tggtgcccca gagacaggag cacaagtggg atctgacctg 4380 

gtgagattat ttctgatgac ctcatcaaaa aataaacaat tcccaatgtt ccaggtgagg 4440 

gctttgaaag gccttccaaa cagctccgtc gcccctagca actccaccat tgggcactgc 4500 

catgcagaga cgtggctggc ccagaatggc ctgttgccat agcaactgga ggcgatgggg 4560 

.cagtgaacag aataacaaca gcaacaatgc ctttgcaggc agcctgctcc cctgagcgct 4620 

gggctggtga tggtcgttgg actctgtgag atggagagcc aatctcacat tcaagtgttc 4680 

accaaccact gatgtgtttt tatttccttc tatatgattt taagatgtgt tttctgcatt 4740 

ctgtaaagaa acatatcaaa ctaaataaaa gqagtgtctt tatt " 4784 



seq_prot.txt 

<210> 60 

<211> 4337 

<212> DNA 

<213> Homo Sapiens 

<400> 60 

cagcatgagg aaacttagct gtaagaaagg agacctggga ccatgtgggt gctgtgccca 60 

ccctcttgac agcatccctg ttgtgggagc agatgtggca gaccctgaaa gtcgggagga 120 

gaggaaaaga ggtgctctga ggaggctgat tttggctggg ggtgaagagg agctttcctg 180 

caattgcagc aaatttatgg agcgatggtc tctactgcca tggaaataac tagcagaggc 240 

tttcactctt ttggactctt aactacagga ggaactacta catttttctg taatcctgtg 300 

cacacttagg aaaagttgaa gtagctttcc caaaattact tatcctttga gattattgct 360 

tagatagttt ctatgttttg tttgtttctt aatgctgttg ggatccacta aattgatttc 420 

agaatctgct aatggtccag gctgcagttg gaaacccctg gtctcagcag agctgaactc 480 

cacccgtgga gatgttagga gccaggagga ttaggggtgg gtgcctctgg gcttccactc 540 

cagtctrtagg ggttcccttc ccctgccggg tcacgtgcag gttgtgaagg ttttacctcc 600 

tcagcagtac attaaatcct attatcccac ttaagctaat cttgcccttc ccccacccga 660 

gggcttcgcg tggacagaat attaagcgga caagcccgca gggtgcctag actaagtgct 720 

gagcttgtag tgaatgccca gccatgactt gcagtggatt gacctgtgca aaaattggac 780 

tctcaagctg ctagccccag tgtcagcacg agctcggccg agttcatttg gtcgcaggaa 840 

caagggggtc gccaggagca agagcggagg gggcgcctaa agggaggaga ccaaggccta 900 

atxaccatcc gccccctaac tccctaggcg catagcgggg acgcagcgtg tccccaaacc 960 

aacgccacta gcagagggtc ccagaggagc cggaaggggc cagggcctgg gcggcggtgg 1020 

gagggtctga gcagtggggg cggggagctg ggggtcggct ttcgtccgcg tcccagcgct 1080 

ccggcggccg ccggggctac tgatcccggg cttggcccct gcggcagcgc cccgagctgc 1140 

cgcccctcgc gctgccagcg cccgggaagg aggaaggggg aagggggcgg gccggccggg 1200 

ctcacggccg acttcctcct tcagccccgc cctccgccgc ttgcgggtga gctctgccca 1260 

agccgaggct gcggggccgg cgccggcggg aggactgcgg tgccccgcgg aggggctgag 1320 

tttgccaggg cccacttgac cctgtttccc acctcccgcc ccccaggtcc ggaggcgggg 1380 

gcccccgggg cgactcgggg gcggaccgcg gggcggagct gccgcccgtg agtccggccg 1440 

agccacctga gcccgagccg cgggacaccg tcgctcctgc tctccgaatg ctgcgcaccg 1500 

cgatgggcct gaggagctgg ctcgccgccc catggggcgc gctgccgcct cggccaccgc 1560 

tgctgctgct cctgctgctg ctgctcctgc tgcagccgcc gcctccgacc tgggcgctca 1620 

gcccccggat cagcctgcct ctgggtgagt gccggggacc cggggacgcg ggccggggga 1680 

aggaaggctg ccggggacgt gcctcgtgcg gaggtggccg tgacaaagga ccaaggaggg 1740 

gacctgtcgg aggcacggga tccattagga ccccttcttt rttcaaggcg ctgcgcggag 1800 

ggagggagag gagaggggag ctgtggggcc gctggcccct ttcttcagtt ttctttctgc 1860 

gtagggatgg ggtctagccc gctctgtcgt cccccctagt tcccttagga ggatgcactt 1920 

tcggcggggt ccgcctgccc atctacttgt ctttgatgct gaccagagcc gaccgcggtg 1980 

ccgcccccga gggcggctcg gggtagaggg aggccactgc ttgcagtcct ataaacactg 2040 

gggttctggt attcccagca agtcgcagca gggccgggag ctgtggtggg cgggcaggcg 2100 

cccgatcctc cctctxcgcc cctccatccc tggcttgcca gtgcagcccc aagccccagc 2160 

anggagagac ctccaggccc tgggagcgac cagactgcgg atgggggcag ctctgggcgc 2220 

tgacattccc aagtcctggg agcgaacaga ctgcggatgg gggcagctct gggcgctgac 2280 

attcccaagt cctgggcgcg acccctcctc agagtgacct gggagtagca tcctggttgg 2340 

ctgggttcct ttcctggtcc tcacatctgc ttctgtggcc acctcctgtt tcagccctcc 2400 

gcccttccct ctccttttcc cttccatccg gttgcttctc cagctcttct tggggcactt 2460 

gggctittaga ttcagagggt cctggatccc aagcccattt cttccactgg gcaagtaagt 2520 

gcctitctgct cccagtgccc cacctttctc atctgcaaaa taggcagtcg taggtgccrt 2580 

gcaaggtgtg ttactcgggt atgaagttta taagtgccca acagatgtat ccgcaagccc 2640 

tttaacttgt tggttgtcat attgtatgac ttggaataaa gcatgtatga attagaggtt 2700 

acagactgaa actgtcccag acttccatct cctcacagca tccccacctc taaaagcatc 2760 

cngt:t:t:acaa agggaaatag tagaatcagc acttaaatta aggcagggcc ctagcggctt 2820 

caagtaaccc tacctgaacc cctgtctgaa atactgcaga tgacctgggt tcaaatgcta 2880 

cctcagccac ttagtaacca tgtgaccttg agcaggctac ttaaagtttc tgggtcttga 2940 

tttcttcatc tgtaaaacag gtataataac agaacctacc tcaaggtagt gaggattaaa 3000 

tcagctcgtg tttgaaaagt cgttaggaca gggcctggta cgggaaccgc tgaatgagtg 3060 

ttactcagct cagagtttac ttctgtacat cttgcccaga tctggggtga ttctgcctgg 3120 

ggttaggaca gggttggggt gttagtttga ctgtagcccc ttgggtgctt: cagcttgtga 3180 

agtcaggatg aacaggtgta accagcccca ggcctgctgt cttgctactt ctgttgtggc 3240 

cacaactccc tttgccctca gaccctctca tttttgggag aaggcagggt gagatggggg 3300 

cagatgctcc aatggactcc acttatggcc ttgggaaagc tgggccagct catgggacca 3360 

tgtcaaaaga cggtaggttt catgtccaac tctggaactg gatgggacaa ncacagggta 3420 

gctagaaggc aaaactggag agcagcgagg ggaatxgagg ggacctgctt cccgaggagc 3480 

agctagaata cttggttatt attaccaatg cagctgcagg acctggtcca cttctacaag 3540 

cccgtgactg ttcccaggct tctgtcacgc accagcattc acctgtcagg gctgccgtag 3600 

gtgggcttgc attttgtgca ttgtaccaag gtgcccagct gagggttgag aggacctgaa 3660 
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atgcagtcca 
tgttcttggg 
aatagtgtag 
tgccactata 
ctactcttac 
gtaactgaac 
ggctcattag 

cgccgggccc 
actggatatt 
atgccccgta 
aatcccttga 
aattattgtt 



cactcagctc 
gctaggtctg 
tggcacctag 
ccctggagtg 
ctacgtgcag 
aggctcaccc 
atgttcctgg 

agggccctag 
acacacccct 
aggacccaga 
agtggagcgt 
gatgaac 



tccaacctaa 
cccggagtgg 
ctacctgaac 
ccccctctct 
gtcctagggc 
ttgagtgtgg 
cagtgtgccc 
tgccctccct 
ctttcctcct 
tgcctaggga 
ccagcttgcc 



seq_prot . 
gagtcctgga 
cccatttttc 
ttgtgagcac 
gggcttcctc 
tggctgggag 
cctaaggcgg 
atcctgqgca 
tgcatgcatc 
gccactcact 
cagtggggct 
ctgcatctgg 



txt 

gcaaaagacc 
cagtcccacc 
gtatctatag 
ctggccctca 
tccgagaggg 
tttcctgcct 
ggctcgtggt 
ttaggagaaa 
cctctactga 
atgtctgtgg 
ggtccttcat 



ccttctgcta 
aaagccctgg 
tcgactgagt 
tacccacctc 
gaatgccctg 
catcccaggg 
gaccatggga 
gagtcatgga 
attcccaccc 
tttaaaacaa 
agggaaccct 



<210> 61 
<211> 4388 
<212> DNA 

<213> Homo Sapiens 



<400> 61 

agccatgagt 

aaacatatta 

gcagttgggt 

tgtcagcatt 

ctggctgatt 

gtttctgttg 

ttctctctct 

aatattttgg 

ttggccccag 

acgtgaatca 

atagcatttt 

gtaggccctg 

tttcccctac 

acatatttct 

ccaccagtag 

gtagggaaag 

tccccccttg 

gacaggataa 

gtgtttggag 

agagtgctga 

gctgctgcca 

aggttcatcc 

gttcctttaa 

gcaggaagag 

aaggcattta 

gcgtcttgaa 

ttgtcttttt 

ataggatctc 

aaagtggctg 

aacacacggc 

atcctaaagt 

tcttccttcc 

cctttctgtg 

gcaagaacgt 

atgcgcccac 

ctgcgggctc 

gtttccctgc 

cattttaccc 

tgcatccaac 

gtggcacatc 

taacagctct 

agctgaactc 

agacccaggc 

gaataagcca 

ggtcaagggt 

gtggtacacc 

tccccetata 

ttttaacttg 



ggattagatg 
gctttgactc 
gtttttattt 
agcaagccta 
ttactggcca 
gaatcgtgtg 
ctctctgaca 
ttcctcacta 
ccatatgttg 
tcaggttgca 
aaatgaaaaa 
aatagctttg 
taaaaatgca 
tttaaaagaa 
tgtggtgggg 
gagaatgcca 
ttttcctgaa 
agcctgcttc 
gcgggagaat 
gaaaaacaga 
ctgggtagcg 
cagcaagtag 
gggcagcaca 
cttggagaca 
gatcgtattc 
ctgctttact 
agaagctttg 
atttgaggcc 
cagggagagt 
catgagttcc 
gggtctttaa 
ccttctaggg 
tcttcccacc 
ggtgcccctc 
tagccgtgga 
tacttcatcg 
cacggtctga 
tcatcatgca 
accaatggct 
ttctgtcttc 
ctttccccct 
agtctatggg 
ctggagagta 
cagcaaagaa 
ttattatagc 
ttaaagcttt 
ggaatacaag 
atacactgca 



ccaaaatccc 
tgagctggga 
gttgaaagga 
attagttaat 
cgtctggcag 
aaaagctgga 
cacacacaca 
agccaatcta 
ctactcagag 
caaggaacac 
aaaaaaaagt 
atgttgtgtt 
aatttttcta 
atttcaaact 
cagcagagtt 
tttgcctaca 
cgtgaactga 
tctaggaatt 
aaacacaaat 
acggtgtgaa 
gtaggccttt 
gaacagcaag 
aaaatcagtg 
tggatggaag 
tgagttaaat 
catttaaaat 
gcgatcctat 
acggaggtcc 
gtgagggtgg 
agattagggc 
acaggaagaa 
atttcagcac 
tacagtaaca 
tatgacctgc 

ggggcatccg 
cattccttgc 
gagctccctg 
ccactttagc 
ttctagatga 
tgttgggaac 
tgctatgtta 
ttggggctca 
gacattttgc 
tttaaagtgg 
accctcttgt 
tatatgactg 
gggcacacag 
gattcagatg 



tggctgagag 

atttcggtga 

attgctgaat 

actaagtaaa 

agggcagcag 

gaggttgctt 

catacacaca 

aatcgcagaa 

aatttaatat 

agtggcagat 

aacacttaaa 

ttcatggcaa 

agaccttctc 

agataatagt 

gtggctagtg 

tttccctctg 

gctctgggca 

cgcactgagg 

aaataaaaag 

agaggaaggc 

ccgaggaggc 

tgtaggtccc 

tggctccgga 

ctggaccagt 

gggaagtgac 

acccactcct 

ttgaatgcat 

atggaagtca 

gaccgccctg 

ttctgaaagc 

agaaagattg 

tcctggggct 

aaggcatgga 

tgctggagat 

tggaggagac 

aaaagtatta 

gctcccacac 

caaattctgt 

gtggccattc 

agccaaaggg 

ctaagcgtga 

gataactctg 

ctctgataag 

ctcctttaat 

attcctatgg 



ggaaggcaga 
tgctgaaagc 



aataacctta 
tgttgtagat 
tttcaaatcc 
tttgcactaa 
cagggagaaa 
ctctttctct 
cacacaattg 
gtctattttg 
cagatttcat 
ccagggggca 
caagtaattt 
gtctccaact 
ttggtcctgc 
tgttctcctc 
gaggagagca 
cccatttccc 
ctgttttagg 
gtgtgagtgt 
gagaatttca 
tgagcctgca 
ggcatttgaa 
ctaagtcttg 
gagcacatta 

tgggccttgt 

gtgagagatt 
gcttggctga 
ttaggtccta 
cctgcatagc 
gtaggaggtg 
cctcagcttt 
ctaagtgtct 
cgggttggct 
gcatctgtac 
gctggacgcc 
ggaccaaagc 
catcacgggg 
ggttcagata 
ctcctgcata 
atttgcttgc 
attccaaggc 
ggattcccgt 
tgcatttaag 
cactttttaa 
tggtgacttg 
caatgcatcc 
tctggtgatt 
tcccctagtt 
tctgcctctg 



ccctggagga 

tcaatgcatt 

attaactgct 

atatacaacc 

gctctgtaga 

ctctctctct 

taataataat 

ttaagtaagc 

ctgactgtaa 

tttaactttt 

agatcatgct 

tgagctgaat 

tactcatgat 

cccacccccg 

gaggaggaga 

gctgcccatt 

cctagcaggg 

gtgcacgtgt 

ggcagtgata 

gaggcttgag 

gaccggagga 

ggggagctta 

ggggagagag 

tgaacatgga 

taacaatgga 

atatctcatg 

ttggagggga 

aaataccctg 

gaaaatgaaa 

cccagctccc 

ttggagttcc 

ctaaagtagt 

agcatgaagt 

caccgcctac 

cacttggcca 

gaggcagagg 

atccctgctg 

cactccggca 

tcagttctta 

taaatctttg 

agctcttcac 

ctacttgtag 

atggctctaa 

gagaaagcta 

ttttatgaaa 

gtcaattcat 

ggcaagacta 

gctttccggt 



3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4337 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
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catgggttcc agttaattca tgcctcccat ggacctatgg agagcagcaa gttgatctta 2940 

gttaagtctc cctatatgag ggataagttc ctgatttttg tttttatttt tgtgttacaa 3000 

aagaaagccc tccctccctg aacttgcagt aaggtcagct tcaggacctg ttccagtggg 3060 

cactgtactt ggatcttccc ggcgtgtgtg tgccttacac aggggtgaac tgttcactgt 3120 

ggtgatgcat gatgagggta aatggtagtt gaaaggagca ggggccctgg tgttgcattt 3180 

agccctgggg catggagctg aacagtactt gtgcaggatt gttgtggcta ctagagaaca 3240 

agagggaaag tagggcagaa actggataca gttctgaggc acagccagac ttgctcaggg 3300 

tggccctgcc acaggctgca gctacctagg aacattcctt gcagaccccg cattgccctt 3360 

tgggggtgcc ctgggatccc tggggtagtc cagctcttct tcatttccca gcgtggccct 3420 

ggttggaaga agcagctgtc acagctgctg tagacagctg tgttcctaca attggcccag 3480 

caccctgggg cacgggagaa gggtggggac cgttgctgtc actactcagg ctgactgggg 3540 

cctggtcaga ttacgtatgc ccttggtggt ttagagataa tccaaaatca gggtttggtt 3600 

tggggaagaa aatcctcccc cttcctcccc cgccccgttc cctaccgcct ccactcctgc 3660 

cagctcattt ccttcaattt cctttgacct ataggctaaa aaagaaaggc tcantccagc 3720 

cacagggcag ccttccctgg gcctttgctt ctctagcaca attatgggtt acttcctttt 3780 

tcttaacaaa aaagaatgtt tgatttcctc tgggtgacct tattgtctgt aattgaaacc 3840 

ctattgagag gtgatgtctg tgttagccaa tgacccaggt gagctgctcg ggcttctctt 3900 

ggtatgtctt gtttggaaaa gtggatttca ttcatttctg attgtccagt taagtgatca 3960 

ccaaaggact gagaatctgg gagggcaaaa aaaaaaaaaa agtttttatg tgcacttaaa 4020 

tttggggaca attttatgta rctgtgttaa ggatatgttt aagaacataa ttcttttgtt 4080 

gctgtttgtt taagaagcac cttagtttgt ttaagaagca ccttatatag tataatatat 4140 

attrttrtga aattacattg cttgtttatc agacaattga atgtagtaat tctgttctgg 4200 

atttaatttg actgggttaa catgcaaaaa ccaaggaaaa atatttagtt tttttttttt 4260 

tttttgtata cttttcaagc taccttgtca tgtatacagt catttatgcc taaagcctgg 4320 

tgattattca tttaaatgaa gatcacattt catatcaact tttgtatcca cagnagacaa 4380 

aatagcac 4388 

<210> 62 

<211> 18 

<212> DNA 

<213> Homo Sapiens 

<400> 62 

cagggaagct ggaatgag 18 

<210> 63 

<211> 18 

<212> DNA 

<213> Homo Sapiens 



18 



17 



<400> 63 

tccctcccag agagtgcc 

<210> 64 

<211> 17 

<212> DNA 

<213> Homo Sapiens 

<400> 64 

ccagctccac caccagc 

<210> 65 

<211> 18 

<212> DNA 

<213> Homo Sapiens 

<400> 65 

L I* I 

agggctctag gccaggtc IS 

<210> 66 

<211> 20 

<212> DNA 

<213> Homo Sapiens 
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<400> 66 

agaacccatc gtataaaaag 

<210> 67 
<211> 16 
<212> DNA 

<213> Homo Sapiens 
<400> 67 

ggacgtcgat ggtatt 

<210> 68 
<211> 16 
<212> DNA 

<213> Homo sapiens 
<400> 68 

aacggtgtcg tcgaaa 

<210> 69 
<211> 16 

<212> DNA 

<213> Homo Sapiens 
<400> 69 

cacgcagttg cgcgct 

<210> 70 
<211> 16 
<212> DNA 

<213> Homo Sapiens 
<400> 70 

ttttgtgcgc acggac 

<210> 71 
<211> 16 
<212> DNA 

<213> Homo Sapiens 
<400> 71 

ttaagcgggc gctgat 

<210> 72 
<211> 16 
<212> DNA 

<213> Homo Sapiens 
<400> 72 

tagaggcgcg ggttac 

<210> 73 
<211> 17 
<212> DNA 

<213> Homo Sapiens 
<400> 73 

cttcgactcg gctcaga 
<210> 74 



seq__prot.txt 
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<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 74 

caatggggcg ccgact 

<210> 75 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 75 

caccgcagcg gccagg 

<210> 76 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 76 

aggaacgcca gccgtt 

<210> 77 

<211> 17 

<212> DNA 

<213> Homo Sapiens 

<400> 77 

ggccgccgca ccatgga 

<210> 78 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 78 

acagtaaacg cgagga 

<210> 79 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 79 

ccccgtggcg gagaac 

<210> 80 
<211> 16 
<212> DNA 

<213> Homo sapiens 
<400> 80 

cacggacacc ccggcc 

<2l6V 81 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 81 



seq_prot.txt 



16 



16 



16 



17 



16 



16 



16 
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ccacgactcg 

<210> 82 
<211> 16 
<212> DNA 
<213> Homo 

<400> 82 

gaagccaccg 

<210> 83 

<211> 16 

<212> DNA 

<213> Homo 

<400> 83 

gaccgatgcg 

<210> 84 

<211> 16 

<212> DNA 

<213> Homo 

<400> 84 

gtccggcggg 

<210> 85 

<211> 17 

<212> DNA 

<213> Homo 

<400> 85 

agccgattcc 

<210> 86 

<211> 16 

<212> DNA 

<213> Homo 

<400> 86 

ggaagagccg 

<210> 87 
<211> 20 
<212> DNA 
<213> Homo 

<400> 87 

acatacagta 

<210> 88 
<211> 19 
<212> DNA 
<213> Homo 

<400> 88 

agaatgagta 

<210> 89 
<211> 18 



caggcc 

Sapiens 
cgctgg 

Sapiens 
gtccat 

sapi ens 
aggaga 

Sapiens 
cgcccag 

Sapiens 
cgggcc 

Sapiens 
attcttgagg 

Sapiens 

aaagcctgt 

. ■ 

"Sefte SB 



seq_prot.txt 

<212> DNA 

<213> Homo Sapiens 

<400> 89 

aaatatttta ctcctcaa 

<210> 90 

<211> 18 

<212> DNA 

<213> Homo Sapiens 

<400> 90 

aaagtccccc aaaattat 

<210> 91 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 91 

agcccccaaa tgacct 

<210> 92 
<211> 16 

<212> DNA 

<213> Homo Sapiens 
<400> 92 

ccaccactat gcgcag 

<210> 93 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 93 

acgctgcaca tccagg 

<210> 94 
<211> 16 
<212> DNA 

<213> Homo sapiens \ 
<400> 94 

ggctcgccgg gtgatg 

<210> 95! L 
<211> 17 ; 
<212> DNA 

<213> Homo sapiens 

<400> 95 i ■ 

gacagcatcc ttgggca 

<210> 96 j 
<211> 17 

<212> DNA J v ; 

<213> Homo sapiens • " v ^ : -j 

<400> 1 96 J * 1 ' * * * . 1 1 j *-■ ■ ' \ ' 
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gcaaggaggg tggcrtc 

<210> 97 
<211> 16 
<212> DNA 

<213> Homo sapiens 
<400> 97 

aaaccaaacc ttgata 

<210> 98 
<211> 16 

<212> DNA 

<213> Homo Sapiens 
<400> 98 

tcttgagcac atggga 

<210> 99 
<211> 16 
<212> DNA 

<213> Homo Sapiens 
<400> 99 

agctggacag tcgcca 

<210> 100 
<211> 16 
<212> DNA 

<213> Homo sapiens 

<400> 100 

ggaggaatcc tgcatt 

<210> 101 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 101 

gtggaaggaa gaaagc 

<210> 102 
<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 102 

gacacgatta catagc 

<210> 103 
<211> 16 
<212> DNA 

<213> Homo Sapiens 
<400> 103 

r 

gcggggctgg cgtaac 

<210> 104 
<211> 16 

<212> DNA 
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<213> Homo Sapiens 

<400> 104 

cgcctactcc ggggct 

<210> 105 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 105 

ccataggcgc cctccc 

<210> 106 

<211> 17 

<212> DNA 

<213> Homo Sapiens 

<400> 106 

taggctgtag ggggatg 

<210> 107 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 107 

ctctagggat ttacag 

<210> 108 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 108 

tggaagcaac agcagcatct 

<210> 109 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 109 

accaatcgcc gctcgg 

<210> 110 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 110 
caatcgtcag cggcgg 

<210> 111 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 111 

gtcgtcctcg cgagga , <« . r » 
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<210> 112 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 112 

caatgagcgc gctgta 

<210> 113 
<211> 16 
<212> DNA 

<213> Homo Sapiens 
<400> 113 



gccacagcca gcaacg 

<210> 114 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 114 

ggccctggcc acgggc 

<210> 115 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 115 

ctgtcctcct taacag 

<210> 116 
<211> 18 
<212> DNA 

<213> Homo Sapiens 
<400> 116 

gccccaaccc aacctgtc 

<210> 117 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 117 

aaaacgtgga cgtttt 

<210> 118 
<211> 17 
<212> DNA 

<213> Homo Sapiens 

. » ■ 

<400> 118 

tgaaagtcgg ccaaagc 

<210> 119 
<211> 16 
<212> DNA 

<213> Homo Sapiens 
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<400> 119 
ctggacgtcg aggaga 

<210> 120 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 120 
ctctcgggcg gagaga 

<210> 121 

<211> 17 

<212> DNA 

<213> Homo Sapiens 

<400> 121 
ttcggccagc cccgcat 

<210> 122 
<211> 17 
<212> DNA 

<213> Homo sapiens 
<400> 122 
cccgggccac cacgctt 

<210> 123 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 123 
cccagtcgcg cagcgc 

<210> 124 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 124 
aacccgcgag cttaga 

<210> 125 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 125 

tgtgagaacg gctgca 

<210> 126 

<211> 16 

<212> DNA 

<213> Homo sapiens 

I <400> 126 1 

..... . » i t 

ccaggcgtcc cggcgc 
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<210> 127 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 127 

caggcctgcg cgaaga 

<210> 128 
<211> 17 
<212> DNA 

<213> Homo Sapiens 
<400> 128 

ggcagccggg ctggcac 

<210> 129 
<211> 16 
<212> DNA 

<213> Homo sapiens 

<400> 129 

cagggctgga agccgc 

<210> 130 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 130 

agtggaagct ctaggt 

<210> 131 
<211> 18 

<212> DNA 

<213> Homo sapiens 
<400> 131 

ccggagccac ttcccgat 

<210> 132 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 132 

cttaaaccga tggcct 

<210> 133 
<211> 16 
<212> DNA 

<213> Homo Sapiens 
<400> 133 

taggtgctgg agccac ; 

<210> 134 i 
<211> 16 
<212> DNA 

<213> Homo Sapiens 1 
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17 



16 



16 



18 



16 



a 

16 

i 

" - ■ ' > ' \ 

It 
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<400> 134 
agacagaatg gaggtg 

<210> 135 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 135 
ggcgctcccc attccc 

<210> 136 

<211> 17 

<212> DNA 

<213> Homo sapiens 

<400> 136 

acttcgagga tcacgtc 

<210> 137 

<211> 17 

<212> DNA 

<213> Homo Sapiens 

<400> 137 

agatggcgct ccccgca 

<210> 138 

<211> 18 

<212> DNA 

<213> Homo Sapiens 

<400> 138 

accctcgaac cgactcct 

<210> 139 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 139 
agcccgacgg tctcag 

<210> 140 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 140 

agcccacg-tg accgag 

<210> 141 1 1 

*211> 16 

<212> DNA 

<213> Homo Sapi ens 

<400> 141 : 

ctacgagaag cgggta 

<210> 142 
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1 ■ ' < 

t ! 

j 

. > 

* a 

• » ; . s * • s f ,! ■ 

* ■ J' .« ! ''.*»' i * 1 
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<211> 16 
<212> DNA 

<213> Homo Sapiens 
<400> 142 

agggggcgac tcccgg 

<210> 143 
<2U> 16 
<212> DNA 

<213> Homo Sapiens 
<400> 143 

ttagcgccgc ttgacc 

<210> 144 
<211> 17 
<212> DNA 

<213> Homo Sapiens 
<400> 144 

agcccgcctc attgcgc 

<210> 145 
<211> 17 
<212> DNA 

<213> Homo sapiens 
<400> 145 

gcccacagaa agacacc 

<210> 146 
<211> 16 
<212> DNA 

<213> Homo sapiens 

<400> 146 

tgtgtgtgtg attgcc 

<210> 147 
<211> 17 
<212> DNA 

<213> Homo Sapiens 
<400> 147 

tctgtataaa atagttt 

<210> 148 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 148 

gttctgtact gtgact: 

<210> 149 
<211> 17 
<212> DNA 

<213> Homo Sapiens 
<400> 149 

Seite 94 



seq_prot.txt 

tgtagtcctc cccaggg 

<210> 150 
<211> 20 
<212> DNA 
<213> Homo Sapiens 

<400> 150 

ctcttcctat atgtataccc 

<210> 151 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 151 

agaggaagaa actgag 

<210> 152 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 152 

agagaatctc agaagg 

<210> 153 
<211> 16 
<212> DNA 
<213> Homo Sapiens 

<400> 153 

agccgggaga gcgaaa 

<210> 154 
<211> 17 
<212> DNA 
<213> Homo Sapiens 

<400> 154 

aagagtcggg agccgga 

<210> 155 
<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 155 

ggccgaagag tcggga 

<210> 156 
<211> 16 
<212> DNA 
<213> Homo Sapiens 

<400> 156 , ! 

• . -■ ' * 

atfgccagcgg gccgaa ']' 

<2io> 157 • • 

<211> 17 ' • ! ' ' ■ "' : '"• ' ' 

Seite 95 



seq_prot:.t:xt 

<212> DNA 

<213> Homo sapiens 

<400> 157 

gagcggtagg tgtcgaa 

<210> 158 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 158 

agtccgcagt cccgag 

<210> 159 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 159 

cagggcgcgg agcaga 

<210> 160 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 160 

catccgcggg cggctc 

<210> 161 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 161 

attcggcggg agatca 

<210> 162 
<211> 17 
<212> DNA 

<213> Homo Sapiens 
<400> 162 

agatcagccg aaagagc 

<210> 163 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 163 

tacatcccgg ggncccaa 

<210> 164 

<211> 17 . 

<212> DNA 

<213> Homo Sapiens 

<400> 164 

- t * 

► * 
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actcctctca taaaata 

<210> 165 
<2U> 16 
<212> DNA 
<213> Homo Sapiens 

<400> 165 
gcagatatta ggtgaa 

<210> 166 
<211> 17 
<212> DNA 

<213> Homo Sapiens 

<400> 166 

caaggaaacc ctaaatc 

<210> 167 
<211> 16 
<212> DNA 

<213> Homo Sapiens 
<400> 167 

atagcccrag gtcttc 

<210> 168 
<211> 17 
<212> DNA 
<213> Homo Sapiens 

<400> 168 

ccccgggccc aagacga 

<210> 169 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 169 
aggggcctga aggtgg 

<210> 170 
<211> 17 
<212> DNA 
<213> Homo Sapiens 

<400> 170 

atttggaggt tccggtt 
<210> 171 

<2ii> 16: 

<212> DNA 
<213> Homo Sapiens 

<400> 171 .: 

ctccgagatt tttact 

, . m 

<210>, 172. ;i 

:<2li> 1,5;. tV \- ; ( 
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<213> Homo sapiens 

<400> 172 

tgaaacaaca gtccta 

<210> 173 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 173 

cccctgcagg gccccc 

<210> 174 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 174 

gcagaagaga tatcut 

<210> 175 
<211> 18 
<212> DNA 

<213> Homo Sapiens 
<400> 175 

accctacacc gacaccgg 

<210> 176 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 176 

tttgagatct tgacta 

<210> 177 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 177 

tattgtgaca tcatcg 

<210> 178 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 178 

tccctcactg tgtgcg 

<210> 179 
<211> 16 
<212> DNA 

<213> Homo Sapiens 
<400> 179 

tg dat ag ctc agaacc _ 
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16 



16 



18 



16 



16 



16 



16 
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<210> 180 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 180 
cctcttcctc aggttt 

<210> 181 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 181 



ccccagccca cccacc 

<210> 182 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 182 



gaacacacct tggggc 

<210> 183 

<211> 17 

<212> DNA 

<213> Homo Sapiens 

<400> 183 

gcactcagcc gtatagt 

<210> 184 

<211> 16 

<212> DNA 

<213> Homo Sapiens 

<400> 184 
catttgtcgc gttgat 

<210> 185 

<211> 16 

<212> DNA 

<213> Homo sapiens 

<400> 185 

tgcaattcgg ggactc 

t » i 

<210> 186 
'<211> 16 
<212> DNA 

<213> Homo sapiens : 

<i400> 186 , t- 

aggaagcacg gagaat 

<23i0> 187- :Jf. • : ■•:■.•* !•.;'* 

<211> '17 ' * "* :" : : . i i ■ ■ f . T.; v. 

<212> DNA " - ,f i\ ! ...» .-i •» 

<:213> Homo Sapiens ! : 1 ; f * j r : fr • 
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<400> 187 
tccgctggag 

<210> 188 
<211> 16 
<212> DNA 
<213> Homo 

<400> 188 

ttgcggaagc 

<210> 189 

<211> 16 

<212> DNA 

<213> Homo 

<400> 189 

caaactcgac 

<210> 190 
<211> 16 
<212> DNA 
<213> Homo 

<400> 190 

ttctcgcccg 

<210> 191 
<211> 16 
<212> DNA 
<213> Homo 

<400> 191 
cccgcgtcca 

<210> 192 
<211> 16 
<212> DNA 
<213> Homo 

<400> 192 

cttcgcggcc 

<210> 193 

<211> 19 

<212> DNA 

<213> Homo 

<400> 193 

acagtctcgc 

<210> 194 
<211> 16 
<212> DNA 
<213> Homo 

<400> 194 

cccagtacgg 



atcgcgc 

Sapiens 
acgcgg 

sapiens 
gggccc 

Sapi ens 
gcggag 

sapiens 
tcgtgt 

sapiens 
cgcagt 

sapiens 
catctgcat 

Sapiens 
ggtgca 
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<210> 195 
<211> 18 
<212> DNA 

<213> Homo sapiens 
<400> 195 

ggcgtcacag ccacgccc 

<210> 196 
<2U> 17 

<212> DNA 

<213> Homo Sapiens 
<400> 196 

tgcccatcga aggcaga 

<210> 197 
<211> 16 
<212> DNA 

<213> Homo Sapiens 
<400> 197 



gcgggagcgc ctgggg 

<210> 198 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 198 

tgggcccgga ctrctgg 

<210> 199 
<211> 16 
<212> DNA 

<213> Homo Sapiens 
<400> 199 



aagggagaac ggaaaa 

<210> 200 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 200 

gtctttttaa agaact 

<210> 201 
<211> 16 
<212> DNA 

<213> Homo Sapiens 

<400> 201 - ■ i- 

5 '; " . . 

■ J 

ctgcccgaag atcgcc 

<210> 202 ' 
<211> 17 

<212> DNA ~{i 

<213> Homo Sapiens ^' ^ 
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<400> 202 

cagcgcaagg acccggt 17 

<210> 203 

<211> 18 

<212> DNA 

<213> Homo sapiens 



18 



18 



16 



<400> 203 

agagtccggc ttcccgca 

<210> 204 
<211> 18 
<212> DNA 

<213> Homo Sapiens 
<400> 204 

actcgtacct gcgggcta 

<210> 205 
<211> 16 
<212> DNA 
<213> Homo Sapiens 

<400> 205 

aactcgaccg cgggca 

<210> 206 
<211> 4200 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 206 

gaaaatgagg gttggttttg aagttggata gatttgtttg atttgtttgt ttttagtttg 60 

ggttaattat attgttttag taagttattt atattgtttt tatatttttg aaatgaggta 120 

tatgtttgtt ttatttggga atgcgaggat taaggagaat aatatatata taatgttgaa 180 

tatttacgtt ttttaattat tttgaagttt tagaaatatt tttagttttt aggtgagttg 240 

tgattaaatt cgtttattaa tttaatatat atttattgaa tgtttatttt gtgtcgtagt 300 

ttttggtagg tgtgtttgat agtgggtgtg taatatgttt agggtttgta ttttatgagc 360 

gtggggattt tttttatttt ttggtatata tggttttggg ggagaagtta aggggaaggg 420 

ttaggagttt atatggtaga tttagtaagt ttttagtata ataattttaa ttgtaaaaat 480 

aaatagtttt gttaattgtt tgagagtagc gtttgtttta taaaaattaa taataataaa 540 

aggaaaaaaa atttaaagta aaacgttata tttaatggtt gttggataag atattgttgt 600 

ataaagtttc gcgcgagatc gttttggcgt tgttttagtt tgtttttggg ggttatagag 660 

gaaggcgtgg ggcgtgtgtt agttataaaa gattgttttg aattttaggg tagtgtttat 720 

ttattgggtg gttttagaag atttttttta aagcgcggtt gataagggtg atagcgtttg 780 

gatcgcggga attgttcgcg gagttggtgt tgaatagggc gtgtgcggcg ggcgttttaa 840 

ggaaattgga agcgggatcg gaggtcggtt ttcgggcgtg cgaggaggag ttggaagaag 900 

agcggggagg ggaacggttc ggaattcgtg cgcgtcgaac ggcggcgttt taatttataa 960 

atttattttt tgtttagaga gaagcgaagg agaggtttag cgagtaaaag tcgaggtttt 1020 

tgagtcgttt ttcggcgcgc gttttttgtt tgaggtgtat tttgttgtgt cgtttttttt 1080 

tatcgttgcg cgcgaagcgg cggtcgtata cgtcgtaagt aaaaggtttt tcgtcggtgt 1140 

gggtgcgtac gtgcgtggtg aggtggtcgt tgcggttgaa gttgcggagg tagatgcggt 1200 

attggaaggg tttgtggttc gtgtggatgc gtaggtggcg attgagttcg tcggagcgcg 1260 

taaagtttcg tatataattt tttatcgggt aagcgaaggt tttggcgtgc ggtcgcgggt 1320 

agaagtagcg cgtgttgtat ttgtcgtcgc ggcgtttttt gcgtcgcgtt ttggtttggg 1380 

ggaaaggggt gggcggcggc ggcggtacgg gtggtgtagt tacgttattg tttttaggga 1440 

tgttcgttat tagaggttta gggaagttcg tcgcggcggc gttgcgaagg tttagcgggg 1500 

aaagttgagg ttgcgtattg gttagaaatt tttcgtcgtc gtcgttattt tttttttttt 1560 

tattaggagg ggttaggagt tta ggga ggt ttttagtttt tttttttaag ttattcgggg ^ 620 
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ttagcgggaa agcgttatag gtttcgttgg gatagagttt gttggttggg acggtcggta 1680 

gtttcgtagg gtagttgatg gatagtaagt ttttaatttt ggtttttatt acgggaaaac 1740 

gagttttcgg ggcggtttgg tagttttttt gtgatttata gttttttggg gtttttatag 1800 

aaagtagttt ttagggcgcg tagggatttt tgaaggtaga gatagcgttt agcgttggcg 1860 

aggcgggagg cgttcggagg tcgggtttga cgtcgggcgg ggagagttga ggtttatata 1920 

ggtattgcga gggggtattc gcgtaaggcg aggtttttta gaacgttttt gggaaagggg 1980 

tagcgtttag attcggggag taaaggttcg gcggattcgg tagtaaggta tcggatttcg 2040 

taggaaaagg ggtatttagc ggggatttgg acgttgttgt ttttggatcg gggaagggtg 2100 

ttaggtttaa gatgttcgat atgaggttga agagtgtttt cgggtcgtgc gggtgttcgg 2160 

gtattgtttg aatgaagaag ttatcgttgt agttgaggtc gggagggggt gtgggcgtag 2220 

gtttttttag gaagtaggag tcggttaagt ttttatttgc gtcgtagttg tttaaagttt 2280 

agtttaagaa gtcgtttgtt gaggagggag taggaattag ttttgggtat attaaagggt 2340 

gtatttgagt ttatagtttt tatttataaa atttttggtg tttacggata tatataaagt 2400 

gttttttgta tattttgatt tagtaggggt tcgtatacgt tttaaggagt atatagaaat 2460 

atgtatacgt aaaatatacg tgtataggta aaagggcgtt tcgataatcg tataggtttt 2520 

tgcggaggcg ttggcggttt agtgtgggtg ggaatggggg tgcgtatttt aggattttta 2580 

agttttttat cgtttttatt tttatagttt taatgtttta gttttttttg gtttttaggt 2640 

atggtgcgtt ttcgcgttgc gtcgtcgtcg tttgagtatt tttgttcgtt ttttttattt 2700 

ttagggtagt cggtggtcgc gggagcgttt ttggtaggta gtcggggtaa tttagcgttg 2760 

ggttcggcgt aatagttttt agtggatttg acgaggagcg cgtcgggttc ggaaaattcg 2820 

ttaaggtgga gtatggcgcg gcgtcggttg tggggcgttc ggggtttcgt tcgttgggtt 2880 

tgggggcgcg cgggtggcgg ggaggttggc ggtaggggtt tttcgtagcg tatagattta 2940 

ggcgttcggg tttttggttt cgggtattta tttttgggaa aggagtcggg gagtcgcggc 3000 

gttttcgttc gttcgtatcg gtttcgggcg gcggtttttc gtttttttaa agcgttgtcg 3060 

ggttttttta gtttatagtt tttttattta tatagttgcg gcggcgttgg tttcgggttt 3120 

tcgatttttc gggttattgt tattttggga ggtttcggtt ttgtcgtcgt gacgtaaatg 3180 

tttaaatatg gatataggat gtgtgtcggg gattttcgaa aaggaaagtt cgagttgtga 3240 

cgttgttgtt gtcgtcgtcg tcgttaggtt cgcgtcgttg cgtgcgcgcg cgttttttag 3300 

gcgtggagcg ttggggtcgc ggatgcgtta gtttggggcg ttgcgtttta tcgggtcgat 3360 

cgttcggtag atgcgttttt tatttggagg tagaggtcga ggggtagtag gggaggatcg 3420 

tgttgggttt gaatttgttt tttgttatta attagtttga tatttagttt tttttaacgt 3480 

tagtttttta tttgtgtatt ttttgtggag agaaattgag agtatataaa tgtttaggcg 3540 

tatttagtaa attatagaga ttattgttat ttattttttt tgtttacgtt tgtgtgtttt 3600 

tttattttaa gttttttttt tttaaatttg tgttaggaaa aataatttaa attaaaaagg 3660 

attgtaatat ttgttaatag taggaggaaa gttaattagt ataatagggt atattttaaa 3720 

ttgaaaagta aaagtttttt ttttttattt atttttattt ttagtttttg aggtagttat 3780 

atttaagttt tttgggtatt tttttagaag aattttatgt atatttaatt atatatattg 3840 

ttttgtaata ttattattat tattattttt ttatttgaaa taatttagag attgtgttat 3900 

gttttgatat aaagattttg ttattttttt ttaaggttgt atagtatttt attgtaaggg 3960 

aggattttga gttagtcgat ttgtttttta tttatgagta tgtaggtagt ttttaatttt 4020 

ttatttatat taatgttgtt ggagttaata tttttgatta tataaatgtg aatgtgtttg 4080 

tgagataaat tttaacgagt gtaattgtgt gttaaagtat atgtaaaatt ttgatagata 4140 

attggtaaat tgttttttag aaaagtaggg ttaatttttg tttatttttt ttttatttat 4200 

<210> 207 
<211> 4200 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 207 

gtgggtgagg gaggagtgag taggaattgg ttttattttt ttggaagata atttgttagt 60 

tgtttattaa aattttatat atattttggt atataattat attcgttgga atttatttta 120 

tagatatatt tatatttata tagttaagga tgttgatttt aataatattg gtataagtaa 180 

aaagttggaa attgtttata tgtttatgga taggggataa atcgattaat ttaaggtttt 240 

tttttataat ggagtattat atagttttga aaaagaatga taagattttt atgttaggat 300 

atggtataat ttttaagtta ttttaagtga aaaaataata ataataataa tattgtagaa 360 

taatgtgtat ggttggatat atataaaatt tttttgaaag aatatttaag aaatttaaat 420 

gtggttattt taggagttag gaatgaaggt aggtggagag gagggatttt tattttttaa 480 

tttagaatat attttgttgt gttagttgat ttttttttta ttgttaataa gtattatagt 540 

tttttttaat ttaaattatt ttttttagta taaatttaaa aaaaaaaaat ttaaaatgga 600 

aaaatatata ggcgtgagta aaagaaatga atggtaataa tttttgtggt ttattgagtg 1 660 

cgtttggata tttatgtgtt tttagttttt ttttatagga aatgtatagg tgagaaattg 720 

acgttaaggg ggattgagtg ttaagttagt tagtggtaga gggtagattt aaatttaata 780 

cggttttttt ttgttgtttt tcggtttttg tttttaggtg ggaagcgtat ttatcggacg 840 
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gtcggttcgg tgaggcgtag cgttttagat tggcgtattc gcggttttag cgttttacgt 900 

ttggggagcg cgcgcgtacg tagcggcgcg agtttggcgg cggcggcgat aataataacg 960 

ttatagttcg agtttttttt ttcgggagtt ttcggtatat attttgtgtt tatgtttggg 1020 

tatttacgtt acggcggtag ggtcggggtt ttttaaaatg gtagtggttc ggggagtcgg 1080 

aagttcggag ttagcgtcgt cgtagttata taagtggggg ggttgtgggt tgggggagtt 1140 

cggtagcgtt ttggagaggc gaggagtcgt cgttcgaggt cggtgcgggc gagcgagggc 1200 

gtcgcggttt ttcgattttt tttttagagg tgagtgttcg aagttaggag ttcgggcgtt 1260 

taggtttgtg cgttgcgggg aatttttatc gttagttttt tcgttattcg cgcgttttta 1320 

agtttagcgg gcgaggtttc gggcgtttta tagtcggcgt cgcgttatgt tttattttag 1380 

cgagtttttc gaattcgacg cgtttttcgt taagtttatt gaaggttgtt gcgtcgaatt 1440 

tagcgttgaa ttgtttxggt tgtttgttag ggacgttttc gcggttatcg gttattttgg 1500 

aggtaaggag ggcgagtagg ggtgtttaga cgacgacggc gtagcgcggg ggcgtattat 1560 

atttgagaat taggagggat tgggatattg gagttatgag aataggggcg atgggaagtt 1620 

taggagtttt ggggtgcgta tttttatttt tatttatatt gggtcgttag cgttttcgta 1680 

ggaatttgtg cggttatcgg agcgtttttt tgtttgtgta cgtgtgtttt gcgtgtgtat 1740 

gtttttatgt gttttttggg acgtatgcgg gtttttgttg aattagaatg tgtaaaaggt 1800 

attttgtgta tattcgtggg tattaagagt tttgtaggta ggggttgtgg atttaggtgt 1860 

attttttgat gtgtttagag ttgatttttg tttttttttt agtaggcgat tttttgagtt 1920 

gggttttgaa tagttgcggc gtaagtgggg atttagtcga tttttgtttt ttggaggggt 1980 

ttgcgtttat attttttttc ggttttagtt atagcggtag tttttttatt taggtagtgt 2040 

tcgaatattc gtacgattcg gaggtatttt ttaattttat gtcgggtatt ttaggtttgg 2100 

tatttttttt cggtttagag gtagtagcgt ttagattttc gttggatgtt tttttttttg 2160 

cggggttcga tgttttgttg tcgggttcgt cggattttta tttttcggat ttgggcgttg 2220 

tttttttttt agaggcgttt tgggaggttt cgttttgcgc gggtgttttt tcgtagtgtt 2280 

tgtatgagtt ttagtttttt tcgttcgacg ttaagttcgg ttttcgggcg tttttcgttt 2340 

cgttagcgtt ggacgttgtt tttgttttta agggttttta cgcgttttgg gagttgtttt 2400 

ttgtgggggt tttagggaat tgtgggttat agggagatta ttaggtcgtt tcggaggttc 2460 

gttttttcgt aatagggatt aagattgagg atttgttgtt tattagttgt tttgcggaat 2520 

tgtcggtcgt tttagttaat agattttatt ttagcggggt ttatgacgtt ttttcgttgg 2580 

tttcgggtga tttaggggag ggggttgagg gtttttttgg gtttttgatt ttttttagtg 2640 

gggagggagg gagtagcggc gacggcggag agtttttggt tagtacgtag ttttagtttt 2700 

tttcgttggg ttttcgtagc gtcgtcgcgg cggatttttt taaatttttg gtggcggata 2760 

tttttggaag tagtggcgtg gttgtattat tcgtgtcgtc gtcgtcgttt attttttttt 2820 

tttaggttaa ggcgcgacgt aaggggcgtc gcggcggtaa atgtagtacg cgttgttttt 2880 

gttcgcggtc gtacgttaag gttttcgttt gttcggtgga gagttgtgtg cggagttttg 2940 

cgcgtttcga cgagtttaat cgttatttgc gtatttatac gggttataaa tttttttagt 3000 

gtcgtatttg ttttcgtaat tttagtcgta gcgattattt tattacgtac gtgcgtattt 3060 

atatcggcga gaagtttttt gtttgcgacg tgtgcggtcg tcgtttcgcg cgtagcgatg 3120 

agaagaaacg gtatagtaag gtgtatttta agtagaaggc gcgcgtcgag gagcggttta 3180 

agggtttcgg tttttattcg ttgggttttt ttttcgtttt tttttgagta agagatgggt 3240 

ttatgggttg gggcgtcgtc gttcggcgcg tacgagtttc gggtcgtttt ttttttcgtt 3300 

ttttttttaa ttttttttcg tacgttcgag ggtcggtttt cggtttcgtt tttagttttt 3360 

ttgaagcgtt cgtcgtatac gttttattta gtattagttt cgcggatagt tttcgcggtt 3420 

taggcgttgt tatttttgtt agtcgcgttt tgggggaagt tttttgagat tatttagtga 3480 

ataggtatta ttttgggatt taagatagtt ttttgtaatt ggtatacgtt ttacgttttt 3540 

ttttataatt tttagagata ggttggggta gcgttaaggc ggtttcgcgc gggattttgt 3600 

atagtagtgt tttatttagt agttattgga tgtaacgttt tgttttgggt tttttttttt 3660 

tttgttgttg ttaatttttg taaagtagac gttattttta agtagttgat aaaattgttt 3720 

atttttgtaa ttaaaattat tgtgttaaaa gtttattgaa tttgttatgt aagtttttga 3780 

tttttttttt tagttttttt tttaggatta tatgtgttag aagataaagg agatttttac 3840 

gtttatggga tatagatttt gaatatatta tatatttatt gttaggtata tttgttagga 3900 

attgcggtat agagtaggta tttagtaaat gtgtgttggg ttaatgaacg aatttaatta 3960 

tagtttattt aagggttgga ggtgtttttg gaattttaaa gtggttaaaa ggcgtaggtg 4020 

tttaatatta tgtatatatt atttttttta attttcgtat ttttaaataa agtaggtata 4080 

tattttattt taaagatgtg gaagtaatgt agatagtttg ttggaataat gtagttaatt 4140 

taggttagga atagataggt taaataaatt tgtttaattt taaagttaat ttttattttt 4200 

<210> 208 
<211> 4491 
<212> DNA 

<213> Artificial Sequence 
<220> 

<?23> chemically treated genomic DNA (Homo sapiens) 
<400> 208 

tttagtaatt ttttatgttg aaatttggtt tagtatattt tgattatata tttatattta 60 



seite 104 



seq_prot.txt 

attatagttt taaaatagtt aagtagtttt aaaaatttaa attaaatttt tgtggaattt 120 

ttaaaaataa gaaaaatatt attgagtaat tggtaatata aaatataaaa tgtttaattt 180 

ggtttttttt tagatttgaa gtatagtgat taatgaatat ataatggtcg tagttttttt 240 

tataattatg ttatttaagt tattgatgaa tatttatgat atatgtagta aaaatttaga 300 

taaatatttt ggttataaat tggtttatta aataatagtt tgtattatag tttgagttaa 360 

gagtaatagg ttitataattg tgttttaggg tattaagaag tttttttgag gattttttaa 420 

aaggtatttt ttgttttgaa agattgtttt ttttttggaa gttttaggtt ttatgagaaa 480 

ttatttattt tgaatttgtt ataaaaaatg atttgtaaat aagttttgtg tagtgaatta 540 

tgtgttttta gtaagaggat tttgaaagtt ttttgtaaaa tgaagttatt tttaaaagag 600 

agttatttag aatataatta aattgattta tatgttttta tttattttaa atatgaaggg 660 

aaattgtttt taaatatatt taaattttta atagaatagt agtatgttat tagtgtggaa 720 

ataatttatt tgttaaataa atatttgggg tgtatttgtt gtatattagg tattgtaata 780 

tagggattaa tgaaagagaa aaaaaataaa gtacgaaagg atagtagaaa tagttttata 840 

attttatttt ttaggaataa tttttatgga tattttgttg tgtgtttatt gtttttttaa 900 

tatatatatt ttttaattaa atggaattgt tttaaatatg ttatttaata gtttgttttt 960 

ttttatatta tttgaaaatt aagaaacgat tataatatgt tttatttaaa aattttaagt 1020 

ttaggtcggg tatagtggtt tacgtatgta attttagtat ttggagggat cgaggtgggc 1080 

ggattataag gttaggagat tgagattatt ttggttagta tggtgaaatt tcgtttttat 1140 

taaaaatata aaatttagtt gggtatgatg gtacgtgttt gtagttttag ttatttggga 1200 

ggttgaggta ggagaatcgt ttgaattcgg gaggtggagg ttgtagtgag cggagattgt 1260 

attattgtat tcgtttggtg atagagtaag attttatttt aaaaaataaa taaagaaaat 1320 

ttgaagtata gtattttttt aaattttaaa tagataatag aaattggttt ttttttattt 1380 

aaattagaat ttaagtttaa ttttatatat ttttgatagt ttggattttg ttttttaatt 1440 

ttataaaatt gggaatttaa gtattatttg gttcgattta aatgtaatgt agaatttgta 1500 

ttaaaatatt atattaaagt tttagatttg tagtagttaa tagtattttt atgtatgtgt 1560 

tagggattgt tttaaatatt ttatatatat taattttttt attttgtatt tttgttttcg 1620 

ttttatatag taggaaattg aaatattgag aggttaagta attaaagtta tagagttaga 1680 

gtgataggag taaagtttta atttaggtaa tttagatttt tagagttttg atttttatta 1740 

ttaagttgtt agtatagttt ttttggtaat tatttttaat ttaaatataa ttcgagtgat 1800 

ttatttaata agttattatt ttgataattt agtgatttgt aatgtaaaat tatttattgt 1860 

aatttattta atattattgt ttttttgtgt tgtaaaaatt atagtaatcg agatgtaatt 1920 

tattattttt ttttttattt tcggtatttt gtgttaattt ttttgttttg cggatttttt 1980 

tcgatttttt attatgcgtg ttaattgtta ttaatttttt tgtttgttgg ggattggggt 2040 

cgcgagggta tattttcgag gggtacgggg ttagggttag gtaggttgtg cggttgggcg 2100 

gggttttgtg ttttattgcg gagtgcgggt cgggaagcgg agagagaagt agttgtgtaa 2160 

ttcgttggat gcggattagg gcgtttttta ttttcgtcgg gagttcgtcg attggttggg 2220 

tgtgggcgta cgtgatcgat atgtggttgt attggtgtag ttcgttaggg tgttattgga 2280 

gatagaatgg aggtgttgtc ggattcggaa atggggtagg tgttggagtt attatggtta 2340 

ggtttgttgc ggggggaggg gggaaggtgg ttttttttcg tattgtttta aatcgatggt 2400 

ttttttttgg tatagggttt attgtagtat gttaaacgag gaggtagggg cgtcgttttt 2460 

tcgtttttta ttgtagtatt ggagatggat tttttgtatt tcggatttag ggtttttgat 2520 

agaagaggaa gaagggggag gggtagaagt gttaagggga gtttgttgag aaaagttgtt 2580 

tttgaagtta gaaggggttt ttgtttttat aatgttattt gatagagtgg aataatagta 2640 

tttaaggaaa cgggtagagg ataataaaga atggagtata tttatggcga ggagtaaaag 2700 

ttttatttta ttgaaaggtt tttttttttt tttggcgata aggatatatg tattggtggt 2760 

taaaagagag aggagataaa atcgttgtag atggttgatg tgaatttagt ggaaagagtt 2820 

attggggatg agagaaagag gaggaggtag gtattgtaga gcgtgagtgg tggtgttggt 2880 

tggtgaaata ttggttatta gtagtgtgtt tgtttttgta aaatatttaa gtaaattttt 2940 

tgtgaatagg gtggtaaata gatattagtg tttttgttag ttataaaatg tagtggtagt 3000 

ggttttttgc ggacgattgt agtagtgttt tttttttttt tgttaggtcg aaaagataat 3060 

tgtagaggaa taagaaattt tgtagtaaat gttggggtag aagtttattt ataagaagtt 3120 

atagtttata aatgtagttt gaatagtaga aaaaaaatta ttgtttttta aagtaggaat 3180 

aatgttaggt tatgaatgtt ttgttattgg aatgtattgg atattttgat tttatattac 3240 

gaaagtgatg tttaaatttt ttgatttaat ataaatttta tacgaaattt taataaatta 3300 

tgtatgaaat agtggatttt tttttttgtt agtgaagttt ttatgttatt aattaggtta 3360 

tttaagagta aattatttta taacgtaaat tattttgtta aaaattatgg tgaataaatt 3420 

tttgtaggtt taatatttaa gatttatagt taagtaattt tatatttttt tttggttgtt 3480 

tttagataat attgaataaa tatttaagat attaatttag tgtgtaaata ttttaaatta 3540 

aagtaatatg gttttttttt tagatgtatt tttgtttagt gagtatttat gagatatatt 3600 

tttgatataa agtgtttttt attgagtttt tttttttttt atttaaatgt aaaagtttat 3660 

ttttatgtat atttatagta gttagtttgt tatatttttt tattttttga ataaggatgg 3720 

tagtggtttt gtaaagtttt ttagggttag tgatagtgtt ttttaaataa tgtttatgta 3780 

atagaaagtt ttaagatgat ttattatatt gttttagtaa tatttttaag ataatgcgga 3840 

attgggttgg tttagataaa taagattata taagtttttt attgataaat ttaaatagtt 3900 

ttttaaaaaa * ttatttttgt tttaagaatt atttgattat ggatattagg gttaatagta 3960 

tttaggagtc gggcgtggtg gtttatgttt gtaatataag ttaaatagga ggttgaggga 4020 

agttgaggta ggaggattat ttgagttttg gcgtttaaga ttagtttggg taattaagta 4080 

agattttatt ttaaaaaaaa gtatttagag tttttttttt aaaatttgat ggaaattatg 4140 
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gttagaatta aagatgttat attaaaattt taaatgatat ttaatagtta atattattta 4200 

gttttttgaa ttttattagt atagtgtttg agagtttata aaattattat ttttatataa 4260 

attatttttt attattttaa tagtttattg tagtattatt tttattttaa atgttagaga 4320 

tttttattta agtatgttgt gattttattt aaggttaaat attgtataat aatgtaaaga 4380 

atttaagttt tttattttat tttaatttta taaaattata atgtttttta ttatatatat 4440 

ataaaataat atataataga ggtaatttga aaaatagtgt aagttattga t 4491 

<210> 209 
<211> 4491 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA CHomo sapiens) 
<400> 209 

gttagtgatt tatattgttt tttaaattat ttttgttata tattattttg tatgtatgta 60 

ataaaaggta ttatagtttt atagaattag aataaaatag aagatttaag ttttttatat 120 

tgttatataa tatttgattt tggataaagt tataatatat ttgaatagga gtttttaata 180 

tttaaaataa aaataatgtt ataatgagtt gttaaaatgg taaaaggtaa tttatgtaaa 240 

aataatagtt ttatagattt ttaaatatta tgttaatgga atttaaagaa ttgagtaata 300 

ttagttgtta aatgttattt ggaattttaa tgtgatattt ttagttttag ttatagtttt 360 

tattagattt taaaaaagag attttaagta tttttttttg agatagggtt ttgtttggtt 420 

gtttaggttg gttttgaacg ttaaggttta agtgattttt ttgttttagt tttttttagt 480 

tttttgttta gtttgtatta taggtatggg ttattacgtt cggtttttaa gtattattaa 540 

ttttgatatt tatgattaag taatttttaa agtagaaata attttttaaa gaattattta 600 

aatttattag tgaaaaattt atgtagtttt atttatttaa gttagtttaa tttcgtatta 660 

ttttgaaaat attattgaaa taatataata ggttatttta agatttttta ttatataaat 720 

attatttaag agatattgtt attgattttg gagggtttta taaagttatt gttatttttg 780 

tttagggagt ggggaggtgt ggtaggttgg ttattataag tatgtataaa gataggtttt 840 

tatatttagg taaaggaaaa aaaagtttaa tgaaaaatat tttatattaa gagtatattt 900 

tatgagtatt tattaaatag aaatatattt agaagaaaag ttatgttgtt ttaatttaaa 960 

atatttgtat attgaattaa tattttaaat atttatttag tgttatttaa aagtaattaa 1020 

agagaaatat aaaattattt aattataggt tttaaatatt aggtttataa aaatttgttt 1080 

attataattt ttgataaagt aatttacgtt gtaaagtggt ttgtttttga atgatttaat 1140 

taatggtata aaagttttat taataaaaga aaagatttat tgttttatat ataatttatt 1200 

aagatttcgt ataggattta tgttaaatta aagaatttga gtattatttt cgtgatgtag 1260 

aattaaagtg tttaatatat tttaatgata aaatatttat agtttgatat tatttttatt 1320 

ttaaaaaata gtaatttttt ttttgttgtt taggttgtat ttataaatta tggttttttg 1380 

taaatgggtt tttattttag tatttgttgt aaggtttttt atttttttgt agttgttttt 1440 

tcggtttaat agagggagaa aaagtattgt tgtagtcgtt cgtaaaaagt tattattatt 1500 

gtattttgta attaataaag atattggtat ttgtttgtta ttttgtttat agggagttta 1560 

tttagatgtt ttataaaagt aggtatatta ttggtgatta gtattttatt aattaatatt 1620 

attatttacg ttttgtagta tttgtttttt tttttttttt ttatttttag tagttttttt 1680 

tattagattt atattagtta tttgtagcgg ttttgttttt tttttttttt ggttattaat 1740 

gtatgtgttt ttgtcgttag ggaggaaaag aagtttttta atggggtaga gtttttgttt 1800 

ttcgttatga atatgtttta ttttttgttg ttttttattc gtttttttag atattgttat 1860 

tttattttgt taaatggtat tataaaaata aaaatttttt ttggttttaa aaatagtttt 1920 

ttttagtaga ttttttttaa tatttttatt tttttttttt tttttttttt tgttaaaaat 1980 

tttggattcg aagtatagga aatttatttt tagtgttgta gtggggggcg gggggacgac 2040 

gtttttgttt tttcgtttgg tatgttgtag tggattttgt gttaaggaaa ggttatcggt 2100 

ttaagatagt gcgagggaaa attatttttt tttttttttt cgtagtaagt ttggttatgg 2160 

tggttttagt atttatttta ttttcgagtt cggtagtatt tttattttgt ttttagtgat 2220 

attttggcgg gttgtattaa tatagttata tgtcggttac gtgcgtttat atttagttaa 2280 

tcggcgggtt ttcgacggga atggggagcg ttttggttcg tatttagcgg attatatagt 2340 

tgtttttttt ttcgtttttc gattcgtatt tcgtagtggg gtatagggtt tcgtttaatc 2400 

gtatagtttg tttagtttta gtttcgtatt tttcgggggt atgttttcgc ggttttagtt 2460 

tttagtaagt aaggaagttg atggtagttg atacgtatag taaagagtcg ggggaggttc 2520 

gtagggtaga aggattagta taagatgtcg gaggtgggag ggagagtaat aaattatatt 2580 

tcgattgtta tgatttttgt agtatagaga aataataata ttaaatgaat tataatgaat 2640 

aattttatat tataagttat tgagttgtta gagtgatgat ttgttagata gattattcga 2700 

attatatttg aattaaaaat agttattaga aaagttatgt tagtagttta gtagtggaga 2760 

ttagaatttt ggaagtttgg gttgtttgag ttgaagtttt atttttgtta ttttagtttt 2820 

gtaattttag ttatttaatt ttttagtgtt ttaatttttt gttgtataaa acgggaatag 2880 

aagtatagaa tagaggagtt aatatatatg aagtatttag agtagttttt gatatatata 2940 

tagaagtgtt gttagttatt ataaatttga ggttttaatg tagtatttta atgtaaattt 3000 

tatattgtat ttaaatcgaa ttaggtgatg tttaaatttt taattttatg aggttgaagg 3060 
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ataaaattta aattgttaag aatatataaa gttaaattta aattttggtt taaatggggg 3120 

gaaattagtt tttattattt atttaaaatt taaaaggata ttatatttta agtttttttt 3180 

gtttgttttt tgagatagag ttttgttttg ttattaggcg agtgtagtgg tgtaattttc 3240 

gtttattgta atttttattt ttcgggttta agcgattttt ttgttttagt tttttaagta 3300 

gttgggatta taggtacgtg ttattatgtt tagttaagtt ttgtattttt agtagagacg 3360 

gggttttatt atgttggtta ggatggtttt aattttttga ttttgtgatt cgtttatttc 3420 

ggttttttta agtattggga ttatatgcgt gagttattgt gttcggttta aatttaaagt 3480 

ttttaaatga agtatattat aatcgttttt taatttttaa atagtatgaa aagaagtaag 3540 

ttattaaata gtatatttga ggtaatttta tttgattaaa aaatgtatat attagaaaag 3600 

taatagatat ataataaaat gtttatagag gttatttttg gaaagtgaga ttatggagtt 3660 

atttttgttg ttttttcgtg ttttgttttt tttttttttt attagttttt gtgttataat 3720 

gtttgatata tagtaaatat attttaaata tttatttaat aagtgagtta tttttatatt 3780 

gatggtatat tattgtttta ttaaaagttt aaatatattt agaaataatt tttttttata 3840 

tttgaaataa ataaaagtat gtaaattaat ttgattatat tttaaataat ttttttttga 3900 

gaatgatttt attttataaa aggtttttaa aattttttta ttgaaagtat atgatttatt 3960 

gtataggatt tatttataag ttattttttg tgataaattt agaataggta attttttata 4020 

aggtttagaa tttttagaag gaaaatagtt ttttaaaata aagaatattt tttaaaaggt 4080 

ttttaaaaga attttttagt attttaagat ataattgtaa gtttattatt tttgatttag 4140 

attatgatat aagttattat ttagtaaatt agtttgtagt taaaatgttt gtttaaattt 4200 

ttattgtatg tattatagat atttattagt gatttaaatg atatagttgt aaagaaagtt 4260 

acgattatta tatgtttatt agttattatg ttttaaattt aaaagaaagt taagttaaat 4320 

attttgtatt ttatgttgtt aattatttag taatattttt tttattttta gagattttat 4380 

aaaagtttaa tttgaatttt taaaattgtt tagttatttt aaaattataa ttaaatatag 4440 

atatataatt aaagtatatt gggttaaatt ttagtatgga aaattattga a 4491 

<210> 210 
<211> 4256 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 210 

atatttttta ggtttatgat ttgagagttt attaaataag agatggttat ttttttggtt 60 

tttaaattat tttggaaata aagttatttt tagagaggaa ttttaaaata ttgtttgtag 120 

ttatagtaat tttaaaattt gagtgttgta tggtggaagt agataattta ttttaggata 180 

attgttattt gttatattag tttgaggatg gtggtgttaa agaggagtta tttattttta 240 

ggtatatttt atattaaata taaattgtat aatttgttta aattaaggaa ttatattaaa 300 

ttatattatg gttattaaat tttgttttga gaaagtgaaa ttgatttagt ttttaaagag 360 

ataaagagaa agtataagta aattaaattg tagttataaa aagaaagata aaatgttgta 420 

gtatatttat tgttttgtgt atttaatgaa gtttttcgtt ttggttataa aattagtttt 480 

aaaggttttt tttatatttt atagtatgaa aaatttaaaa agtaatttat atgtaaatat 540 

ttaaattatg atagaaattt aaagtaaaaa gaaaatgaat taattgaatt aaaatgtgta 600 

ggatgtttaa atttatttga taatatattt atttgataat atattaatat gaatttagta 660 

ttttaaaatg ttatataaat aaatgttttt atattaaata ttaatgtagt taggatttta 720 

agttaatatt attttttttt ttttatatgt ttttttttcg tttttattaa aaattgttaa 780 

aattatttat tttttttttt tttttttgtt tttaaataaa taaggttttt tttaagatat 840 

tgtaggatta taaagttaaa ttttcgggtt taagttgttg gtaaaatttt agagatgtta 900 

agttatttat gtattaatta tttttaaatt tttttttaat ttttttataa aataggagta 960 

gggagaggag aaatattttt gtttaaaaat gaggaattga aaatttttat tataaataaa 1020 

ttatattaag taagttaaag atagtaaaag agtaaaaatg ttagtagata tttttaaaat 1080 

ggtaattata tattattttt ggaatgatta tatgaatgtg gtttattatt ttttaagttt 1140 

ttatagtaaa tatatattta tttgttttat ttagttaaaa ataaatataa tatgtagttg 1200 

tttttgaata attttttttt tttttttttt tttttttttt ttttcgataa agttttattt 1260 

tgttatttag gttggagtga agtggtttta tttcgttgtt tattataatt ttagtttttc 1320 

gggtttaagc gatttttttg ttttaatttt tcgagtagtt gggattatag gcgtttgtta 1380 

ttattttcgg ttattttttg tatttttagt agaggcgagg ttttatttgt tggttaggtt 1440 

ggtttcgaat tttcgatttt aggtgatttt tttcgttttg attttttaaa gtgaagggat 1500 

tataaggcgt gaggtatcgc gttcggtcgt ttttgaataa tttcgattaa aatttatatt 1560 

cgatatttat tttaatatat attatagatt tttattgata atttttttta gtaagaaaga 1620 

taagttttat ttaggtattt gtgaattgga ggttaagtag ttttagtata ttttatattt 1680 

ttttaagatt ttttttttat tttaaacgtt cgtaaatttt gtatttgata aagagtatat 1740 

ttttatttaa tataaatatg tttttttttt tagatttttt tagtattcga gagatttgta 1800 

cgcgcgtggt tttttatttt ttttttttgg ttttttaagt ttttagggcg tcgttaggag 1860 

gaggtttgtg attataaatt ttttttgaaa attttttagg aagttttttt ttttttcgga 1920 

gaatcgaagc gttatttgat tttaattttt ttgtaaattt cgttttttag agtcgttcgt 1980 
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tattttttgt tttcgttgta gattttttat ttatttggat cggttttcga tcgtaattat 2040 

tcggtgcgtt gggtagcgtt ttcgttttta gtagcgttcg tatttttttt attcgatttc 2100 

gggtcgcggt cgtggttagt tagttagtcg aaggttttat gttgtttttc gtcgtcggtt 2160 

ttatgttgtt tttcgtcgtt cgttgtttgt tttttttttt ttcgtagtcg tcgagcgtac 2220 

gcggttxgtt ttattttttg gtgattagtt agtttttttt tttttttttt tcggtgttgg 2280 

cggaagagtt tttttcgatt ttgtttttta aattttttgg agggatcgcg gtattttttt 2340 

aggtaagggg acgtcgtgag cgagtgttcg gaggaggtgt tattaatttc qaotatttaa 2400 

cgaatgtggt atttttgaag tcgttttagg ttgggttttt ttcgggggta ttagtcggaa 2460 

gtagttttcg ttagagttag cgttggtaag gaaggaggat tgggtttttt tttatttgtt 2520 

ttttatatcg tttttcggtt tttttgtttt tagtcgcgtt ttttcgtttg ttagtaaagg 2580 

cgtgtttgag tgcgtttatt ttgttaaaaa gaaattcgtt ttcgtttcgt tttttttttt 2640 

cgcgatataa tttttttaat tgttaaattg aatcggggtg tttggtgtta tagggaaagt 2700 

atggtttttt tttttaatta taagaaaaag taaaattatt tttttttagt tgtgagagtt 2760 

ttatcgagaa tcgaaattat ttgtacgatt agaaagtgtt tttttatttt tttaattttt 2820 

gatttttagg agcgcggggt ttattaagtt agaaatttta gtttaaagga ttttttttgg 2880 

agagtcggat tgtttttttt tttttttttt tttttttttt tgcgtgtaaa acggttgttt 2940 

ggggtaaggg ttttttagac gtgtatattg tttggtataa gagtagattt tgaaaagatg 3000 

aggtttattt aatacggacg ggggagaatt ttgtttgtag gtagatagga aaatggggag 3060 

ggagttattg gaaggacgga ttttattttt aaagttataa tttttagatt agaaaaagtg 3120 

tttagtgttt tagaagtaga gttgtatagt gatttaaaga ttagttttaa atattgtttt 3180 

gtttttttta tattttttat attttttttt tttattgaaa atattttgta tttttcgtaa 3240 

ttataaaggg ggaagggaat atgagtgttt tttgttttat aggggttgtt gtgagtttaa 3300 

atgatgtatt aatatatata agttttaaga atagtgttat atattttaag ttaatatttg 3360 

ttagtttttg aattattcgt tttgaggatt ggtttgtaat tttgttttga ggtatagaaa 3420 

gaaaatgttt tggagtagga cgcggtggtt tatatttgta attttagtat tttgggaagt 3480 

cgaggcgggt agattatttg aggttaggag ttcgaggtta gtttggttaa aatggtgata 3540 

tttcgttttt attaaaaata taaaaattag ttggttatgg tggcgtacgt gtgtaatttt 3600 

agttatttag gaggttgagg taggagaatc gtttgaattc gggaggtaga ggttgtagta 3660 

agtcgagatc gcgttattat tttttagttt gggtgataga atgagatttc gatttaaaaa 3720 

aaaaaaaaaa aatgttttgg atagaattat tattattata taaaaggaaa gttcggatgc 3780 

ggtggtttac gtttataatt ttagtatttt gggaggtcga gataggcgga ttatttgagg 3840 

ttaggagttc gagataagtt tgattaatat ggcgaaattt tgtttttatt aaaaaatata 3900 

aaattagcgg ggtttggtgg cgtatgtttg taattttagt tattcggagg ttgatgtagg 3960 

agaatcgttt gaatttagga gaaggcggag gttgtagtga gtcgagatcg cgttattgta 4020 

ttttagtttg ggagataaga gcgaaatttg gttttaagaa aaaaagaaag aaagaaagaa 4080 

agaaagatta agaagaattt attttttgaa aagattatgg gtatttttta ttatttttat 4140 

ttataaagaa aagttaaata gtattaaaga gtataataag cgtaaggagg taaaagtttt 4200 

aatttttttt gtgattatta ttttttaagt ttattaaaaa tatgtattac gtttta 4256 

<210> 211 
<211> 4256 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 211 

taaaacgtag tatatgtttt tgataagttt aaaaagtagt agttatagga aaaattaqaa 60 

tttttatttt tttgcgtttg ttatattttt tagtgttgtt taattttttt ttgtaagtga 120 

gggtggtgga gggtgtttat aattttttta gggagtaagt tttttttggt tttttttttt 180 

tttttttttt tttttttttt tgagattaag tttcgttttt gttttttagg ttggagtgta 240 

atggcgcgat ttcggtttat tgtaattttc gttttttttt gggtttaagc gattttttta 300 

tattagtttt cgagtagttg ggattatagg tatgcgttat taagtttcgt taattttgta 360 

ttttttagta gagatagggt ttcgttatgt tggttaggtt tgtttcgaat ttttggtttt 420 

aggtgattcg tttgtttcgg ttttttagaa tgttgggatt atagacgtga gttatcgtat 480 

tcggattttt tttttatgta atagtgataa ttttatttaa agtatttttt tttttttttt 540 

t 9 a 9j£9Sag ttttattttg ttatttaggt tggagggtgg tggcgcgatt tcggtttatt 600 

gtaatttttg tttttcgggt ttaagcgatt tttttgtttt agttttttga gtagttggaa 660 

ttatatacgt gcgttattat ggttagttaa tttttgtatt tttagtagag acggggtgtt 720 

attattttgg ttaagttggt ttcgaatttt tgattttagg tgatttgttc gtttcggttt 780 

tttaaagtgt tgggattata ggtgtgagtt atcgcgtttt gttttaaagt attttttttt 840 

tatgttttaa aataagattg taagttagtt tttaaagcgg ataatttaag agttaatagg 900 

^S^S""** 99 at 9t:gtgg tattgttttt aaggtttata tgtattaata tattatttaa 960 

atttataata atttttataa agtagggggt atttatattt tttttttttt ttataattac 1020 

gaaaaatgta aggtattttt agtaggaaag agaaatgtga gaagtgtgaa ggagatagga 1080 

r^tairttga agttggttr^ tgtaattttg . tttttagaat attgagtatt 1140 - 
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ttttttggtt taggaattat gattttgaga atggagttcg tttttttaat gatttttttt 1200 

ttattttttt atttgtttat aggtagaatt ttttttcgtt cgtattaaat aaattttatt 1260 

tttttagagt ttgtttttat attaggtaat gtatacgttt gagaaatttt tgttttagat 1320 

agtcgtttta tacgtaggag gggaagggga ggggaaggag agagtagttc gattttttaa 1380 

aaggaatttt ttgaattagg gtttttgatt tagtgaattt cgcgtttttg aaaattaagg 1440 

gttgaggggg tagggggata ttttttagtc gtataggtga tttcgatttt cggtggggtt 1500 

tttataatta ggaaagaata gttttgtttt tttttatgat taaaagaaga agttatattt 1560 

tttttatgat attaaatatt tcgatttaat ttggtagtta ggaaggttgt atcgcggagg 1620 

aaggaaacgg ggcgggggcg gatttttttt taatagagtg aacgtattta aatacgtttt 1680 

tgttggtagg cgggggagcg cggttgggag tagggaggtc ggagggcggt gtggggggta 1740 

Sgtggggagg agtttagttt tttttttttg ttaacgttgg ttttggcgag ggttgttttc 1800 

ggttggtgtt ttcgggggag atttaatttg gggcgatttt aggggtgtta tattcgttaa 1860 

gtgttcggag ttaatagtat ttttttcgag tattcgttta cggcgttttt ttgtttggaa 1920 

agatatcgcg gtttttttag aggatttgag ggatagggtc ggagggggtt ttttcgttag 1980 

tatcggagga agaaagagga ggggttggtt ggttattaga gggtggggcg gatcgcgtgc 2040 

gttcggcggt tgcggagagg gggagagtag gtagcgggcg gcggggagta gtatggagtc 2100 

ggcggcgggg agtagtatgg agttttcggt tgattggttg gttacggtcg cggttcgggg 2160 

tcgggtagag gaggtgcggg cgttgttgga ggcgggggcg ttgtttaacg tatcgaatag 2220 

ttacggtcgg aggtcgattt aggtgggtag agggtttgta gcgggagtag gggatggcgg 2280 

gcgattttgg aggacgaagt ttgtagggga attggaatta ggtagcgttt cgatttttcg 2340 

gaaaaagggg aggttttttg gggagttttt agaaggggtt tgtaattata gatttttttt 2400 

tggcgacgtt ttgggggttt gggaagttaa ggaagaggaa tgaggagtta cgcgcgtata 2460 

gatttttcga atgttgagaa gatttgaagg ggggaatata tttgtattag atggaagtat 2520 

gttttttatt agatataaaa tttacgaacg tttgggataa aaagggagtt ttaaagaaat 2580 

gtaagatgtg ttgggattat ttagttttta atttatagat atttggatgg agtttatttt 2640 

ttttattagg agggattatt agtggaaatt tgtggtgtat gttggaataa atatcgaata 2700 

taaattttga tcgaaattat ttagaagcgg tcgggcgcgg tgttttacgt tttgtaattt 2760 

ttttattttg ggagattaag gcggggggaa ttatttgagg tcgggagttc gagattagtt 2820 

tggttaatag gtgaaatttc gtttttatta aaaatataaa aagtagtcgg gggtggtggt 2880 

aggcgtttgt aattttagtt attcgggagg ttgaggtagg agaatcgttt gaattcggga 2940 

ggttgaggtt gtagtgaata gcgagatgga gttattttat tttagtttgg gtgatagagt 3000 

gagattttgt cgaaagaaag aaagagagaa agagagagag aaaaattatt tagaagtaat 3060 

tatatattgt gtttattttt aattgagtag ggtaaataaa tatatgtttg ttgtaggaat 3120 

ttaggaaata atgagttata tttatgtgat tattttagag gtaatatgta gttattattt 3180 

tgggaatatt tgttaatatt tttgtttttt tattattttt agtttatttg atatagttta 3240 

tttgtgataa gagtttttaa ttttttattt ttgaatagag gtgttttttt tttttttatt 3300 

tttgttttgt gagggagtta ggggaggatt taaaagtaat taatatatgg gtaatttagt 3360 

atttttaaaa ttttgttaat agtttgaatt cgggagtttg gttttgtagt tttataatat 3420 

tttagaagag attttatttg tttaaaaata aaaaggaaaa agaaaagtgg atagttttga 3480 

taatttttaa tggagacggg agaagaatat gtagaaaagg ggaaatgatg ttggtttaga 3540 

attttaatta tattggtgtt taatatagga atatttattt atataatatt ttaaagtatt 3600 

aaatttatat tagtatatta ttaaatggat atattattaa atgggtttaa gtattttata 3660 

tattttaatt taattgattt attttttttt tgttttggat ttttattatg atttaaatat 3720 

ttatatatgg gttatttttt agatttttta tattatgaaa tataagaaaa atttttaagg 3780 

ttagttttat gattaagacg aaggatttta ttgaatatat aaaataataa atatattgta 3840 

atattttgtt tttttttttg tagttgtaat ttggtttgtt tatatttttt ttttgttttt 3900 

ttgaaaattg agttagtttt atttttttag gataggattt aataattata atataattta 3960 

gtataatttt ttgatttagg taaattatgt aatttgtgtt tagtatgaaa tgtatttaaa 4020 

aataagtaat ttttttttaa tattattatt tttaaattaa tataataaat aatagttatt 4080 

ttaaaataaa ttgtttattt ttattatgta gtatttaaat tttaaggttg ttatgattgt 4140 

agatagtatt ttaaaatttt tttttggaaa tggttttgtt tttaagatga tttaggaatt 4200 

aaagaggtga ttattttttg tttaatgaat ttttaaatta taaatttggg aagtgt 4256 

<210> 212 
<211> 4414 
<212> DNA 

<213> Artificial Sequence 

<220> . 
<223> chemically treated genomic DNA (Homo sapiens) 

<400> 212 r 

•« - : . 

aatgtttgga gtatatattt taatgaatat ttattttatt ttattttttt ttatttttga 60 

attaagtaat tttgaattta aagttgttat gattagtatt gaaaagatta ttggattatt 120 

aattgtgtga ttttgggata gtaatttttt gtattttagt ttgtttatat gttatatatg 180 

aaggttgaag tttgattttg ttttgtgatt attattttaa atatttgatg aaattaaatt 240 

ttagtgtttg gaatggtagt ataatakatt tattaagaat aaataattta ttgtaaaaat 300 
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atattgattt ttaaatgatg taattgatag ttatattatt gtagagggtt gataaataat 360 

aaaagaaatg aaagatgtat atggtgagaa ttgaaattat tttgataagt tttttatttg 420 

tttattattt aaaattaatg attatgttga atgtttataa attataaaat ataaaagaaa 480 

ttttataaat gcgtatgtat aggagtttaa gttattaaaa gttttaaagt ataagtttaa 540 

attaaattaa ttaaagaagt tgagaggaaa aattggtttt tatttttaat tattattgtt 600 

ttgaggtttt atgtttaata taatttttta agtagaggtt ttagagagaa gagttgtgag 660 

gatattttta tatttgtgta gaaggaaaag tttgttattt attttagtat ttttagtgtt 720 

atattgatgt gtattttgga tttattttgt ttttattgta taaatttata tttgatttta 780 

aagaaaagga aaatttaaag tttttttttt ttaaggggat agaaattttt tgtgttaatt 840 

gtttgatttt tttttttgta aggttttatt ggaaattttt tgtaatataa tgtaggggat 900 

ttttttatgt gttgatgttg tttatatagt ggggtgggtt tgattgaaga aaaaaaatcg 960 

tatatacgta tgaaagatta tggttttatt ttcggaaagt atgaaaggtg attgatattt 1020 

ttaagaagtt tttgttattt aggaaaatta ttaaatattt tatttagaga tatttggaaa 1080 

gattgaagga aaggaagaac gaagaaagta gaatttagat ttatgtgggg agagatttgt 1140 

ggtagaggaa aagtattttt tttgaattcg ataagggatt tgtttggggg aattttttgt 1200 

ttagtttttt attattaggg ttttttgaag tcgggttttt tattgggtag ttttttggga 1260 

gtgtagtggg gaatttttat attttttttt taggttttcg aaggatttcg tttttttagt 1320 

gtttttttta ggttggtagg agttttgagt ttgatatttt tttttgatgg gataggtaag 1380 

ttttgtgggc gcgtaaatac gttgtaatta agttttttgt tgattttata gttttgtgtg 1440 

ttttcgagaa gaagtgatcg tatttaattg tttattgttg gtttgttttt taagagtttg 1500 

ggggtttttt ttttttaatt tagaattagt tgtacggggg gcggggaaat gggggtgggg 1560 

aaggagtggg agggtagtgg ttttcgcgag tagagcgatg ttattgagtg agtttttgaa 1620 

tggggagcgt tgttgttttt aagtcgattg gtattttttg ttaggaagaa acgttaagag 1680 

gtgggagtgt ttggggaggg aggtaggcgg tttttatcgt aggcgcgggg agttgttttt 1740 

tcgttttttc gtttgttttt taagtttgga tttttaggag tggttgaagt tgcggagcgt 1800 

ttttggagtt tgtgaatgaa tttttttttt tttttttttt ttttttttcg ttgagttttt 1860 

ttttcggttt tgacggtata gtgatataat gatgatgggt gttataattc gtatttgaat 1920 

ttgtaggcga gttgtttcga gtttttttgg ggaagaattt taggcgtgcg gacgtaatag 1980 

tcgagaatat taggtgttgt ggataggagt tgggattaag attttcggtt agtttcgtat 2040 

tttttcgtat tttttagtat cgtttcgtat ttttcgtatt ttttttcggg ttattacgtt 2100 

ttttatgtga ttcgtttggg taacgtcgaa tttagtcgcg tagcgttgta gtgaattttt 2160 

tttttaaatt gtaataagtc gttttttaag gtaattacgt tttttttgtt ttttttttaa 2220 

aaaataaaaa taaaaaattt atagaaaaaa attcgcgagt ttagaaaaaa gaagtaattg 2280 

gtagaaggtt ttaattaagg taaagagttg taaggcgaag ttaagaaaat gtaggtattt 2340 

aaaaaatgta ggtaattttt ataagggttt ttggggagag gtatatagag ggattttggt 2400 

gttgaaaaag atttagataa aagaaattta gggtggggtg gggggtaaaa tgattaacgg 2460 

aattggggga agggagggaa taaattgtaa agaaattata gaaaagtggt gggtttttga 2520 

gttggagaga agagagggat ttttggtatt ttgatttttt tgttgttgtt gtttttaata 2580 

cgttcgaggt aaaagtttga atggggatta ttaagatttg ttatagataa gttttcgaag 2640 

tcgttttggt gtaggttatt tggtttttta gtttttggtg tgtggttagt gtttggtgtt 2700 

tttggaaagt tattttcggg tagtttttga tagtgcgatt cggcgtttaa gtagtttggg 2760 

attttgcgcg gatttgattt ttttagatcg taggtagttt gggaggaggt tcggtcgggg 2820 

gaggtgtagg attttcgtcg tgtttttttg acgatttggg gattgttacg gttttttttc 2880 

ggcgtttttg ggtttttttg ttttgtacgc ggtgcgaagg ggttagtagg gaaggagtag 2940 

aggatggggg gtggggttgt tggagtttcg cggaggtttg ggaggttttt gggcgggaaa 3000 

agtttgtttt gaatcggtag ggatgtgtaa taattttttt tattttgaag agtgaaatag 3060 

ggtttgtcgt ttttatttta ataagtaaat cggtatttag agcgtattgt agataaaggt 3120 

ggttcgggga ttcgaattag gggtttttgg gttagttttt tcgtttagat tataggagtt 3180 

tttcgttttt ttatatattt tttatttttt tttagttttt cgttttagtt gagtaatttt 3240 

attggtaggt ttagcggtag cggcgcgcgc gtgtgtcggg agtttcgggg gacgttttcg 3300 

gttggagcgt tttatcgttt tgtagaggcg tgggcgttgt agggcgatat ttttggtgcg 3360 

tgtagattta gggtattcgg gttgttttgg ttcgcggttt gtgttattgg gcgtggagcg 3420 

gtttgggtgt taggtagagg agagcggggt agaaaaatag tgttatagtt taaattattt 3480 

gttttttaat ttaattgcgt ttcggtaagt tttgtatgtt tttgggagat gttgcgggaa 3540 

gggggagaaa tttatacggc gtttggagag ttgttttgtc gcgcgtatat tcgcggtaga 3600 

gtttttttgg gtcgcgtttt ttatttttgt gttttttttt gtttttattt tttttgtttt 3660 

tttttttttt ttttagtttt tgtttttttt ttttagtttt tgcgtttttt tttttatagt 3720 

tgataaatga atggttagtg tgaaattttt gtttttttcg tgttttaagg tagtagggag 3780 

ggaggagcga gggaggcggt gcgttttttc ggatttgttt ttagtttagt ttataagatt 3840 

tgtagaattt agatgtttag gaattgggag ttttgcggcg ggtgtgggcg ttttttgatg 3900 

gagaagtttc gtataggcgg agaaaaataa gttttttaga ttaagcgagt attttttata 3960 

attttgtgtt taaggatgga aggttttagt ttttttttaa gttatttatt ttgtatttta 4020 

£aatttgcgg cgtatattta ggagtttgga ggattttgaa aaaaggtttt ggtttgtgta 4080 

aagtgtagag atgttttttc gcgagtcggt tggaacgtac gcggcgtttt gggtttagtt 4140 

cgtcgtttag tggacgaacg tatatggtta gggtggagtt aggtttcgga ggtttagacg 4200 

cgttattttg ggcgtttttg agaaagataa tatatatatc gtgtttttaa aatttatgat 4260 

tatttgaatt taacgttagg gtttcgtttt aggatatfgt aaaagaaggg tttttaattt 4320 

.aaaatttaaa tttttttaat tttagggcgg gcgtcggacg ggaaagtgga gagaaggcgg 4380 



seq_prot.txt 
gtagtgggag gaaaaaagaa agggaaggaa ggga 

<210> 213 
<211> 4414 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 213 

tttttttttt 
acgttcgttt 
ttttggggcg 
tgtattattt 
attttggtta 
tttagtcgat 
gttttttaaa 
gaaaattagg 
gggtttgttt 
aaaattttta 
attcgaagga 
aggtagggat 
agagagagag 
agatataaag 
ataatttttt 
taggatttgt 
ttttatttcg 
cgggttagaa 
tttacgtttt 
atacgcgcgc 
taggagaggg 
tgatttagag 
gtcggtttgt 
attgttgtat 
ttcgcggggt 
tatcgcgtgt 
tcgttaaaga 
tttgcggttt 
attgttagga 
gggttgggaa 
ttggtagttt 
ttaaagtgtt 
ttttttataa 



4414 



tttttttttt 
tgaagttgag 
gggttttagc 
tttttaaaga 
tgtgcgttcg 
tcgcgagaag 
tttttagatg 
gttttttatt 
tttttcgttt 
atttttagat 
gcgtatcgtt 
tttatattag 
aatagaggtt 
atgaggaacg 
aggcgtcgtg 
cgagacgtag 
tttttttttg 
taattcggat 
tgtaggacgg 
gtcgttgtcg 
tggggaatat 
gtttttgatt 
ttgttgagat 
atttttgtcg 
tttaataatt 



attttgggtt 
ttaaaagttt 
tttatagttt 

gggttttttt 

ttattttgga 
tgggttcggc 
gagggtgcgg 
tttagttttt 
ttttttagaa 
tattattata 
ggagaggagg 
agagtttagg 
gggatcgttt 
gtattaatcg 
tttgttqgcg 
tgtagttagt 
tagataattg 
aatttggtta 
ttaggtttaa 
tttagaggga 
tcggttttaa 
ttgtcggatt 
attttgtttt 



aggataaaag 
gatacggcgg 
gaaggggtta 
gttgttcggg 
attaggtggt 
ttatttaaat 
aaaggttttt 
tttatttttt 
ttttttgttt 
ttatgaaagt 
tttgttttaa 
ttataagttt 
aggcggttta 
gttgtttagg 
gacggtgttg 
gtttataata 
aggttcgggg 
ttattgtatc 
agggtttatt 
tttggaaagt 
gttttttttt 
gtttggggat 
gaaattattg 
tttgggttag 
agtacgatta 
tagcgtgttt 
ggtttttgtt 
aagtgtaaga 
aagattttgg 
taaagagaat 
tttcgttttt 



ttttttttta 
gaagtttaag 
gttgggttta 
cgtttagggt 
tttattaagc 
gtatttttgt 
tacgtcgtag 
tttgaatata 
gtgcgggatt 
atttaggttt 
tttttcgttt 
ttatttattt 
gggagagggg 
cgatttagag 
tagatttttt 
ttggattgga 
tttaatattt 
gttttaggtt 
tggggcgttt 
ttagatttgt 
gtaaggaaac 
cgggttttcg 
gggagcgata 
atttagaata 
ttatttttta 
aatttagggg 
ggattttgta 
ggttcgcgta 
agtggttttt 
ttgtattaag 
ttttgtttcg 
tttttttttt 
tttttttttt 
gaattttttt 
tatttgtatt 
ttaaagtttt 
tttgtttttg 
ttgtagtttg 
cgggttatat 
gaagatgcgg 
tttaatgttt 
tagttcgttt 
gttagagtcg 
tataggtttt 
aggcggaggg 
ttaggtattt 
agtagcgttt 
gttttttatt 
gggaaaggag 
ttttttttcg 
acgcgtttat 
aatttgaaag 
attttttatt 
taataaaagg 

attttttttt 
tttttttttt 



ttgttcgttt tttttttatt 
ttttggatta ggagtttttt 
agtggttatg ggttttgaga 
ggcgcgttta gattttcgaa 
ggcgaattgg atttagggcg 
attttgtata gattaagatt 
attgtggagt gtaaaatgaa 
aggttgtgaa agatgttcgt 
tttttattag ggggcgttta 
tgtaagtttt gtgagttgaa 
tttttttttt gttgttttgg 
gttagttgtg ggggagggga 
aagggaggta gagaagatgg 
agattttgtc gcgggtgtgc 
ttttttttcg tagtattttt 
gagtaaatag tttaggttat 
agatcgtttt acgtttagtg 
tgtacgtatt agggatgtcg 
tagtcgagga cgtttttcgg 
tagtaaagtt gtttaattag 
ggaggatttt tgtagtttgg 
agttattttt gtttgtagtg 
gattttattt tattttttag 
ggtttttttc gtttaggggt 
ttttttgttt tttttttgtt 
cgtcgggaga gaatcgtgat 
tttttttcgg tcggattttt 
aggttttagg ttgtttgggc 
taggaatatt aggtattgat 
gcggtttcgg ggatttgttt 
agcgtgttaa gaataataat 
tagtttaaga atttattatt 
aatttcgtta gttattttat 
taatattaag gtttttttgt 
ttttaagtgt ttatattttt 
ttattaattg tttttttttt 
ttttttaagg ggggaataaa 
gggggaaaat ttattgtagc 
aggaagcgtg gtggttcggg 
gaggatgcgg ggttggtcga 
tcggttgttg cgttcgtacg 
gtaagtttaa atgcgggttg 
aggaggagat ttagcgagaa 
aaaagcgttt cgtagtttta 
gcggaaaggt agtttttcgc 
tttttttttg gcgttttttt 
tttatttagg gatttattta 
ttttttttat ttttattttt 
tttttaggtt tttagggggt 
ggagtatata aaattgtaaa 
agagtttgtt tgttttatta 
agatattgag aaaacgagat 
gtatttttag ggaattgttt 
ttggatagga aattttttta 
tg-ttataaat ttttttttat 
agttttttta agtatt^ttg 
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ttttcgttcg 
ttttgtaata 
gtacggtatg 
atttggtttt 
tcgcgtgcgt 
tttttttagg 
tgatttaagg 
ttggtttggg 
tattcgtcgt 
ttgggggtag 
agtacgggaa 
cgtagaagtt 
agatagaggg 
gcgcggtagg 
tagggatatg 
gatattattt 
atatagatcg 
ttttatagcg 
ggttttcggt 
agcggagaat 
gcgagaggat 
cgttttgggt 
ggtgagaaaa 
tttttagatt 
ggttttttcg 
agtttttaag 
ttttagattg 
gtcgggtcgt 
tatatattag 
gtggtaagtt 
aataaaaaaa 
tttttatgat 
tttttatttt 
atgttttttt 
ttaatttcgt 
ttaagttcgc 
agaaacgtga 
gttgcgcgat 
gaaggatgcg 
agattttggt 
tttggagttt 
tgatatttat 
gaaggaggag 
gttattttta 
gtttgcggta 
ttgataagaa 
gtaatatcgt 
tcgtttttcg 
aggttagtaa 
attagtaaag 
aagggaagtg 
ttttcgggga 
aatggggagt 
ggtaaatttt 
ataagtttag 
agtagaatat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
1060 
3120 
3180 
3240 
3300 
3360 



seq_prot.txt 

ttgataattt ttttgagtaa tagggatttt ttggaagtat taattatttt ttatgttttt 3420 

cggaaataag attataattt tttatgcgta tatgcgattt ttttttttta gttaggttta 3480 

ttttattgtg taaatagtat taatatatgg aagagttttt tgtattgtgt tataaaagat 3540 

ttttaatagg attttataga gaaaagggtt aaatagttga tataaaggat ttttgttttt 3600 

ttagaaaaga gggattttgg attttttttt tttttgaagt taagtatgag tttatataat 3660 

aggaataaaa taaatttaag gtgtatatta gtataatatt agggatatta gaatggatgg 3720 

taaatttttt ttttnatata aatatgaaag tatttttata attttttttt ttgaagttrt 3780 

tatttagaaa attatattaa atataggatt ttaaaatagt agtgattaaa gatgaaagtt 3840 

aatttttttt tttaattttt ttgattagtt nggtttaaat ttatgtttta aaatttttag 3900 

taatttagat ttttgtatat gcgtatttat aagatttttt ttatattttg taatttgtag 3960 

gtatttagta tggttattga ttttaagtga taaataggta gaagatttgt taggataatt 4020 

ttagttttta ttatgtgtat tttttatttt ttttgttatt tattagtttt ttgtagtaat 4080 

ataattgtta gttatattat ttggaaatta atgtgttttt gtagtgaatt atttattttt 4140 

agtaaattta ttgtattatt attttaaata ttgaaatttg attttattag atgtttagaa 4200 

tgatagttat agagtagaat tagattttaa tttttatgta taatatgtaa ataaattaag 4260 

gtgtagaaag ttattgtttt aaagttatat aattaatagt ttagtggttt ttttagtgut 4320 

aattatagta attttagatt taagattgtt tgatttagga atggagagaa ataaaataaa 4380 

atgaatgttt attgaaatat atattttaga tatt 4414 

<210> 214 
<211> 4001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 214 

ttgatatgga atagagttaa gatttttttt trttttttttg atacggagtt ttattttgtt 60 

ttttaggttg gagtgtagag gcgtaatttt agtttattgt aagttttgtt ttttaggttt 120 

acgttatttt tttgttttag ttttttgagt agttgggatt ataggtattc gttattatat 180 

ttggttaatt ttttgtattt ttagtagaga tagggtttta tcgtgttagt taggatggtt 240 

tcgatttttt gatttcgtga tttgtttgtt tcggtttttt aaagtgttgg aattataggt 300 

gtgagttatc gcgattggtt agatttaaga tttgaattta ggtttttttg gttttagagg 360 

tttttgtttt ttaatttttt aggatggtat agtaatttgt tttataagag gtgtttgttt 420 

taagtgtgtt tagtatatgg aagtaagttt agaaatgtaa gtgtatattt gtaaagaggt 480 

gtgggagatg ggggggaggg aagagagaaa gagatgttgg tgttttttat tttttagttt 540 

ttgataggtg tttttgattt ttttttgatt agtatagttg tatttttggt tggggtattt 600 

taattagaat tgttaaattt agtatataaa aataaggagg tttagttaaa tttgaatttt 660 

agataaataa tgaataattt gttagtataa atatgtttta tgtaatattt tgttgaaatt 720 

aaaaaaaaaa aaaaaagttt tttttttatt uttattttta ttattaggtt taaggaatag 780 

ggttaggggt tttaaataga atgtggttga gaagtggaat taagtaggtt aatagaaggt: 840 

aaggggtaaa gaagaaattt tgaatgtatt gggtgttggg tgttttttta aataagtaag 900 

aagggtgtat tttgaagaat tgagatagaa gtttttttgg gttgggtgta gttgttcgtg 960 

gttgtaattt tagtattttg ggaggttgag gcgggaggat tatttgaggt tgggagttta 1020 

agantagttt tattaacgtg gagaaatttt gtttttatta aaaatataaa aaattagttg 1080 

gttatggtgg tatatgtttg taattttagt tgttcgggag gttgaggtag gagaattatt 1140 

tgaattaggg aggtagaggt tgtggtgagt agagatcgcg ttattgtttt ttagtttggg 1200 

taataagagt aaaagttcgt ttaaaaaaaa aaaaaagttt tttcgatgtg attgtttttt 1260 

tttaaatttg tagatttttt taagattatg ttttttagat attttaaaga ttttagaaga 1320 

tatgtttcgg gggttttgga agttataagg taaatataat atattttttt ttttgattat 1380 

taantttatt agaggatgtg gtgggaaaat tattatttga tattaaaata aataggtttg 1440 

ggatggagta ggatgtaagt tttttaggaa agtttaagat aaaatttgag atttaaaagg 1500 

gtgttaagag tggtagttta gggaatttat ttcggatttc gggggagggg gtagagttat 1560 

tagtttttgt atttagggat ttttcgagga aaagtgtgag aacggttgta ggtaatttag 1620 

gcgtttcggc gttaggaggg acgtatttag gtttgcgcga agagagggag aaagtgaagt 1680 

tgggagttgt tatttttaga tttgttggaa tgtagttgga gggggcgagt tgggagcgcg 1740 

tttgttttta attataggag aaggaggagg tggaggagga gggttgtttg aggaagtata 1800 

agaatgaagt tgtgaagttg agattttttt ttattgggat cggagaaatt aggggagttt 1860 

ttcgggtagt cgcgcgtttt tttttacggg gttttttatt gcgtcgcgcg ttcggttttt 1920 

atttttcgta gtatttcgcg tttcgcgttt ttttagtcgg gtttagtcgg agttatgggg 1980 

tcggagtcgt agtgagtatt atggagttgg cggttttgtg tcgttggggg ttttttttcg 2040 

tttttttgtt tttcggagtc gcgagtattt aaggtgggtt tggtgtgggg aggggacgga 2100 

gtagcggcgg gattttgttt tgtggatgtH ncgtcgaggt ttcgcggtcg gcggggttag 2160 

aggggttcgg acgagttttt ttatttcgaa gttgtggata gtcgagacgt ttagggtagt 2220 

cgggttttgg ggttttcggg cgggaggggg tagttatacg gtagcggttc gagatggttt 2280 

atttaagaga ttggcgtttt ttaggtttcg aggggtttcg ggaatttgtt aaagaagttt 2340 
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tttgaaattg tttagaaagt tttttcgtaa agggtgtatt gcgtagagcg cgcgcgcgcg 2400 

tttttttttt ttttgagttt ttttaagttt ttttaaagtt tttttagttg gtagttttcg 2460 

ttttcggatt ggtttgggtt ggattttttg ggggggtttt ttgttttgtt ttttttttag 2520 

ttttttttcg ttttttttta gacgattttg gtttggttgt ttttgttttt ggcggggtcg 2580 

ggtgtgtgtg tgtgtggtgg agtggagggt ggtatagtaa tttgttttaa ttagagtcgg 2640 

ggaggaaagg gtggttcgga gggtggtttt ttgttggggt ttgggttggg ggcgggggag 2700 

acgtttgttt tgaatagatt tttggggtta gtttagggat tgtgttttgt gatttttgga 2760 

gcgcgtggat tatggagggg tgggggtggg ttttttgggg tgtaaagtgg gagagttttt 2820 

agagaaggaa gttaagaaat aaggttagat gggagtttag ggagggttgc gttgttttgt 2880 

tgtttttttt ttggtgttgt gcgtggggaa gggtgagtgg gggtagtgtg tattttgatt 2940 

tatttgttta tttgtgtgta ttaattataa aagttaatat atagtttggg ttaggtatat 3000 

tttgttagga attgtttgtg gtgttttgta tgtatttttt tttaatttta gaatattttt 3060 

atagtggaag ttttgttagt attttggatt gagtagtagt ttagaggttg agtagtagtt 3120 

agtaagtggt ggggttaaga tgggatttta ggtagtgcga ttrttaatta tgtattcgaa 3180 

atcgttatat ggatgagtgt atttggagta atgagggata ttgttttttg agttattggg 3240 

ttgtagggga gataaaatga aagtgttttg ggagtcgtgg gtggttttta taggttagag 3300 

ggtttgggga gggagtgggt gttatcgtgg ttgtgtgttg ttcgaggggt tttttgtgag 3360 

tgagtgtatg gtcgtgttat ttttgtaggt ttacgttagg gtgtttttta gttgtgtggt 3420 

ttttgtattt gtgtgtttgg gttttgtgtt gttaaatagt agtttttttg ttgatttggg 3480 

gatataggtt gaattttgtt ttttgtagga atttttttaa ggtgttgggt tagatttgtt 3540 

ataaatagag ggaggtagtt ttttatggtt acgttttttt gttgaggaag aaggtttttt 3600 

ttttttaggg agtatatttt tgtttttttt gttttttaga taagtatttt tattttttat 3660 

tttttgatga gaagggtgag gttatattga gttgttaggt tgagttgttg ttttttttta 3720 

ttttgggttg ggagttgatt agggaatggt agttgttgta gagttggatt tgagggttgg 3780 

gttttttgga tggggttttt ttatgttttt attttttaat ttgtattatt gattgtgttg 3840 

tgtaggagtt agttaaaaag ttattgtata gtttgggtaa taaggtaaaa ttttgtataa 3900 

aaaatataaa aattagttgg atgtgattat acgtgtttgt agttttagtt atttcggagg 3960 

ttgaggtagg aggattattt gagtttagga agttgaggtt t 4001 

<210> 215 
<211> 4001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 215 

aagttttaat tttttgggtt taggtgattt ttttgtttta gttttcggag tagttgggat 60 

tataggtacg tgtaattata tttaattaat ttttgtattt tttgtataga gttttgtttt 120 

gttgtttagg ttgtgtaatg attttttaat taatttttgt ataatataat taatagtgta 180 

ggttgagggg tgaggatatg aggaggtttt atttagagaa tttagttttt aaatttagtt 240 

ttgtagtaat tgttattttt tgattaattt ttagtttaag gtgaggaagg gtagtagttt 300 

agtttgatag tttagtatgg ttttattttt tttattagaa gatgagaggt gaagatgttt 360 

gtttgggaaa tagggagggt aaggatgtat tttttggaag agaggaattt ttttttttag 420 

taagaaggcg tggttataga aggttatttt tttttgttta tggtagattt ggtttagtat 480 

tttaagggag tttttgtaga ggatagagtt tagtttgtgt ttttaagtta gtagagagat 540 

tgttgtttgg taatataaag tttagatata taaatataaa gattatataa ttgaggaata 600 

ttttggcgta gatttgtaga gataatacgg ttatgtattt atttatagag ggtttttcgg 660 

gtaatatata gttacgatga tatttatttt ttttttagat tttttgattt atggagatta 720 

tttacgattt ttagaatatt tttattttgt tttttttgta gtttagtgat ttagggaqta 780 

gtgtttttta ttgttttagg tgtatttatt tatatagcga tttcgaatgt atggttgggg 840 

gtcgtattgt ttggggtttt attttgattt tattatttat tagttgttat ttaatttttg 900 

gattgttatt tagtttagaa tgttggtaga atttttatta taggggtgtt ttaggattaa 960 

aggagaatgt atgtaaaata ttataaatag tttttggtag agtatatttg gtttaggtta 1020 

tatgttagtt tttatgatta atgtatatag gtggatagat gggttaggat atatattgtt 1080 

tttatttatt tttttttacg tatagtatta aggaaaaggt agtagaataa cgtagttttt 1140 

tttaggtttt tatttggttt tattttttag tttttttttt tgggaatttt tttattttat 1200 

attttaagaa atttattttt atttttttat ggtttacgcg ttttaaaagt tatagagtat 1260 

agtttttaag ttggttttaa gaatttgttt aaagtaaacg tttttttcgt ttttaattta 1320 

gattttagta agaggttatt tttcgggtta tttttttttt ttcggttttg gttgggatag 1380 

gttgttatgt tattttttat tttattatat atatatatat tcgatttcgt tagaagtagg 1440 

agtaattaaa ttaaaatcgt ttgaagggga gcggggaggg gttggaggag gggtagggta 1500 

gaggattttt ttaaggaatt tagtttaggt tagttcggag gcggaggttg ttaattggaa 1560 

aggttttgag aaagtttgag ggggtttaag aaggggggaa acgcgcgcgc gcgttttacg 1620 

taatatattt tttgcgggaa aattttttga ataattttag agaatttttt tgataagttt 1680 

tcggagtttt tcggagtttg gaaagcgtta gttttttgga tgggttattt cgagtcgttg 1740 
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tcgtgtaatt gttttttttc gttcgagggt tttagggttc ggttgttttg agcgtttcga 1800 

ttgtttataa tttcgggata ggagagttcg ttcgggtttt tttggtttcg tcggtcgcgg 1860 

gatttcggcg gggtatttat agggtagggt ttcgtcgttg tttcgttttt tttttatatt 1920 

agatttattt tgggtgttcg cggtttcggg gggtaagagg gcgaggagga gtttttagcg 1980 

gtataaggtc gttagtttta tggtgtttat tgcggtttcg gttttatggt ttcggttgga 2040 

ttcggttggg agggcgcggg gcgcggggtg ttgcgagggg tgggggtcgg gcgcgcggcg 2100 

tagraaaggg ttrcgtggga aggggcgcgc ggttgttcgg ggggtttttt tggttttttc 2160 

ggttttaatg gaggggaatt ttagttttat aattttattt ttatattttt ttaagtagtt 2220 

tttttttttt attttttttt ttttttgtga ttgggagtaa gcgcgttttt agttcgtttt 2280 

ttttaattgt attttaataa gtttgggagt ggtaattttt agttttattt tttttttttt 2340 

ttcgcgtagg tttgggtgcg ttttttttag cgtcgggacg tttgggttgt ttgtagtcgt 2400 

ttttatattt tttttcggag aatttttaaa tgtagaggtt ggtgattttg tttttttttt 2460 

cggagttcgg gataaatttt ttaggttgtt atttttaata tttttttaag ttttaggttt 2520 

tattttaaat ttttttgggg agtttgtatt ttattttatt ttaagtttat ttgttttaat 2580 

attaaataat ggttttttta ttatattttt tagtaaaatt gatagttaag gagggggatg 2640 

tgttgtgttt attttgtggt ttttaggatt ttcggggtat attttttgga atttttgaag 2700 

tatttgaaaa gtatgatttt aagagggttt ataaatttgg gaggagatag ttatatcgaa 2760 

aggatttttt ttttttttta aacgaatttt tgtttttgtt gtttaggttg gagagtaatg 2820 

gcgcgatttt tgtttattat aatttttgtt tttttggttt aagtgatttt tttgttttag 2880 

tttttcgagt agttgggatt ataggtatgt gttattatga ttagttaatt ttttgtattt 2940 

ttagtaaaga tagggttttt ttacgttggt gaggttggtt ttgaattttt aattttaggt 3000 

gattttttcg ttttagtttt ttaaagtgtt ggaattataa ttacgagtaa ttgtatttag 3060 

tttaaaaaga tttttatttt aattttttaa aatgtatttt ttttgtttat ttaaggaggt 3120 

atttagtatt taatgtattt aaggtttttt ttttgttttt tgttttttat tagtttgttt 3180 

aattttattt tttaattata ttttatttgg agtttttgat tttatttttt aggtttagtg 3240 

gtaggggtgg ggatggaagg aagatttttt tttttttttt taattttaat aagatattgt 3300 

atgggatata tttatattaa taaattattt attgtttatt tgaaatttaa atttaattgg 3360 

gtttttttat ttttatgtgt taaatttggt agttttagtt ggaatgtttt agttaagaat 3420 

gtagttatat tggttaagaa gggattaaag gtatttatta gggattggag aatgaaggat 3480 

attagtattt tttttttttt tttttttttt ttatttttta tattttttta taggtatata 3540 

tttgtatttt taaatttgtt tttatgtgtt gagtatattt aaagtaggta ttttttgtgg 3600 

gataggttgt tatgttattt tagggagttg agaaataggg gtttttggga ttaagaggat 3660 

ttgggtttaa attttgaatt tggttagtcg cggtggttta tatttgtaat tttagtattt 3720 

tgggaggtcg aggtaggtag attacgaggt taggagatcg agattatttt ggttaatacg 3780 

gtgaaatttt gtttttgtta aaaatataaa aaattagtta ggtgtggtgg cgggtgtttg 3840 

tagttttagt tatttaggag gttgaggtag gagaatggcg tgaatttggg aggtagagtt 3900 

tgtagtgagt tgagattgcg tttttgtatt ttagtttggg agatagagtg agatttcgtg 3960 

ttaaaaaaaa aaaaaaaaat tttgatttta ttttatatta g~ ' 4001 

<210> 216 
<211> 4334 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 216 

aaattaatta cgaaaaaaat tattaaaatt tattaaataa atattgggaa atatgatttg 60 

aatatgatat tggttttaga aaattaagtt aattgagttt tttcggttat aaaatttttg 120 

gatttttaaa attagataaa gttataaggg tgaagaattt tttttttata aattttttaa 180 

tattttatcg aaattgtaag aagatatttt agatttaatt taatagtgtt ttcggagtta 240 

ttttaatatt gtttataatt tattgtgtaa atgaaatatt tatttttgtt ttgtttgtgt 300 

ttgacgtggt ggataagaaa acgtttaagt taatagtaat taataatgtt gaatttttat 360 

tgttttttag gtttaagttt ttttgtttaa tagtaagtaa aatgtttatg aatttttaga 420 

ttttgattta tatttgttgt cgttatgtta tgaaaaatgt tgtatttttt gttgattttt 480 

ttaatattat gatttttgta gacgttagtt agtattaaaa acgtaggatg ataggttttt 540 

aaaagttagg aattaaaatt agaggtagcg ataagattta agaaaagtag agaattagac 600 

gtagtatcgg atttaggatg ggcgtggttt tggttcgggc gaggaaaaga ggatcggtcg 660 

ttttcggtta attagagtac gttttgcgtt tcggtttttg gtttcgtttt tttttttttc 720 

gcgttttttc gaattttttc gaatttaata tttcggagtt cgcgcgcgtg ggaaggggag 780 

gggtgggcgg ggttaacggt ttaattcggg taatttcgtt tttgtttgat tttttttgcg 840 

cggacgtttt ttttaatttt cgttaaaaat atttttaggt cgtcgttatt atttttttta 900 

tagtttagcg ggtggggtag gtttcgggaa taggttttcg ggtggtttcg cggtgagttc 960 

gttatcgttt ttcgtcgttt ttatgtttaa tttgtcgtta tttacgttgt tcgcgcgttc 1020 

ggtcgcgaag atagaagaat tttcgtgggg ttcgggagcg tcgtttttta attattagtt 1080 

gcggtcgtta ggggacggtc gtagggaggg tcgcggtttt ggttgttcgt agtttggttt 1140 
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taaagtcgtt ggtttttggg gtatcgtttc gtttttttag ataatgggga attcggcggg 1200 

gttcgtaggg aagggaggga gggagggagg taaacgaggg tttacgtttt ttattgtttt 1260 

ttcgacggta tttcggagcg gacgttcggt gatcgtttag ttttttgttt acgaatagtc 1320 

gcgttttcgg agagcgggga taaaggcgag gtttcgttcg agttatatat aaagcggagc 1380 

gattttcgtt tattaatttt ttcgaggttc gtagtttcgt tcgtcgtttt gtagtagggg 1440 

gtcgttgtta cggtttagtt tttttcgtcg atcgcgtttt ttttgttggt tttagttagt 1500 

cgtcgttttt ttcgagtttc gtatttattt gtttagttcg agtcgtttga ttatattttg 1560 

agtattaata gattcgggcg cgtataaaag cgttaaatag aggtacgtga ttttcgcgcg 1620 

gggtttcgta ttggtcgaga gcggcggtac gtgtcgtttt cgttttttat aattcgcgtg 1680 

gagcgtcgtt tttcgggatt tcgaagtatt tcgggaagtg tagttttgtt tcggagggta 1740 

ggaggataat gtcgggggtt gtagggcgtt tggggagtat ttttaggtgt gggtttggga 1800 

ttaggggggt ggtttcgtcg tggaagaggg cgtggtagtg gatttcggat taggttcgaa 1860 

ttagtttttt attttcgaga ataagggtag ggcggagtag gcggagggcg ggggggtttt 1920 

tttttatttt tttaaatttt ggtcgtttgg attttatttt gtagagttat atgttcgttt 1980 

cgttttaagg gtagtattcg gatagagtag tgtttttttt agtgcgttat ttatatgggt 2040 

gtttaagacg ttcgagtttt tgtgatagta taatgttatt ttacgttttt gtagggtttt 2100 

ttgttttcgg tattttatat cgatatcggc gtgttattta tattgtgata ttatcgtaat 2160 

ttttaattta tttagtttat aggtcgcggg cgtggttttt tttttattgt gtgcgttttt 2220 

gttagtattt ttaggttttc gagtcggtga ttagggattt gtgcgggaaa gcgtggggtt 2280 

ttggaatttg atcgtttttg gtttaataaa gttatttagt ttaaatgtag attttatttg 2340 

tttttttaag ttttttgcgg atgatagggg aggaagagta atagtcgttt tagattatat 2400 

agattgtagt taagtgttgt ttttgttttt agagtacgtg ttttttagag tagtattggg 2460 

ttttttttaa aaatcgaaaa aggagttata aggttattgt tttttttgat ttggatagtt 2520 

tagaatttta gtaaaggtat aaaggaaatt ggtttgatcg tgttttagaa taattggttt 2580 

ttttttttat aaaaagggga aaaaaataaa tgaaatgaat atggttttcg ttttttttta 2640 

ttttttgatt aagatcgtgt tttcggttgg tggttaatat attaagatat ttgttttgag 2700 

attttgatta agtagaagag atatttttat tttattaatg agttgtattt ttttaaatta 2760 

agaaataaat gtgattaatt tttttgaaat aatagtttta atttagaaag atttttatgt 2820 

ttagttagtt attagggatg tgttagagat aaaatataat taaggggtgt ttaagataat 2880 

aaaaattgga ttatataagt agtagtatat tatataaagt aaataattag taatgcgttg 2940 

ttgtttgaaa tatgtttgtt gcggtatttt ttttaataat cgtttttgtg tggggtcggt 3000 

ttttttttta agttaatcga agtggttttt atttagtttg tgagtaggat taggaataat 3060 

ttagatattg agttttgaaa ggtttttttt tagattaaat ttagttttaa aatggttagg 3120 

tgttttcgtt tgttattttt ttgaatagaa ttttagatat tattagaaaa gttggagaag 3180 

gatgggtatg agatttttta ggaaagcgat agataggtag gtaatataaa tgagtaagga 3240 

atttaagttt aagaggtttt tatttattga ataggttttt tattttgaag tttttgtatt 3300 

tgatttggag gtttatagag gaaggtttta tttatttaat gtattttttt aagtataata 3360 

aattgttata taaaagtttt atatttttga ttattaatgt tgtgatagaa aaaagaaata 3420 

aattttttaa aatattgata atttgaaagt atttatattg ttttttttta tatttttttt 3480 

ttttttagag tagatgttta tttttatgga aattatagta aggaacgtag atgttagaat 3540 

ttatgtttat ttttaatttt ttttagtaat gttatttggt ttttgggttt gttgggagat 3600 

gtttggtttt gttagtttgt gataatgttt taagtttttt atagttgttt gaggattgag 3660 

agggttgggt taaagttttt tttagaatga gtttttgaat aaaaaggtgt ttttgaggtg 3720 

ggatttttgt ttttttatta ttattattat tattattatt attattatta ttattattat 3780 

tatttgattt aaaaaaaatt gagatagggt tttattatgt tgtttaggtt ggtttcgaat 3840 

ttttgggttt aagtaatttt tttgttttag ttttttaaag tgttgggatt ataggtatga 3900 

gttattatat ttagtttgat ttttgttttt tagggtgggg ttgttttttt aagagttaga 3960 

ttatagttta ttattgttag tttttgtggg tgttttatat aagttttgga ataggaaggg 4020 

tttttaattg taaggagaga taatagtttt gttattttat tttggagagg ggtagaatcg 4080 

tttttttcga aagtttttta aaacgaattt taagatttat ttttttttga agtggtagta 4140 

aggagttaat gtttatattt tggaattgtt tatttttgtt tataagaatt atttttgttt 4200 

ttgtttagat ggtttggttg gattgttatg gagttggttg tgattatgga gttatataac 4260 

gaagtggaga tagggtagtt atttagttta tttgatttgt gtagatagtt aaaggagagt 4320 
attttaggtt ttaa 



<210> 217 
<211> 4334 
<212> DNA 

<213> Artificial Sequence < 

<220>, 

k:223> chemically treated genomic DNA (Homo sapiens) 



<400> 217 



4334 



« 

ttgaagtttg aagtgttttt ttttagttgt ttatatagat taaataagtt ggatgattgt 60 

tttattttta tttcgttgtg tggttttata gttatagtta gttttataat aatttaatta 120 

aatt f atttga gtagaajgtaa aaataatttt tatgggtaaa gatagatagt tttaagatgt 180 
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gaatattaat tttttgttgt tattttagaa gagagtggat tttgaagttc gttttaggaa 240 

gttttcgaga agggcgattt tgtttttttt taaggtgaag tggtaaaatt gttgtttttt 300 

tttatagttg aaagtttttt ttattttaag gtttatatag gatatttata aaagttgata 360 

ataatgagtt gtgatttggt ttttgggaag gtagttttat tttaaagaat aaggattaga 420 

ttgggtgtgg tggtttatgt ttgtaatttt agtattttgg gaggttgaga taagaggatt 480 

gtttgaattt aggagttcga ggttaatttg ggtaatatag tgagattttg ttttaatttt 540 

ttttaagtta aataataata ataataataa taataataat aataataata ataataataa 600 

aaggatagga attttatttt aaaagtattt ttttatttaa aggtttattt taggggaaat 660 

tttaatttag tttttttagt ttttaagtag ttgtgaggag tttggaatat tgttatagat 720 

tggtagaatt aggtattttt taataaattt agaagttaag tggtattgtt aaaaaagatt 780 

aaaaatgaat atgagtttta atatttacgt tttttgttat ggtttttatg gaaatagata 840 

tttgttttag ggagaaagaa aatgtggggg gaagtagtgt aagtgttttt aaattattag 900 

tgttttgaaa gatttatttt ttttttttgt tataatatta atagttagga atgtgagatt 960 

tttgtgtggt aatttattgt atttaaaaaa atgtattaaa tgagtgggat ttttttttgt 1020 

gaatttttag attaaatata aagattttaa gataaaaagt ttgtttagtg ggtaaggatt 1080 

ttttgggttt gggttttttg tttatttgtg ttgtttgttt atttgtcgtt tttttgaggg 1140 

gttttatgtt tatttttttt taattttttt aatgatgttt gagattttgt ttagaaaaat 1200 

gatagacgga agtatttagt tattttgaag ttaagtttgg tttgagaaga agttttttaa 1260 

ggtttagtat ttgaattatt tttggttttg tttataggtt aagtgaaaat tatttcgatt 1320 

ggtttggggg ggaggtcgat tttatataag ggcgattgtt ggaaaaaatg tcgtaataag 1380 

tatattttag gtaatagcgt attgttaatt gtttgttttg tgtagtgtgt tgttgtttgt 1440 

atgatttagt ttttattgtt ttgggtattt tttagttgta ttttattttt ggtatatttt 1500 

tggtgattaa ttaggtatga gagttttttl: aaattaggat tgttgtttta ggaaggttgg 1560 

ttatatttgt tttttgattt ggggagatgt agtttattaa taaaataaag atatttnttt 1620 

tgtttagtta agattttaaa gtaggtgttt tggtgtattg attattaatc gaggatacgg 1680 

ttttggttag agaatgggag gaggcgggga ttatgtttat tttatttatt tttttttttt 1740 

ttttgtgaga gaaagggtta gttatttrtaa ggtacggtta gattaatttt ttttgtgttt 1800 

ttgttgaggt tttgagttat ttaggttagg gagaatagtg attttgtggt ttttttttcg 1860 

atttttaaaa agaatttagt gttgttttgg ggagtacgtg ttttgagaat aaaagtaata 1920 

tttaattgta atttgtgtag tttagaacgg ttattgtttt tttttttrta ttattcgtag 1980 

aaggtttgga agaatagatg gagtttgtat ttaagttgag tggttttgtt agattaaagg 2040 

cgattaaatt ttagagtttt acgttttttc gtataggttt ttggttatcg gttcgggaat 2100 

ttgaggatgt tggtaagaac gtatanagtg agggaagggt tacgttcgcg atttgtgggt 2160 

tgagtggatt agaaattacg atgatgttat aatatgggtg atacgtcggt gtcggtgtag 2220 

ggtgtcgggg gtagggggtt ttgtaggggc gtggagtggt attgtgttgt tataggggtt 2280 

cgggcgtttt aggtatttat gtgggtggcg tattagaagg ggtattgttt tgttcgagtg 2340 

ttgtttttgg ggcgaggcgg gtatgtggtt ttataaggtg gagtttaggc ggttaaagtt 2400 

tggaaaggta gggaaggatt ttttcgttztt tcgtttgttt cgttttgttt ttgtftrcga 2460 

gaatggggag ttggttcgga tttagttcgg ggtttattgt tacgtttttt tttacggcga 2520 

gattattttt ttagttttag gtttatattt ggggatgttt tttaggcgtn ttgtaatttt 2580 

cggtattgtt tttttgtttt tcgggatagg attatatttt tcgaggtgtt tcggggtttc 2640 

ggggggcggc gttttacgcg ggttgtgggg ggcgggggcg gtacgtgtcg tcgttttcgg 2700 

ttaatgcgga gtttcgcgcg gaggttacgt gtttttgttt ggcgtttttg tgcgcgttcg 2760 

ggtttgttgg tgtttagagt gtggttaggc ggttcggatt gagtaggtgg gtgcggggtt 2820 

cggaggaggc ggcggttggt tgaggttagt aagagggacg cggtcggcgg gaggggttgg 2880 

gtcgtggtag cgattttttg ttgtagggcg gcgggcgggg ttgcgggttt cggaggggtt 2940 

ggtgggcggg ggtcgtttcg ttttgtgtgt ggttcgggcg gagtttcgtt tttgttttcg 3000 

tttttcgggg gcgcggttgt tcgtgggtag ggggttgggc gattatcggg cgttcgtttc 3060 

ggggtgtcgt cgaggagata atagggggcg tgggttttcg tttatttttt tttttttttt 3120 

ttttttttgc gggtttcgtc gggtttttta ttgtttgaag ggacggggcg gtgttttagg 3180 

gattagcggt tttaggatta aattgcgggt agttagggtc gcgatttttt ttgcgatcgt 3240 

tttttggcga tcgtagttgg tgattgaggg gcggcgtttt cgggttttac gagggttttt 3300 

ttgttttcgc ggtcggacgc gcggatagcg tgggtggcgg taggttgggt atggggacgg 3360 

cgggaggcgg tggcgagttt atcgcgggat tattcggggg tttgttttcg gggtttgttt 3420 

tattcgttga attgtgaagg gggtggtggc ggcggtttgg aggtgttttt ggcgggagtt 3480 

ggggggggcg ttcgcgtagg gggagttagg taggggcgga gttattcgga ttggatcgtt 3540 

agtttcgttt attttttttt tttttacgcg cgcgggtttc ggggtgttga gttcggggag 3600 

attcgaaaag gcgcggggag gaagggggcg gggttagggg tcggagcgta aggcgtgttt 3660 

tgattggtcg ggggcgatcg gttttttttt tttcgttcgg attagggtta cgtttatttt 3720 

gggttcggtg ttgcgtttaa ttttttgttt tttttaaatt ttgtcgttgt ttttgatttt 3780 

aatttttagt ttttgggaat ttgttatttt acgtttttgg tattagttgg cgtttataaa 3840 

agttataatg ttaaaaagat taataagaga tatagtattt tttatgatat aacggtagta 3900 

aatataagtt aaaatttaga ggtttataaa tattttgttt gttgttgggt aaggaagttt 3960 

aaatttgagg gataatagga gtttaatatt attggttatt attagtttgg gcgttttttt 4020 

atttattacg ttagatatag ataaagtagg ggtgggtatt ttatttgtat aatgagttgt 4080 

aggtagtatt aagatggttt cgggggtatt gttgagttga atttggaata tttttttata 4140 

gtttcggtga aatgttaaag agtttatggg ggaaaaattt tttatttttg tgattttgtt 4200 

tgattttaaaaatttaagag ttttatgatc gagaaagttt agttaatttg attttttgga^ 4260 

" Seite 116 



seq_prot.txt 

attaatatta tatttaggtt atatttttta atgtttattt agtagatttt gataattttt 4320 

ttcgtggtta attt 4334 

<210> 218 
<211> 4528 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 218 

cgcgtagatt tgttaggaag agtataagaa gaaatattcg gatttttttg ttaatttcgt 60 

ggaatttttt aagaagtgtt tggagagatg gaagattacg tttgtaaagg agaagtgaag 120 

tttgaagaga aggtaaaaag tgataaagtt cgttgtgata gggagattaa aaattatatt 180 

ttttcgaaat gtaagaaagg gtaagaaagg aaagaaaaag gatcgtaatg tttttagaag 240 

gttattattt gttttttttt tgttttgttt tgaatatcgt ttaaagatta aaagtggata 300 

tttaggttta tttgtcgtgg aaattgtaaa gaaattgggt gaaatgtggt ttgggtagtt 360 

agttaaagat aaataattat atgagtagaa agtagttaag ttataggaga gatatgaaaa 420 

gggtattgtt gtatatcgtg ttaagggtaa aagtgaagta ggaaagaagg gtttaaagaa 480 

gaataaatta gaagatgagg aggaggagga ggagaaagaa gatgaagatg aggaggaaga 540 

gggtgaagat gaagaataaa tggttatttt ttaatgatgt ttgtgtagtg ggtttgtttt 600 

gttaagaatg tgaattttag tatagtttag tattagtttt agtataaaat tgtataaatt 660 

ttcgtatagt ttataagatt ttttgtatag aaaatatttt tttttttttt tttttttttt 720 

gagatagagt ttcgtttttg ttgtttaggt tggagtgtaa tggcgtgatt tcggtttatc 780 

gtaattttcg tttttcgggt tttggtttaa gtagtttttt tgttttagtt ttttgagtag 840 

ttgggattat aggtatatgt tattacgttt agttaatttt tgtattttta gtagagatqg 900 

ggttttatta tgttggttag gttgttttta aatttttgat ttcgtgattc gtttgtttcg 960 

gttttttaaa atattgggat tataggtgtg agttatcgta ttttgtttaa tgtttttaaa 1020 

tatttaatgg tttttaaaaa atttattgtg tatggtagta tagtatattt gtaggaatta 1080 

gtattaatag tatattttgc gttttttaag atgttgtatt ttttaatatt ttgtaataaa 1140 

attatgcgta ttaaaaaaat aaagaaattt cgtgtgtagt ttatatttat agtatatttt 1200 

cgtttaggta tttgagagaa tgattaggag gggtttttgg aggaggtggt ttttgaacgg 1260 

agaatttatt tttaaggatt ttgtttgtaa tggttattaa gtattttttg agttattttt 1320 

atgtgttttg tagttttttg aaggggcgtg ggatttatcg atgttaatta tttagtatta 1380 

tttttagatt ttaagaagtt ggggtgtgag ttagtaatta gtatagaaaa gagatattaa 1440 

aataagtttg agttggggag tgttttttta attttagttt tttggaagag attttttttt 1500 

t HJ t i tgag a *agagtttc gtttttattg tttaagttgg agtgtagtgg tacgatttcg 1560 

gtttatcgta attttttttt ttcgggttta agcgattttt tttttttagt tttttgagta 1620 

gttgggatta tagatatgta tttgtaattt ttattaaaaa tataaaaatt agtcgggcgt 1680 

ggtggcgtac gtttgtaatt ttagttattg gggaggrtga ggtaggagaa tcgtttgaaa 1740 

ttaggaggcg gagattgtat taagatagtt tgttttagtt aaataatttg gcgttagtgt 1800 

aggaaaaggt ggaaggtacg gggttagtat aggagggttt aatattttta attttattaa 1860 

gttatatttt ggtaattttt gttttttacg agaagttttc gttgggtttg ttttagcgtt 1920 

gttttgaggt tttttttatg agtttcgata gggtagaggt cgttttgagc gttttttttt 1980 

tttttggttt aagagtggtt taaaagaagg atttttgant ggaattggtt attttgtgtt 2040 

SSJHSJ 5H5H att i£ gttttaaagg gggatgcggg ggaggggttt tggtaggggt 2100 

ggtttcgttt tttttaggtt cgtaagttta ggttttcgtt tatcgggttt agtttatttt 2160 

gcggtcgttt agggaggtcg ttggtattcg tgatttacga tttttttttc gagttttatc 2220 

gaggttatag tcgtggtttc gtttttttat gtttgttttt cgttttttgt tcgtgacggg 2280 

cgttttcgag gattaatgag cgcgttgtat ttatttttcg ggcggggtta agcgtcgatt 2340 

aatcgtcgtt cgggcgttcg gtcgggttta aacgttttaa tcgttagcgg cggcggggcg 2400 

ggtagagggt cggggatggt aggtttaatt aacgggtggg tacgtcgttt tcgcgaggag 2460 

SS£5EH t9 cggtcgggcg tgcggtgttc gcggcggcgt agggaggggg agggaggtaa 2520 

ataagatggc ggcggcgtgt cgggcgcgga agggggaggc ggttcggggc gttcgcgagt 2580 

fl 522S!3!22 g 5 ggcgaagg gagcgcgggt ggcggtattt gttgtcgcgg ttttggatgg 2640 

gttgggtttt tttcgtcgtt tcgttttttt tatacgcgcg gcggtcgcgg cgagggggiE 2700 

gcgtcgttcg gggttcggta ttttcgggaa tttttcggtt cggagtttgc ggtttgcgtc 2760 

gtttcggtcg tcgggagttt cgtggagttt tcgtcgtcgc gtcgtttcgc ggatcggacg 2820 

ttgagggtat tcggggcggg gcgcgcgttc gggtagacgt ttgcggggag gggggcgttt 2880 

gtcgggtttc ggcgattatt ttgggggtcg cgggtcggtt cggggggcgt ttagtgcggg 2940 

ttttcgcggg cgtcgggtag cgattagttt tgagcggagt tgttggtcgc ggcgggaggt 3000 

ttttcggacg tttttagttt ttcgaacgtt cgttcgggtic ggcgggagtc ggcgtttttc 3060 

gggaggttcg ttcggtcgtt cgcggcggag cgtttgtttt tgggataggc ggtgqgatcq 3120 

gggcgtcgtc ggagacgttt ttagcgaagt tgggttttbt aggtgtgggg gtttcggggg 3180 

gtagcgacgt cgcggattcg g.tttgtggga tgggcggttc ggagaagatt gcgttcggtc 3240 

gtgtttatat ttgttcgtgg gttttgaggt* ttcggaggat gatttagtat tgaaaagttt 3300 
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cggtcggttt ttttagggtt ttcgaggacg aagttgattt tgatcgggtc gttttttagt 3360 

tttgaggttc gggttttatt ggaattcgcg tttgagtcgt cgtttcggat tttcggtgtt 3420 

cgtcggttcg tagattttgt atcgggtttg gattcgtagt cgggattgac gtgtagaata 3480 

atcgtttttg ttggaagaag ggtttttttt ttttttttgg ggtttttgtt gttttttttt 3540 

tttttttttt ttttgtaaaa ttttggagaa gggaagtcgg aatataagga aggatcgttt 3600 

attcgcggat ttagggttgg cggcgggatit ttaggatttt gggtttagta tggaggtggt 3660 

ggattcgtag tagttgggta tgtttacgga gggcgagttg atgtcggtgg gtatggatac 3720 

gtttatttat cgtatcgatt ttatcgaggt tatttattag tcgcgtcgta agcgggttaa 37S0 

gtttatcggt aagtatttga tgggggattt gttgggggaa ggtttttacg gtaaggtgaa 3840 

ggaggtgttg gattcggaga cgttgtgtag gagggtcgtt aagattttta agaagaagaa 3900 

gttgcgaagg atttttaacg gggaggttaa cgtgaagaag taagtatggt ttgttggggt 3960 

cggggtcggg tcgggttagt tacggtgttg atggttttgt tttttttttt tttttttttt 4020 

tttttttttt tatttttttt taatattttg agttggattc gtttggcgtt tgtgtttttc 4080 

gtgttaggga gagcgtggtt gggggtttgc gttacggatt tttatttagg taaggttagt 4140 

tgtcgtagcg gggcgtgcgt ttgtatgggt ttttggattt tagttaaaat gttttggtag 4200 

cgaaattttt ttgagaaggg agcggttttt aattttttaa gattagtttt ttggtttttt 4260 

tagttgttta aggagtagag gcgtttagtg gaattagttt gtgtttgttt gggtttcgag 4320 

agtttgtgtg cggtcgttaa tacgtttttt gcgtagtgtg tggtcgttat cggggttagg 4380 

cgaaatgtga tttgtttatt ttgttagagg ggaattttgg gttgttaaaa ataattgttt 4440 

gtatcggttt attagttagt aggagggaaa cgtagttttt ttttatttgt tagggatgtg 4500 

acgttggaag tatttttggt tttcgggg 4528 

<210> 219 
<211> 4528 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 219 

tttcgggggt tagggatgtt tttagcgtta tatttttggt aaatgaggaa aggttacgtt 60 

ttttttttgt tgattgataa gtcggtgtaa atagttattt ttggtagttt agggtttttt 120 

tttgatagga taaataaatt atatttcgtt tggtttcggt aacggttata tattgcgtag 180 

aaaacgtgtt ggcggtcgta tataaatttt cggggtttaa ataaatatag gttgatttta 240 

ttgggcgttt ttgttttttg gatagttggg ggagttaagg ggttagtttt aggggattgg 300 

gggtcgtttt ttttttagga gggtttcgtt attagggtat tttaattgga gtttaagagt 360 

ttatgtaaac gtacgtttcg ttgcgataat tggttttgtt tgagtgaaag ttcgtaacgt 420 

aggtttttaa ttacgttttt tttggtacgg aggatatagg cgttagacgg gtttagttta 480 

gggtgttaag aggaagtaag ggagggaggg aggagagaag gaaggaagat agaattatta 540 

gtatcgtgat tggttcggtt cggtttcgat tttagtaagt tatatttatt tttttacgtt 600 

ggttttttcg ttggggattt ttcgtaattt tttttttttg aggattttga cggttttttt 660 

gtatagcgtt ttcgagttta gtattttttt tattttgtcg taagagtttt tttttagtag 720 

gttttttatt aggtatttgt cgatgagttt ggttcgtttg cggcgcggtt ggtagatgat 780 

ttcggtggag tcgatgcggt ggatgaacgt gtttatattt atcgatatta gttcgttttt 840 

cgtgaatatg tttagttgtt gcgggtttat tatttttatg ttggatttag ggttttggag 900 

tttcgtcgtt agttttgagt tcgcgggtga gcggtttttt tttgtgtttc gatttttttt 960 

ttttaaaatt ttataaagaa aaaagaaaaa aaaaaggtaa taaaaatttt aaaaggaagg 1020 

gaaaaatttt ttttttaata gaaacgattg ttttatacgt tagtttcggt tgcgagttta 1080 

agttcggtgt agggtttgcg gatcggcggg tatcgggggt tcgggacggc ggtttagacg 1140 

cgagttttag tgggattcgg gttttagaat tgggagacgg ttcggttagg gttaatttcg 1200 

ttttcgggga ttttggggag gtcggtcggg gttttttagt gttaggttat ttttcgggga 1260 

ttttaggttt acggataagt atgaatacgg tcgagcgtag tttttttcgg gtcgtttatt 1320 

ttataggtcg ggttcgcgac gtcgttattt ttcgggattt ttatatttgg agagtttaat 1380 

ttcgttgggg gcgttttcgg cgacgtttcg gttttatcgt ttgttttagg agtaaacgtt 1440 

tcgtcgcgga cgatcgagcg gatttttcgg ggggcgtcga ttttcgtcgg ttcgggcgag 1500 

cgttcggggg gttgggggcg ttcgggaggt ttttcgtcgc ggttaatagt ttcgtttagg 1560 

gttggtcgtt gttcggcgtt cgcgagggtt cgtattgggc gtttttcgag tcggttcgcg 1620 

atttttaagg tggtcgtcgg ggttcggtag gcgttttttt tttcgtaaac gtttgttcga 1680 

gcgcgcgttt cgtttcgagt gtttttagcg ttcggttcgc ggggcggcgc ggcggcgggg 1740 

gttttacggg gttttcggcg gtcgaggcgg cgtaggtcgt aggtttcggg tcggggggtt 1800 

ttcgaaggtg tcgggtttcg ggcggcgcgt ttttttcgtc gcggtcgtcg cgcgtgtgga 1860 

ggaggcggag cggcgagggg ggtttagttt atttaaggtc gcggtagtaa gtgtcgttat 1920 

tcgcgttttt ttcgtcgttt cgcgttttat tcgcgggcgt ttcgggtcgt tttttttttt 1980 

cgcgttcgat acgtcgtcgt tattttgttt attttttttt ttttttttgc gtcgtcgcgg 2040 

atatcgtacg ttcggtcgta gggtacgttt tttcgcgagg acgacgtgtt tattcgttgg 2100 

ttgaatttgt tattttcggt tttttgttcg tttcgtcgtc gttgacgatt ggagcgtttg 2160 



seq_prot . txt 

gattcggtcg ggcgttcgag cggcgattgg tcggcgtttg gtttcgttcg aggggtggat 2220 

atagcgcgtt tattggtttt cggagacgtt cgttacgggt agggggcggg aagtaggtat 2280 

ggggagacga ggttacggtt gtgatttcgg tggggttcgg gaagggggtc gtaggttacg 2340 

ggtgttaacg gtttttttga acggtcgtag ggtgggttga gttcggtggg cgggaatttg 2400 

ggtttgcgga tttggaagga gcggggttat ttttattaga gttttttttt cgtatttttt 2460 

tttggggcga ggttaagggt taaaaagtaa tataaagtgg ttaattttag ttaaaaattt 2520 

tttttttgag ttatttttgg attaggggaa agagaaacgt ttagggcggt ttttgtttta 2580 

tcggaattta tgggggaagt tttaggatag cgttgggata agtttagcgg gggtttttcg 2640 

tgaaaaataa gagttgttaa aatatggttt gataaggttg aaaatattga atttttttgt 2700 

gttagtttcg tgttttttat ttttttttgt attagcgtta ggttgtttag ttggaataaa 2760 

ttattttggt ataattttcg ttttttggtt ttaagcgatt tttttgtttt agttttttta 2820 

gtagttgaga ttataggcgt gcgttattac gttcggttaa tttttgtatt tttagtagaa 2880 

attataggtg tatgtttgta attttagtta tttagaaggt tgagggagga gaatcgtttg 2940 

aattcgggag gaggaggttg cggtgagtcg agatcgtgtt attgtatttt agtttgggta 3000 

ataagagcga aattttgttt taaaaaaaaa aaaaagattt tttttagaaa attgaagttg 3060 

aaggaatatt ttttaattta aatttatttt ggtatttttt ttttgtattg attgttggtt 3120 

tatattttaa ttttttggaa tttggagata atgttggata attggtatcg gtaggtttta 3180 

cgttttttta gagaattgta ggatatatgg aagtgattta ggaaatattt ggtgattart 3240 

atagatagaa tttttgaaga tggatttttc gtttaaagat tatttttttt aagaattttt 3300 

tttagttatt tttttaagtg tttggacgga gatgtgttgt gagtgtgaat tatatacgga 3360 

atttttttgt ttttttgata cgtataattt tattataaaa tgttaaaaaa tgtagtattt 3420 

taaaaaacgt aagatgtatt gttgatatta atttttataa gtgtgttgtg ttgttatata 3480 

taataaattt tttaaaaatt attaaatatt tagggatatt aggtagggtg cggtggttta 3540 

tatttgtaat tttagtattn tgggaggtcg aagtaggcgg attacgaggt taggagtttg 3600 

aaagtagttt ggttaatatg gtgaaatttt atttttatta aaaatataaa aattagttga 3660 

gcgtggtggt atgtgtttgt aantttagtt atttaggagg ttgaggtagg agaattgttt 3720 

gaattaggat tcgggaggcg gaggttgcgg tgagtcgaga ttacgttatt gtattttagt 3780 

ttaggtaata agagcgaaat tttgttttaa aaaaaagaaa agaaaagaaa agtatttttt 3840 

gtatagagaa ttttatgagt tatacgaaaa tttgtatagt tttatattga agttaatatt 3900 

gagttgtatt agaatttata tttttagtaa aataagttta ttgtataggt attattaaag 3960 

gatagttatt tattttttat ttttattttt ttttttttta tttttatttt tttttttttt 4020 

tttttttttt ttattttttg gtttgttttt ttttgagttt tttttttttg ttttattttt 4080 

gtttttagta cgatatgtag taatattttt tttatatttt ttttgtagtt taattgtttt 4140 

ttgtttatat ggttgtttat ttttggttga ttgtttagat tatattttat ttagtttttt 4200 

ngtagttttt acgataaata ggtttgggtg tttatttttg atttttgggc gatgtttaga 4260 

gtaaaatagg aagaaggtag atggtggttt tttaggagta ttgcgatttt tttttttttt 4320 

ttttttattt ttttttatat ttcggaggaa tgtaattttt aatttttttg ttatagcgag 4380 

ttttgttatt ttttgttttt tttttaaatt ttattttttt tttgtagacg tggtttttta 4440 

tttttttaaa tattttttgg agaattttac gaaattgata gaagagttcg ggtgtttttt 4500 

tttgtgtttt ttttggtagg tttgcgcg 4528 

<210> 220 
<211> 4616 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 220 

tgaaatggtt ttattttttt ttagtttgtt taagttaatg gagttttatt agaatgtgag 60 

ttataaaata gtagagttat tttttattta atgttgtatt tgtagggttt agaatagtat 120 

ttgaaggatt ggcggaggtt taataaattg ttaaaagggt agattagtta ggtgtgatgg 180 

cgtgtgtttg tagttttagt tatttgggag gttgaggtag gaggattatt tgattttagg 240 

atttttgggt tgtagtgtag tatattagtt aggtgtttgt attaagtttt gtattaattt 300 

ggttgtttaa ggatggttga attgatttag gtggaaattg gagtaggtta aaatttaatg 360 

ttgattagta atagggtcgt atttgtaaat agttatcgtt ttttagtttg ggtaatatag 420 

tgagatttta tttttaaaat aaattttaaa aataattaat tagaaaaaaa aattagtttg 480 

taattttagt attttgggag gttaaggcgg gtagattatt tgaggttagg agtttaagat 540 

aagtttggtt aatatggtaa aattttaatt tttattaaaa atataaaaat tagttgggcg 600 

tggtggtagg tgtttgtagt tttagttatt taggaggttg aggtaagaga atcgtttgaa 660 

tttgggaggt ggagattgta gtgggtcgag atcgcgttat tttattttag tttgggtagt 720 

agagtgcgat tttgttttaa aaaaaaaaaa aaagaaagaa aagaaaagaa aaagaaaaga 780 

aaagaaaaaa attgggaggt ttaagtttat ttttggtgtt tttatatttt tttgttttat 840 

ttttttgtat ttagtttttt tttgtaattt gtgttttata ttagttttta agtttttaac 900 

gtgatttagt atgagaatrtg gattttgtta tttttttgtt tataatattt tatgtttttt 960 

ttttgtttag aatattattt tttttattgt ttttattaat ggaattggta tttttttaag 1020 

seite 119 



gatatgatta aatttgttta tttttatatt 
tttaatatga tgatattgat agggtttgtt 
taggtagcgt tttttgtttt tgtttttgtt 
tatttaggtt agagtgtaat ggtatagttt 
taaattatta ttttatttta gttttttgag 
tttggttaat ttttttgtat ttttagtaga 
tttcgaattt ttggatttaa gtaatttatt 
ttttatttat ttttatgttt ttagtttata 
taaatatttg ttgaatgtaa tagtaaatag 
aggtggtaaa aggtttggag aaaaaaataa 
gtagtaggag ataagatgga aaggtttttt 
gaagtatata atgtgtatat cgtggtaggt 
gagagtaatg tgttgaaaaa taaatatagg 
tatttgtttt ttatttaagt ttaagtttgt 
cgggtttttt tagtagtttg ttttattttt 
atatatrgagt tgtttttttt tttagttaga 
ttttttttgt gtttggagtt gggaagtagg 
gttgggtggt gttagggaga gtttgtatag 
gtttatggtt tcgataattt tttgtttgtg 
ttagttttgg tatgggggag agggtatagg 
tttttgtaaa agggcgtttt gtgagttagt 
tttagttttc gtttttaatg tacgtatagt 
agttagtcgt atggtttttt tgtgttttag 
tgttttaggt tttattgtgt aattgttgtt 
ttagaggttg tttcggatgt aggaggattt 
tgatttattg ggcgaggagg atttgtttag 
attcggagag gaggatttat ttggagagga 
agttaagttt aaattagaag aagagggttt 
ggtttttgga gatttttaag aattttagaa 
gttattaatt tttaaattta ggttttagga 
taggttttgt ttatttaggg aaggagggga 
ggttttatat taatattttt atttttattt 
aaaataaaaa gggtgtaaaa ggagagaggt 
gttggagaag agaaagggat gagaattgta 
aaaaataggt ggagaaggag agttagagag 
aggtgaagtg ggtattagag ataagtaaga 
tataatgagg aaattgagat ttaggaagaa 
ttgattttta agttaggaat ttggggaaag 
agtggggaga agaaagaagg gagaaaggaa 
gattgaagtg tttatttatt tttttttttt 
gtttaggttg gagtgtaatg gcgcgatttc 
aagtgatttt tttgttttag tttttagtta 
cgttcggtta atttttgtat ttttagtaga 
tttcgaattt ttgattttag gtgatttaat 
aggcgtgagt tatagcgttt ggtttgaagt 
tgatngtaag ttggtaggat tgttgtttgg 
cggttttttg tgttttgtat ttggttxgtt 
aaggtatttg cgtttgtgat atcgttttgg 
gagcggttta ttttttttat ttatataggg 
gagatattta ttcgttgtat agatttaatt 
agtcgttttt gaatattggt ttcgggcgtt 
ttttttttat tcgggttttt taagtttttg 
ttttatttta ggcgattcgt tttggtttcg 
gttttcggtg gatattcgtt tttagttcgt 
atttttgggt ttttagtttt cgtcgttttt 
tggtgagggg gttttttcgt cgagatttgg 
tcgcggtagt gtttgttcgg gggttgggtt 
ttttttttac gtagtgtaat tgattttgtt 
gcgggagtat cgggttttgt agttgtattt 
ggagtatatt gtggaaggtt atcgtttttt 

<210> 221 

<211> 4616 

<212> DNA 

<213> Artificial Sequence 

; <:220> , ; 

5<223> chemically treated genomic 



seq_prot.txt 

attttttaaa gtagaattta tttttttttt 1080 

tttatttatt agattgtgag ttgtttaggg 1140 

tttttttttt gagatagggt tttgttttgt 1200 

tagtttattg tagttttaat cgtttcggtt 1260 

tagttgggat tataggtata tgttattata 1320 

gatagggttt ggttatgttg ttcgggttgg 1380 

tattttagtt ttttaaaatg agggatcgtg 1440 

gtttagtgtt ggatttatgg tagtattaaa 1500 

tattttaggg agtaagaatt agattaataa 1560 

tagtttaatt tggttagagt atgagggaga 1620 

gggtaaggtt ttgaaggaag ttggaagtta 1680 

agtggggagt taatgaaggt ttttgagtag 1740 

ttaaatttat tagagttttt ttgatatata 1800 

tttttatata tttattattt aatttatttt 1860 

ttatttgttt tttggtggag ttagggatgt 1920 

ggatatgggg gttttagttt ttttgttttt 1980 

ttagggttag ttgaggttgg ttggtaagta 2040 

tgttaggtgg tgttttgggt tttaagttga 2100 

tatatatttg ttttttattt tatttttatt 2160 

gttagataaa tttgtgagat tttggtttta 2220 

ttgttttttt ttaggtttgt ttttttttta 2280 

tcgtatatat cgtgtgttgg gatattttat 2340 

tttttggttt tttttgttga tttcggtttt 2400 

gttattgttg tttttggtgt ttgtttattt 2460 

ttttttggga ggaggttttt ttggggaaga 2520 

tgaagaggat ttatttagag aggaggattt 2580 

ggatttattt ggagaggagg atttatttga 2640 

tttgaagtta gaggatttat ttattgttga 2700 

taatgtttat agggataaag aaggtaagtg 2760 

ggtttatgat ttttttttta tattttagtt 2820 

gattgtattt tttatagaag tttttttaga 2880 

tcggaggtag aaagggatag atgtggagag 2940 

gagttggatg agatgggaga gaagggggag 3000 

gatgagagaa aaaatgtgta gatagaggaa 3060 

tttgagggga agagaaaagg aaagtttggg 3120 

agagttggta gaagttattt tattttaggt 3180 

gggatatagt aggtagagaa acgtggtttt 3240 

gggttggaga ttatataagg tagagggatg 3300 

agatggtgta tttatttatt tgggatttag 3360 

ttttttttga gataaatttt tatttttgtt 3420 

ggtttattgt aatttttatt tttcgggttt 3480 

agtagttgcg attataggta tgcgttatta 3540 

gacggggttt cgttatgttg gttaggttgg 3600 

tattttggtt ttttaaagtg ttgggattat 3660 

agttatttat ttttatagat tttaagataa 3720 

tttatttagt tgcggtgttg agtttgggtg 3780 

taaggtattt gttattcgta atgtttttgt 3840 

tcgttaggaa gggattgggg ttttaagttt 3900 

gatgattaga gttattggcg ttatggaggt 3960 

tgggaattta gttttgtgga ttttttttat 4020 

ttattcgtcg tttatcgttt tattttttta 4080 

atttaggcgt tagatttttt tattatattt 4140 

ggtgttttta gtttgcgcgg gtcgttttta 4200 

cgttttttgt tcggttttgc gttttttgga 4260 

agaattgcgt ttgcgtaata atggttatag 4320 

ggatggggcg gggcgtaggg aagggaatcg 4380 

ggttttatcg ggcggggtcg gtttatttgt 4440 

ttttgggtta gagatggttt tgggtttcgg 4500 

gtattggggg gttgtaggtc gttcgggttc 4560 

tgtcgaggtg agcgcggagt tggtcg 4616 



dna (Homo sapiens) 
^eTte~12CT 



seq_prot.txt 

<400> 221 

cggttagttt cgcgtttatt tcggtaggga aacggtggtt ttttatagtg tgtttcgagt 60 

tcggacgatt tgtagttttt tagtgtagat gtagttgtag agttcggtat tttcgttcgg 120 

gatttagagt tatttttagt ttaggaggta gggttagttg tattgcgtag ggagaggtaa 180 

gtgagtcggt ttcgttcggt agggttagtt taattttcgg gtaggtattg tcgcgacggt 240 

tttttttttt gcgtttcgtt ttatttttaa gtttcggcgg ggagattttt ttattattgt 300 

ggttattgtt gcgtaggcgt agttttggga gcggcgggag ttggaagttt aggagtttta 360 

9ggggcgtag ggtcgggtag aaggcggcga gttgggggcg gatatttatc ggggattgga 420 

agcggttcgc gtaggttggg gatattcggg gttagggcgg gtcgtttggg gtgggagagt 480 

atagtgagga agtttgacgt ttaggttagg aatttaggga attcgggtag aaaaggtgag 540 

ggggtgggac ggtgggcggc gggtgggacg ttcgggatta gtgtttaggg acggttgtag 600 

gggagattta tagagttggg tttttagatt gggtttgtgt agcgggtggg tgttttattt 660 

ttatagcgtt aatgattttg gttatttttt gtataaatga aaaggatgaa tcgtttaagt 720 

ttagagtttt aatttttttt tggcgattaa aacgatgtta taaacgtaga tgttttatag 780 

gagtattacg ggtaataaat gttttaagcg ggttaggtgt aaagtatagg agatcgtatt 840 

taaatttaat atcgtagttg ggtgggttaa atagtaattt tattagtttg taattattgt 900 

tttagggttt gtaaaagtga gtggttgttt taggttaggc gttgtggttt acgtttataa 960 

ttttagtatt ttgggaggtt agggtggttg gattatttga gattaggagt tcgagattag 1020 

tttgattaat atggcgaaat ttcgttttta ttaaaaatat aaaaattagt cgggcgtggt 1080 

ggcgtatgtt tgtaatcgta gttatttggt tagaggttga ggtaggagaa ttatttgaat 1140 

tcgggaggtg gaggttgtag tgagtcgaga tcgcgttatt gtattttagt ttgggtaata 1200 

aaagtgaaag tttgttttaa aaaaaaaaaa aaaaaaagtg agtgggtatt ttagttttga 1260 

gttttaaatg agtgagtata ttattttttt tttttttttt tttttttttt tttatttatt 1320 

tttttgtttt gtatggtttt taattttttt ttttaaattt ttggtttggg agttaagaag 1380 

ttacgttttt ttatttgttg tgtttttttt ttttaggttt taattttttt attgtagttt 1440 

aagatgagat gatttttatt agtttttttt gtttgttttt ggtatttatt ttatttttta 1500 

agtttttttt tttttttttt tttaaatttt ttgatttttt ttttttattt attttttttt 1560 

tttgtttgta tatttttttt tttatttgta gtttttattt tttttttttt tttagttttt 1620 

tttttttttt ttattttatt tagtttattt tttttttttt gtattttttt tatttttttt 1680 

ttatatttgt ttttttttat tttcgagagt ggggatgggg atattggtat gggatttttg 1740 

gaagggtttt tgtggggagt atagtttttt tttttttttt gagtgaatag agtttaggtt 1800 

ggggtatggg aggggagtta tgaatttttt ggaatttgga tttggagatt gatgattatt 1860 

tatttttttt gtttttgtgg gtattatttt ggggtttttg aggattttta ggagttttaa 1920 

tagtaggtag attttttaat tttagggagt tttttttttt tgatttaggt ttaattttag 1980 

gtagattttt ttttttaggt agattttttt ttttaggtag attttttttt tcgggtggat 2040 

tttttttttt gggtgaattt tttttattgg gtagattttt ttcgtttagt gggttatttt 2100 

ttttagaaga gttttttttt aagggggaat ttttttgtat tcggggtaat ttttggggat 2160 

ggataggtat tagaagtagt agtgatagta gtagttgtat agtgaggttt ggagtagggg 2220 

tcgggattaa tagagggagt taggggttgg ggtatagggg agttatgcgg ttgattgtgg 2280 

ggtgttttag tatacggtgt gtacgggttg tacgtgtatt ggaaacgaga gttgggtggg 2340 

ggaggagtaa gtttggaggg gagtaggttg atttatagag cgtttttttg tagagatgga 2400 

gttaaagttt tataggtttg tttggttttg tgtttttttt tttatattaa agttaggatg 2460 

ggggtggagt gaggggtagg tgtgtgtata ggtagaaggt tatcggggtt atggatttag 2520 

tttggaattt aaggtattat ttggtattat gtaggttttt tttggtatta tttagttgtt 2580 

tgttagttag ttttagttaa ttttggtttg ttttttagtt ttaggtatag aaggggaaag 2640 

gtaggggagt tggggttttt atgttttttg gttgagaggg aaagtagttt atgtatatat 2700 

ttttgatttt attaggaagt aggtaaagag gtagggtagg ttgttagggg agttcgaggg 2760 

tgagttaagt aatgggtatg tgggagataa atttgagttt gaatgaaaag taagtgtatg 2820 

tgttagaggg gttttgatag gtttaattta tatttatttt ttaatatatt atttttttgt 2880 

ttaaaagttt ttattggttt tttattgttt gttacgatat gtatattgtg tatttttgat 2940 

ttttaatttt ttttaaaatt ttgtttaaga gattttttta ttttgttttt tattattttt 3000 

ttttatattt tagttaaatt aaattattat tttttttttt aaatttttta ttatttttgt 3060 

taatttagtt tttgtttttt gaaatgttat ttattattgt atttaataaa tatttattta 3120 

gtattattat aggtttagta ttgggttatg gattagggat atggaaatga ataagatacg 3180 

gttttttatt ttgggaggtt gaggtgggtg gattgtttga gtttaggagt tcgagattag 3240 

ttcgggtaat atggttaaat tttgttttta ttagaaatat aaaaaaatta gttaggtgta 3300 

atggtatgtg tttgtagttt tagttattta ggaggttgaa atgggatgat ggtttgagtc 3360 

gaggcggttg aggttgtagt gagttgagat tgtattattg tattttggtt tgggtgatag 3420 

agtaagattt tgttttaaaa aagaaaaata aaaataaaaa taaaaaacgt tatttgtttt 3480 

gagtagttta taatttagtg agtgagggta aattttgtta atattattat attgagggaa 3540 

gagagatgaa ttttgtttta gaagataata tagaagtagg taaatttgat tatgtttttg 3600 

aagaaatatt agttttattg gtaaaaataa tagggagaat gatattttgg gtaaagggaa 3660 

ggtataaaat attatgagta ggagagtgat agaatttagt ttttatatta ggttacgttg 3720 

agggtttgag ggttagtatg gagtatagat tgtaggggaa gattgggtgt aaggagatgg 3780 

ggtagaagag tgtgaggata ttaagaatgg gtttaggttt tttagttttt tttttttttt 3840 

tttttttttt tttttttttt tttttttttt ttttttttga gatagagtcg tattttgtta 3900 
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tttaggttgg agtgaagtgg cgcgatttcg gtttattgta atttttattt tttaggttta 3960 

agcgattttt ttgttttagt tttttgggta gttgggatta taggtatttg ttattacgtt 4020 

tagttaattt ttgtattttt ggtagagatt ggggttttgt tatgttggtt aggtttgttt 4080 

tgaatttttg attttaggtg atttgttcgt tttggttttt taaagtgttg ggattatagg 4140 

ttggtttttt tttttaatta attattttta aaatttattt tagagatggg attttattgt 4200 

gttgtttagg ttggagggcg gtggttattt ataggtgcga ttttgttatt gattagtatt 4260 

gagttttgat ttgttttaat ttttatttgg gttagtttaa ttatttttag gtaattaggt 4320 

tgatgtagaa tttagtgtag atatttgatt ggtatattat attatagttt agaaattttg 4380 

ggattaagtg atttttttgt tttagttttt taagtagttg ggattatagg tatacgttat 4440 

tatatttggt tagtttgttt ttttaataat ttattaagtt ttcgttagtt ttttaggtgt 4500 

tgttttaggt tttatagatg tagtattgaa tagagaatag ttttattgtt ttgtagttta 4560 

tattttagta aggttttatt ggtttaaata aattaaaaga gaataaggtt atttta 4616 

<210> 222 
<211> 4374 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 222 

tcggcgtagt agtgggggag gggatcggcg agaggggagg aaggaagggg ggaggaaggg 60 

ggagatttgt ttgaatattg taataaaaat aaagcgagaa gaaagaagcg gatttatttt 120 

tatgaggtat tttttttggt tgttatagtt tttgttaaga gttttgtttg gttggggttt 180 

tttgtgttgt tgttgttgtt tgaagattat aatggtttga aatgacggtg ttttaaagga 240 

gttgttggtg agagtttttt ttacggatgt tgttgggttg gtgtgtgaga gtaattttta 300 

gatttttggg aaggagatag agattgataa taaaatgggt tgttagcggt tggagagtga 360 

gagataaaga gtgtgggtga gggaagtggt tgtagttagt atatttatgt tgattggtga 420 

tggtttaagt gtgttaatgt gtgtgtgtcg gcgttcggtt tcgtttttat cgttttttat 480 

tggtttatta gcgaagtttg atttttgtta ttatttttta gtaaaatatt tttttttgcg 540 

tttgaattag agcgggaaat gaggtcgagt tacggttttt tttttaaatt tattaattat 600 

ttcgtaatat gtaaatgtat cgttcgtttt tatataaaat attgttttat gtaaagtagc 660 

gtcggggttt cgcgttagtc gattggatgt tttttttttt cgtttggtgt aattggttcg 720 

tttgtttgaa tatgaatata tttgttttcg gtttcgtcgg tgggtttcgg tttttttttt 780 

cggcgtaggg ttttcgagtt cgggttttgg cgttcgtttg tattttttcg tttcgcgtac 840 

gtcgagattc ggcgcggttc gttttgttgt cgcgaatttg agcggttaag tgaaggtttt 900 

tgggttgggg gtcgcgtttt tatttttttt aagttggatt cgggattttt agttttcgga 960 

gggtgtaagt ttttgtgttg tttttttttg ttaattttag gatagcgtgg tttgtcgttt 1020 

cgttttcgtt cgtttttatt ttattttagt cggggtttta agatatagag tatagcggcg 1080 

gtcgtatttt agtttatttg gttttttgga agaagaggaa aggggcggga gcgttattgg 1140 

gttttggtag ttttatttta gttgcggagg tcgcggcgaa gtttaggtcg agaggaggtt 1200 

gtcgggtcga gaattaaacg aggttagagg tttttttagt ttaagttttt agggtttgtt 1260 

taaatttttt atttcgtttt tttcgttttt tttttttttt tttttttttt cggaagtcgc 1320 

ggtgcgtagc ggagtagagg tatagttttg gttggagagg ttcgagtaaa tacgttattt 1380 

attttgtgta tagagggttt ttgtgaaaag ttttgaagag tttttattaa atatttatta 1440 

atttttttta cgtgttattt gttgattaag tgcggttgtt tttcgtagcg ttcgagagga 1500 

ggggaagtta ggggagataa gaggggaggg ggagtggaag tttgggtggg ggggcggatg 1560 

attaatgttg ggagggattt tttttttttt tttgtgtgtg tgtgtgtgtg tgtgtgtgtg 1620 

tgtgtgtgtg ttttattggc gatgtcggta tgttagcgtg tgtcgtgtgt gttatcggtt 1680 

atttgtgtgt ggtgcgtgtg ttttcgtatt acgatttttt tttttgggtt tgtttagttt 1740 

acggaggtag ttttttttta gttgttgtta gaaaatcggg gaggaaagta agtaaatggt 1800 

tttttttttt tgattatata ttagaagttt atttgttgaa tgtcgtacga ttaggatata 1860 

ttttagtata ttttaaaggc gtttttcgga gtttgtagtt aaggtgatag gagagcgagc 1920 

ggttgtttgg ttttgcgttg tttcgagttt tttcggcggt ttcgggttac ggagtcggga 1980 

aggagaagga aggtttgggt tgttgtatat attttaggat agggcgaggg ggaatagggt 2040 

tgaagagttg aggtagggag gttgaaatga agtagtcgaa gggtacgtta gatttttttt 2100 

gatttaggga aagtgttgaa agttagagtt tattaataaa atttgcgcga tttgaattcg 2160 

ttgttgattg gtcgcgtcgt cgtagttttt ggtagtcggg atagtgagga cgttgtaagt 2220 

cgttttgggt cgtcgggtta tatgtgggtt ttattttttc gtttttgttt gggatttagt 2280 

agttttgcga ttaaggagtt cgggtagggt tttcgggaag taaaggtttt ttcggtttta 2340 

gttagaggtg gggtcgtata attttttttt tttttcgggt tggggagggt tgtcggtttt 2400 

aatttttttt cgggagttcg tagcgtcggg aggaaatttg ggttagaggt gaaggaggtg 2460 

gcgtcgggtg tttaggttcg cggttttttg ggcgaggttt tatttcgtag gagcgcgatt 2520 

ttcggaggat gtcggtaagt ggggtcgcgg ttttgagttt tagattagcg aaagtcggtt 2580 

tttttttttt tttttttttt attttttaag cgagtattaa agaagtttgg atttaagatt 2640 

taiagttcgtt gtgtatgagc gagggatttg cgggggggcg _ 99 99999 ggg aagtgatttt 2700 
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gttcgaattg gaggcgagta ttatttttat aattttttaa ttagcgatcg agtggattag 2760 

tattttttta agttttgttt cggaaagaag tagtcgtaat tttgttgtta ttttttttta 2820 

gggtattggt tttttgaatt agtttttttt tttatttttt taatttttat ttcgttgcgg 2880 

agaaaagttt taataaaaaa ttagaaaagg taaggagcga ggagtgaatg ttattgacgt 2940 

tattgggtgg ggaagggggt ttcgggaaag atttttggaa attttttcgt ataaataaaa 3000 

aataaaaagg atttggttgt tttttatgtg ttaatatttg gggaggggag tttaatttta 3060 

gaatgaggtc gagtttttaa agtttaagga gagaggggtg agagtgagat tttttcgttt 3120 

tttttggatt ggttttaagg aaggtggggg tgttgtttta tttcgtgggg agttttattt 3180 

tttttgttaa gtatcggttt tgggggcgaa gcggtttagg ggtgtggggg tcgagattga 3240 

ggtgcgggcg cggttgttag aggtgtgtat aaaatcgaga taattcgtag gtcgttattt 3300 

taattgcggt tattaatacg ttcgaggttt atttgttgtt ttttaagacg cgggggggcg 3360 

gcggggattg cgttttaggt tttgtttttt tttttcgttt ttttataggt ttttgttttt 3420 

ttttattttt cgttttttgt ggttagaggg aggatttttt aggtcggcgg gtgcgttttc 3480 

gcgggggtcg tcggggtcgt tttattcggt agcggggttg tttgcgtgga gggagtttcg 3540 

ttgggggatc gtggttcgtt agagtttgtt tagagttttt agttagtatt gatgcgttaa 3600 

tatgaggtga ggtaagtatt ttattcgtat cgatttttgg tatttttttg tgtttatttt 3660 

attaggtttt cggggaggta aagaagtgtt tatttttggg gggaagtgtt tttagttttt 3720 

agttttagaa ttaagagttt tttattcgga gttttgatag ttttgatatt tttaaatttt 3780 

tttttttttc ggttaagtag aggcgtagcg tcggcgcgtt ggaaagggtg aagattggta 3840 

ttttttggag aaggaaagtt ttggtagata tttgcgttat tttgtagtgt tttaaacgtt 3900 

tttttttttg tagattttgg gaaggtggga ttttttgttc gcggagttgg taaacgggtt 3960 

tagtgagttt taaaggtaaa ggtttttatt tcggtcgttt agtttttggt gtttttttta 4020 

tttttaggtt tttgtttttt tttttttttt agtaatttgt cggttttcgt taattcggta 4080 

gtttcgtcga gtttagagtt tagaaaagtt aagaagattt tagagtcgtt tcgtggtatt 4140 

tttaggttgt ggttattgtt cggtttgggg ggaagtcgcg gtgttttcgt tttttttcgt 4200 

tttttgttaa atggttagtg ttaggggttt tttgggttta gtttttagtt ttttttcgtt 4260 

cgttttggga ggttagtttg tttttttttt tttttttttt tttttttttt tttttttttt 4320 

tttttttttg tttttttttt gtttattttt aaaatatttt taaaataata ttgt 4374 

<210> 223 
<211> 4374 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 223 

ataatgttat tttaaaggta ttttgggggt gggtagggag aagatagaga gagaaagaga 60 

gagagagaga gagagagaga gagagagaga gagataggtt ggttttttag agcgaacgga 120 

agaggattga ggattgaatt tagggagttt ttagtattgg ttatttggta ggaggcgagg 180 

aagggcgagg gtatcgcggt ttttttttaa gtcgagtagt ggttatagtt tgaaggtgtt 240 

acggggcggt tttgggattt ttttggtttt tttaggtttt gggttcggcg aggttgtcgg 300 

gttagcggga atcggtaggt tgttggagaa aggagaggga taaaggtttg gaagtggggg 360 

aggtattagg gattgggcgg tcgggatggg agtttttgtt tttagggttt attggattcg 420 

tttgttagtt tcgcggataa gggattttat ttttttaggg tttgtaaaaa gagagacgtt 480 

tggggtatta taaagtgacg taggtgtttg ttaagatttt ttttttttaa aaaatgttag 540 

tttttatttt ttttagcgcg tcggcgttgc gtttttgttt gatcggggaa agagagagtt 600 

tggaggtgtt agaattgtta gaatttcgga tgggaggttt ttggttttga agttgggagt 660 

tggaggtatt tttttttaga agtgggtatt tttttgtttt ttcgaaggtt tgatggggtg 720 

ggtatagggg aatgttaggg atcggtgcgg gtgagatgtt tgttttattt tatgttgacg 780 

tattaatatt ggttagaagt tttgggtagg ttttggcgag ttacggtttt ttagcgggat 840 

tttttttacg tagatagttt cgttatcggg tgggacgatt tcggcggttt tcgcgggaac 900 

gtattcgtcg gtttgagagg tttttttttt ggttataggg ggcggagggt aggggggaat 960 

aggggtttgt ggggaggcgg ggaaaggaga tagaatttaa agcgtaattt tcgtcgtttt 1020 

ttcgcgtttt ggagggtagt aagtgggttt cgggcgtgtt ggtagtcgta attaaaatgg 1080 

cgatttgcga gttgtttcgg ttttgtatat atttttggta atcgcgttcg tattttagtt 1140 

tcggttttta tatttttagg tcgtttcgtt tttagagtcg gtatttggta gagggagtga 1200 

ggttttttac gaggtagaat agtattttta ttttttttga gattagttta gagagaacga 1260 

ggaaatttta tttttatttt ttttttttta aattttgaaa attcggtttt attttggggt 1320 

tgaatttttt tttttaaata ttgatatatg ggaagtagtt aagttttttt tgttttttgt 1380 

ttgtgcggag gagtttttag gagttttttt cggagttttt ttttttattt aatgacgtta 1440 

gtggtattta tttttcgttt tttgtttttt ttgatttttt gttaaaattt tttttcgtag 1500 

cgagatgggg gttggagggg tgggagaagg agttgattta aagagttagt gttttggggg 1560 

gaaatagtaa taagattacg gttgtttttt ttcgaggtaa agtttaagaa ggtgttaatt 1620 

tattcggtcg ttaattgaga ggttgtaaaa atgatattcg tttttagttc ggatagaatt 1680 

attttttttt ttttcgtttt ttcgtaagtt tttcgtttat gtatagcgga tttgagtttt 1740 
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gagtttaagt ttttttaatg ttcgtttgga aagtaaaagg agagagaaag agaaagtcga 1800 

ttttcgttgg tttgggattt agggtcgcgg ttttatttgt cggtattttt cgaaagtcgc 1860 

gtttttgcga aatgaaattt cgtttaggag gtcgcggatt tggatattcg gcgttatttt 1920 

ttttattttt gatttaggtt ttttttcggc gttgcgagtt ttcggggaag ggttagagtc 1980 

ggtagttttt tttagttcgg ggaggggaga gggttatgcg attttatttt tggttagggt 2040 

cggggaggtt tttgtttttc gggagttttg ttcgggtttt ttggtcgtag ggttgttggg 2100 

ttttaggtag gaacgagagg gtgaggttta tatgtggttc ggcggtrtag ggcggtttgt 2160 

agcgttttta ttgtttcggt tgttaggggt tgcggcgacg cggttagtta gtagcgagtt 2220 

taggtcgcgt agattttatt gatgagtttt gatttttagt atttttttta agttaagaag 2280 

agtttagcgt atttttcggt tgttttattt tagttttttt gttttagttt tttagtttta 2340 

ttttttttcg ttttgttttg gggtgtgtat agtagtttag gttttttttt tttttttcgg 2400 

tttcgtggtt cgaagtcgtc gagagagttc gggatagcgt aggattaggt agtcgttcgt 2460 

ttttttgtta ttttaattgt aggtttcgag gggcgttttt ggagtgtatt gaggtgtgtt 2520 

ttaatcgtgc ggtatttaat aaatggattt ttggtgtgtg gttagaagag aaaagttatt 2580 

tattratttt ttttttcggt tttttggtaa tagttgaagg ggagttgttt tcgtggattg 2640 

agtagattta ggagagggag tcgtggtgcg gagatatacg tattatatat agatgatcgg 2700 

tggtatatac gatatacgtt gatatatcga tatcgttagt gggatatata tatatatata 2760 

tatatatata tatatatata tagagagaga gagagaattt tttttagtat tggttattcg 2820 

tttttttatt taggttttta tttttttttt ttttttattt tttttggttt tttttttttt 2880 

cgggcgttgc gaaaagtagt cgtatttagt taataaatgg tacgtgggag aagttggtga 2940 

gtgtttggtg aggatttttt agggtttttt ataagaattt tttgtatata aagtaagtgg 3000 

cgtgtttatt cgggtttttt tagttagagt tgtgtttttg tttcgttgcg tatcgcggtt 3060 

ttcgaaagga gaaaggagag aagaaagggc ggggagagcg gggtggagga tttggatagg 3120 

ttttggaggt ttgggttggg gaggtttttg gtttcgttta gttttcggtt cggtaatttt 3180 

ttttcggttt aggtttcgtc gcggttttcg tagttggaat ggagttgtta ggatttagtg 3240 

acgttttcgt tttttttttt tttttttaag gggttaggtg ggttggggtg cggtcgtcgt 3300 

tgtgttttgt gttttggggt ttcggttggg atggggtggg ggcgggcggg ggcggggcgg 3360 

taggttacgt tgttttggag ttggtaagaa aggatagtat agaaatttgt atttttcgag 3420 

gattgggagt ttcgagttta gtttaggggg agtgggggcg cgatttttaa tttagaaatt 3480 

tttatttgat cgtttaagtt cgcggtagta gggcgggtcg cgtcgaattt cggcgtgcgc 3540 

ggagcgggga gatgtaggcg agcgttagag ttcgggttcg ggggttttgc gtcggggaga 3600 

ggagtcggga tttatcggcg gagtcgaaaa taagtgtatt tatatttaaa taaacggatt 3660 

aattgtatta ggcggggaga gggagtattt aatcggttgg cgcgaggttt cggcgttgtt 3720 

ttgtataaag taatattttg tgtgagagcg agcggtgtat ttgtatgttg cggagtgatt 3780 

agtgggtttg aaaagggaat cgtggttcgg ttttattttt cgttttggtt taggcgtagg 3840 

aggaagtgtt ttgttggagg atgatgatag aggttaggtt tcgttaatgg gttagtgagg 3900 

agcggtggag gcgaggtcgg gcgtcggtat atatatatta atatatttga gttattatta 3960 

attagtatag gtgtgttggt tgtagttatt ttttttattt atatttttta ttttttattt 4020 

tttagtcgtt gatagtttat tttattgtta atttttgttt tttttttagg aatttgagaa 4080 

ttgtttttat atattaattt agtaatattc gtggagaaaa tttttattag taattttttt 4140 

aaaatatcgt tattttaaat tattgtggtt tttaagtaat aatagtagta taaaaaattt 4200 

taattaaata aaatttttga tagaagttgt gataattaga aaggatgttt tataaaggtg 4260 

agttcgtttt ttttttttcg ttttattttt attgtaatat ttagataggt tttttttttt 4320 

tttttttttt tttttttttt ttttcgtcgg tttttttttt tattgttacg tcgg 4374 

<210> 224 
<211> 2534 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 224 

aggtaggaaa gtgggatagt cggggagttg gattttattt tttgtgagtt tcgttggtat 60 

ttgatggtat gtggtttgga gagggtaggt gatttggcgt ggagggttag agggtaaatt 120 

tttaaataag tggtaatagg ttattaattt gaaagggaaa attgtgtagt gatgggaaat 180 

gtgtttaata aatttattgg gtgattaatt ataaaggttg ggttggagtt ttagaggttg 240 

tttgttaaat attttattaa gcggtatttt gaaagttgtt atttgcgtat tttgggagtt 300 

tagaggggat tttgaggggg aatgaggttt ggaggatgga attattttta ggtagattga 360 

gaaggagttt ggattttatt tttaaatata gtttggagtt tataggttag aggttttaat 420 

gggagaaaag ttaaaggaag agggtgtaga aaggagtttt agggaattgg tggttatgtg 480 

attttgagta aattttattt ttttttgaga tttagtgttt ttatttttat ggttttgtgt 540 

gtgttataga gatatggtgg ggattaaatt cgatcgtgat atgaaagtgt ttgggaaatt 600 

ttatggtttt atttaaata^ gagttatttt tatttgaatt aaggggggaa gttatttggt 660 

aggattagga attttatttt tttgaatttt tatgggtt^t gtcgaggttg aagtagttag 720 

gggttc^aagt tagtttttag tttttggaag ggtattgtga aagtggattt gatttgagaa 780 
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gtcgtttttt gatgtgggta gttatgtgat gttagtttcg aataagaggg ggtagtttgg 840 

agtttggaaa ggtgttagtg taggtggggt ttacgtttag attttttttg ttgattgttt 900 

tgatgattta tttttatatt ttagtttttt tatttttatt gtagagtcgg aaagggtgtg 960 

gggaagagag gagagggagg taggttttgg gttttggttt cgttttttgt ttttttttat 1020 

ttttttttgg gtttggttat ttagttaaaa ggtaggttaa gagtaggaga gatatagagt 1080 

tcggtattgg ttttaggtag tagttagttc gtcgttcgtt tgtgtgtttt tagagttatg 1140 

gagagagtta gtttgattta gaaggttaag ttggtagagt aggtcgaacg ttatgaggat 1200 

atggtagttt ttatgaaagg cgtcgtggag aagggcgagg agtttttttg cgaagagcga 1260 

aatttgtttt tagtagttta taagaacgtg gtgggcggtt agagggttgt ttggagggtg 1320 

ttgtttagta ttgagtagaa aagtaacgag gagggttcgg aggagaaggg gttcgaggtg 1380 

cgtgagtatc gggagaaggt ggagattgag ttttagggcg tgtgcgatat cgtgttgggt 1440 

ttgttggata gttattttat taaggaggtc ggggacgtcg agagtcgggt tttttatttg 1500 

aagatgaagg gtgattatta tcgttatttg gtcgaggtgg ttatcggtga cgataagaag 1560 

cgtattattg atttagttcg gttagtttat taggaggtta tggatattag taagaaggag 1620 

atgtcgttta ttaattttat tcgtttgggt ttggttttga attttttcgt tttttattac 1680 

gagatcgtta atagtttcga ggaggttatt tttttggtta agattatttt cgacgaggtt 1740 

atggttgatt tgtatatttt tagcgaggat ttttataaag atagtatttt tattatgtag 1800 

ttgttgcgag ataatttgat attgtggacg gtcgataacg tcggggaaga ggggggcgag 1860 

gttttttagg agttttagag ttgagtgttg ttcgttatcg tttcgttttg ttttttttag 1920 

ttttttattt tgtcgagagg attagtatgg ggtgggaggt tttatttttt ttttttaggc 1980 

gttgtttttg ttttaaaggg tttcgtggag agggattggt agagttgagg ttatttgggg 2040 

ttggggattt tatttttttt gtagttgttg agcgtattta attattggtt atgtttttat 2100 

ttttgttttt cgtattcgtt ttttttcgat tttaggatta ggttattttt tttttttttt 2160 

tgtttttttt ttgtttttgt tgtttttgat cgtaggaatt gaggagtgtt tcgttttgtg 2220 

gttgagaatt ggatagtggt aggggttgga gatgggtgtg tgtgtgtgtg tgtgtgtgtg 2280 

tgtgtgcgcg cgcgttagtg taagatcgag attgagggaa agtatgtttg ttgggtgtga 2340 

ttatgttttt ttttaataaa gtttttttgt gatatttttt ttgttttttt tttagttttt 2400 

ggcgatgggt tgggagtggg attggaattt gatttagaga ttttgatttt ggatttttga 2460 

gttagggttt tgaatttttt aggtggttta gtggttcgta cgtaagattt tgagtttagg 2520 

tgaggtcggg gttt * yy 2 534 

<210> 225 
<211> 2534 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna CHomo sapiens) 
<400> 225 ' 

ggatttcggt tttatttgga tttaaagttt tgcgtgcggg ttattgagtt atttagggag 60 

tttagggttt taatttagag gtttaaagtt agggttttta agttagattt tagttttatt 120 

tttagtttat cgttaagaat tggaagagag ataggaggag tgttataggg gaattttatt 180 

gagaggaaat atggttatat ttagtagata tgtttttttt tagtttcggt tttgtattgg 240 

cgcgcgcgta tatatatata tatatatata tatatatatt tatttttagt ttttgttatt 300 

gtttagtttt tagttataag gcgggatatt ttttaatttt tacgattaga ggtagtaggg 360 

gtaggaggga ggtaagagga ggggagaagt agtttggttt tggggtcggg aggaagcggg 420 

tgcggagagt aggggtgggg gtatgattag tggttaggtg cgtttaatag ttgtaagaag 480 

agtgggattt ttagttttag gtggttttag ttttgttagt ttttttttac ggagtttttt 540 

ggagtaagaa tagcgtttag gggagaaggg tggggttttt tattttatat tagttttttc 600 

ggtagggtgg gggattggag ggggtagggc ggggcggtgg cgggtaatat ttagttttgg 660 

ggtttttggg gagtttcgtt tttttttttt tcggcgttgt cggtcgttta tagtgttagg 720 

ttgtttcgta gtagttgtat gatgagggtg ttgtttttgt aggagttttc gttgagggtg 780 

tgtagattag ttatggtttc gtcgaaagtg gttttggtta gagagatggt tttttcgggg 840 

ttgttggcga tttcgtagtg gaagacggaa aagtttaggg ttaggtttag gcggatgggq 900 

ttggtgggcg gtattttttt tttgttgatg tttatggttt tttggtaggt tgatcgggtt 960 

gagttaatga l;gcgtttttt gtcgttatcg gtggttattt cggttaggta gcggtagtag 1020 

ttatttttta t^ttttaggta gaagattcgg ttttcggcgt tttcggtttt tttgatgagg 1080 

tggttgttta gtaggtttag tacggtgtcg tatacgtttt ggagtttagt ttttattttt 1140 

tttcggtatt tacgtatttc gggttttt^t tttttcgagt ttttttcgtt gtttttttgt 1200 

ttaatattgg atagtatttt ttaggtagtt ttttggtcgt ttattacgtt tttataggtt 1260 

attgagagta ggtttcgttt ttcgtaggag agtttttcgt ttttttttac ggcgtttttt 1320 

atgaaggttg ttatgttttt atagcgttcg gtttgttttg ttagtttggt tttttggatt 138b 

agattggttt tttttatggt tttggggata tataggcggg cggcgggtta attgttgttt 1440 

gggattaat;g tcggattttg tgtttttttt gtttttggtt tgttttttgg ttgggtggtt 1500 

2SHHHS ag ^ a 999tgggg aggagtaggg ggcgggatta gggtttaaga tttgtttttt 1560 

tttttttttt tttttatatt tttttcggtt ttgtagtaaa ggtaaaaagg, ttgggatgtg 162Q 
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ggggtgaatt attagaatag ttagtaggag aaatttgggc gtgggtttta tttgtattgg 1680 

tattttttta ggttttaggt tgtttttttt tgttcggggt tggtattata tggttgttta 1740 

tattaggaaa cggtttttta aattagattt atttttatag tgttttttta ggggttaagg 1800 

attggtttta gtttttggtt gttttagttt cgatagagtt tataaaggtt taggaggarg 1860 

gggtttttaa ttttgttagg taattttttt ttttggttta ggtgaggata atttatgttt 1920 

aggtagggtt atggagtttt ttaagtattt ttatattacg atcgaattta atttttatta 1980 

tgtttttgtg atatatatag gattatagag atgggaatat taagttttag agaggggtga 2040 

gatttgttta aagttatata gttattaatt ttttgaaatt tttttttgta tttttttttt 2100 

ttagtttttt ttttattgag gtttttgatt tatgagtttt agattgtgtt tggaagtgag 2160 

atttaggttt ttttttagtt tatttgaaga tggttttatt ttttaggttt tatttttttt 2220 

tagggttttt tttgagtttt tagaatgcgt aggtggtagt ttttagagtg tcgtttaatg 2280 

aagtgtttaa taagtagttt ttgaagtttt agtttagttt ttgtaattag ttatttagta 2340 

ggtttgttgg atatattttt tattattata taattttttt ttttaagttg gtggtttgtt 2400 

gttatttgtt tgaggattta ttttttggtt ttttacgtta ggttatttgt tttttttaag 2460 

ttatatgtta ttaggtatta gcggggttta tagagggtgg ggtttagttt ttcggttgtt 2520 

ttattttttt gttt 2534 

<210> 226 
<211> 10001 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 226 

ttaagggatt tgtttgtttt ggttttttaa cgtgttggaa ttataggtgt gagttatcgt 60 

gttcggttta ataaggaatt ttttaaataa agttttgcgg ggtcgattag attaatttat 120 

attttttgag tagattttgg tataatttat tatttgtagt attgaatgtt aaggtttgtt 180 

ttttattttg aaattatatt tttttttttt tattggaatt gaaattttat ttttttatga 240 

aatgatagtg atggtggatg gtatttgttt tttttaatat ttttatttga taaaataaaa 300 

gttagtaatt tatttcgatt tttaattttt tgttggtgtt tgaaatttta aaattgagat 360 

ttaaagtata gttttggttt tggagagatt tttaggagag ttagagttta gaagggagta 420 

ggatttagga ggtttttatt ttttagtatt ttagttgagt tagtcgggtt atggaatatt 480 

attgagtaat taaaatatta ttaatagata aaaaaagttt attgaatata aaatttaaag 540 

gtattaatag ttttgggttt aagagattta tggtaggaag ttaagagttt tgttttaggg 600 

tcggtttggg tagttttgga agaagttatt gtatatgata gtgatgagtg ttaggaaaat 660 

agtatatttt tggaagttta tttgttggtt attgttttta tttaggttgt ttattagttt 720 

ttttagtttt tttttattta tttttttttg aaagtgatag gaaatatatt tattattaag 780 

ttagttttaa ttttagtttt attaaatatt tagattttta tatatttagg ttggttttta 840 

gttttttttt tttattgggt agttgggtat aaggtgggta ggaggttttg ggagttatta 900 

agagggttta gtgagtaagg agagagatag atatttattg gtgagtattt ttagtgtgtt 960 

ttttttttgg atatagggag gatattggat gtttttttga gtatgtatta gttcgaattt 1020 

tataggagtt tagcgaggta ggtattatta ttttattttg ttgatgagga aatcgaaggt 1080 

tgttgttttt taatattagt taggtagatt tagaaaagaa tattttttat tttttaagat 1140 

aaggttttgt tttgttgttt aggttggagt gtagtggtat aattataatt tattgtagtt 1200 

ttaatttttt gggttttagt aattttttta ttttagtttt ttaagtagtt gggattatag 1260 

gtttatgtta ttatatttag ttaattgttt atttttagta gagataaggt ttcgttatgt 1320 

tgtttaggtt ggttttggta attagagtga ttttattttg ttatttaggt tgtattatag 1380 

tttattgtaa ttttgaattt ttgggtttaa gtaatttatt taagtagttg ggattataag 1440 

cgtatgttat tatgtttggt taatttttat ttttattttt gtagagatgg agttttgtta 1500 

tgttgtttag gttggtcgaa tttttgggtt taagagattt ttttgtttta gttttttata 1560 

ttgtggggat tataggtata ggttattgta tttagttgaa aaatattttt ttaatattag 1620 

tcgaaataat tagaaaaatt taattagtta taaaaataaa attaaaaaaa ggaaagaagg 1680 

atttgtttta tgtgtttaaa gtttgatatt taattttggg aattattaat gagttagtat 1740 

agggagggtg agggataaat tgaaggtcga tgtatttttt tataaaatgt tttagattat 1800 

atattttagt tgggtttttt tatttttata tattttttta gtttggtata taaaaatgga 1860 

aagaggttaa gtatagtgtt ttatatttgt aattttaata ttttgggagg tcgaggtggg 1920 

aggattgttt gaatttagga gtttaagttt agtttggata atatagtaag attttatttt 1980 

tataaaaata aaaattggtt aggtatggtg ataatatttt tgtagtttta attatttggg 2040 

aggattgttt gagtttagga gttcgagatt atagtgagtt atgatttagt ttgggtgata 2100 

gagtgagatt ttgttttttt attataaaaa aaaaaaaaaa aaaaaggagt tcgggtacgg 2160 

tggtttacgt ttgtaagttt agtattttta gaggtcgagg agggtagatt atttgaggtt 2220 

aggagttcga gattagtttg attaatatga tgaaatttcg tttttattaa aaatataaaa 2280 

attatttggg aagtcgaggt aggtagatta cgaggttagg agttcgagat tagtttggtt 2340 

aatatggtga aattttgttt ttattaaaaa tataaaaatt agttgggcgt agtgggatgc 2400 

gtttgtagtt ttagttattt aggaggttga gataggagaa ttgtttgatt tttggaggta 2460 

SeTte 126 



seq_prot. txt 

gaggttgtag tgagtcgaga tcgtgttatt gtattttagt ttgggtaata agagggaaat 2520 

tttattttaa aaaaaaaaaa aagaaaagaa aagagagaga gagaacgtgt tagtgttgtg 2580 

tgattttggg ataattattt tttttttttg ggttttggtt ttttaagttt ttaatgatag 2640 

gattaaatag atggattcgg aaggttttta tagtatatgg ttttttggaa tgttgtagga 2700 

aatagttatt aattaagtat ttttattcgg gttttatttt tattttagtg gagttatgaa 2760 

gttaaagtgg ggagggggtt gggttttatt ttggtatttg tatcgatagg tattcggaat 2820 

tgggggagag atttaatttg tatttttata tttatattag gatttttttt ataggtttgt 2880 

gttatttatt tttataaagt tgggtagttt tttgtgtaga agttttttta tttttttttt 2940 

atttagtttg aatttgtcgt ttttttggta ggagtatttg tggaaggtag tgattagtat 3000 

agttagcgtt tgttttagag aattgtatat tatggatttg tggttgtaga ggtgttaggt 3060 

gatgggtata ttgttgaggt ttttggggtt ttagtttagt ttgtaggatt tatttttgtt 3120 

ttatgttttt aagttatttt ataaaatttt tttttgaatg ggaaggtagg ggttgtattt 3180 

tttttttaga ttgtggtttt ttgaggtcgg aggcgatatt tttattttat tttgagggta 3240 

gacgttaaag ttttttttgt gtttgggagt tattaagttt tttttgatgg tatttgggat 3300 

atttttttga gggtagggtt ttttatttta gtgtttgtta gaagggaaat gatagggtga 3360 

ttagagtttt attttgatgg tatttgcgta ggttttgggg agtagggtta tatgtatcgg 3420 

ttggtagatg ggggtttttg tgtagtttgg taggagttat ggttatagtt taggagtagg 3480 

aagttttttg aaggagtcgt tttttttttt agattcggga tttttgaagt tagaggtatt 3540 

tttttttgtt atttaattgg aagttttttt gggtaggagt gttttttttt ttagattagg 3600 

agtttttaga gggtggggaa tgtgtttttt tttattttga aggtttttga gtttttaggt 3660 

agataggggt atggataggg tgggggtaat ttaatatttt ttttattagg aaggattttg 3720 

ttggaattag aattaaagat atgaaatgtt tattgttttg ggattaggat gggatagatt 3780 

ttagcggtaa ggtttttgag aatatttaga ttagtttgga aaggttaggt aggttttagg 3840 

gtgaggttag aggagttatg ttagaggtag ggtatgtttt gggtttatta tgttagtttt 3900 

gtttttatcg aggtaggggg tagagttttg ttgtttgggg ttatgattta ggtagagttt 3960 

tgagattttt gttttttgta ttttagtttt gttagaattt aggattttgt aaaaaaaaaa 4020 

aaaaaaaaaa aaaaaaattt taggattttt tttgtagtaa cgtttatatg tatatttttt 4080 

gtttttaggt ttagattttt aaagaagggt ttttagattg agatatttag gggagggttt 4140 

tagagtaatt tttgtattga gattttatag aggaatttgg agaaaaattt tttagtcggt 4200 

cgggcgcggt ggtttatttt tgtaatttta gtattttggg aggtcgagat gggtggatta 4260 

cgagattagg agatcgagat tattttggtt aatatggtga aattttattt ttattaaaaa 4320 

tataaaaaaa ttagtcgggt atggtggtgg gcgtttgtag ttttagttat tcgggaggtt 4380 

gaggtaggag aatggcgtga atttgggagg gggagtttgt agtgagttga gattatatta 4440 

ttgtatttta gtttgggcga tagagcgaga ttttatttta aaaaaaaaaa aaaaaaaaaa 4500 

aaatttttta gttaatattt tagtatttta tagagagttt ttttaagggg gaaggtttag 4560 

agaggtttag gttgatttta ttagggaagt tggaatgaga gaattgtgga gataaagata 4620 

gagatttacg gggatttttt ttattttgtt tttagtagga tagaggtatt tattagatta 4680 

ggtggtagag atagatttag gaaggagagt aaggtagtta ggtttttagg gtgaggattt 4740 

atatgtgggt tttattggtt tttaatttgg tattttaagg aaattaaatt tgttttaatt 4800 

tgattttatt aaattttggt tttttgttat ttttatttat ttttttttag tttttttttt 4860 

tttagagagt gttagtttta ttattagttg ggggagggtt ttaggttagg ttaagggtag 4920 

tttttgagtt tatttaggtt atagtgggaa gtgggaggtg tcgtggggat tgggtatttt 4980 

ggatcggggt gaggggaggt aatttttgtt ttggggtttg gagattggtg ttaggtttta 5040 

tttttatgat atgtttgtga gttatagttt atttttgtgt gagaggttgt gagtatacgt 5100 

tgagatttta tggtttagtg ttcgtgtata tatatgcgtt tgtgtgtgtg agagagaagt 5160 

aatttggggt tatttttagg aagttggggg ttagaggagg tagttggggg atattaggag 5220 

gagaaggata ggattgattt tagggtatgg atgaagtgga gtgattttta atggttagta 5280 

ttagaatatt ttttaatgag gtaatgaggt aattaagtgt gtttagaaat atttgggaag 5340 

gtttttttgt gtaggaaggt gtttggaagg atggggaagt agtgagaaat ggggttgttt 5400 

tatatatagt cgtagttggg ttgggttata gtttgtgggg agtataagga agttttaagt 5460 

agttaaaggt tgaataggaa aattttggtt tttttttttt tagtttgagg gttgggtggg 5520 

tgggtagggg tatattttta tttaaggttg aaaatattag ttggatagtt ttttaagtta 5580 

gaatttaggt ttgaagtata gtttttagtt atagtttttt tttattgttt tttatatata 5640 

tgtttttttt agattcggag aaaggtaaat attgtggtta tttagagttt agtttaaggg 5700 

agggttgttt ttttttaggg tttttttttt rttggttttt tgtttatttc ggttcgtttg 5760 

ggtattaggg tatcgttttt tttgtggttt tagttttttt ttttgtttta gttgtcgagt 5820 

agtttttttt ttttttttag tttttttatt ttagttttaa gtgatttatt cgaggtttta 5880 

ggtttagatg tttaggtagt gatttagcgg tatatttttt ttttttaggg ttagggtagt 5940 

agtagtttta gagtagtagg tggaagtttt agtttagttt ggtagggaaa gtattggtat 6000 

agagatttta aataggtttt gttaggagga gttttggata atgatatggt aggaggatta 6060 

gattgagaat aggttggttt ttttttagcg gttatcgtgt gatttggggg ttagtttgag 6120 

tagtgtatat gaagttaggt tttggggttt ttagtgtttt tttttatatt ttggtatagt 6180 

tatcgagtgt tttttgtttt ttttagtatt tgaatttttt atgttatttg tttttttatc 6240 

gaggtttatt ttaagtagaa ttcggaataa aaatttttta tattgagggt gagggataag 6300 

ggagcgtaat gtatcgattt ttatagttat tttttttttt taattaaggg tttttatagg 6360 

gttttttttg tttagaaatt gtagtgtatt tttatggttg ttttttaaag ttaggeggta 6420 

tgtttttatt tataagtggg agttgaataa tgagaatata tggatatagg gaagggaata 6480 

utatatatta gggtttgtgg gtagttgcgg ggagaaggat gggggtaagg ggagggagag 6540 

Seite 127 



seq_prot.txt 

tattaggatt aatatttgat gtatgcgggg tttaaaattt agatgatggg ttgataggtg 6600 

tagtaaatta ttatggtata tgtatattta tgtaataaat tatattttgt atatgtatgt 6660 

tagaatttaa agtaaaaata aaataaaatt aaattaaaat taaaaatttt taaaaagttt 6720 

ggtagtgagg cgttttagat tcggtttgtt gttgcgtgtt ttttaaatat atcggtatat 6780 

gtaattagtg tggttaagat tttcggtttg cgttgatttt tttgtttaga acgtgtttat 6840 

atatgtttat gttttttatt ttgttgaaat ttaatttttt ttttaaaggt tattgtaggg 6900 

agttttttta gggtttaatc ggatttagtg tttatttgtt ttgttttgta taaagcgttt 6960 

tttttgtttt tttcgtgtgt atttagagaa ttttgatttt gattgaatta tttgtgtatt 7020 

tgtttttttt tagttttatt tttatatggt gagtttatcg gttttattta tttttgtatg 7080 

aggtaagggt ttcgtttata ttaggtgtta attagtgttt acggttatta tatggtaata 7140 

ggggttattg gaggagtgtt taggagtaag ggagtttttt ttatttcgtg aaattatatt 7200 

attgattatg ttacgaaata attatttttt ttttggattt taagagtgga atcgagataa 7260 

cgggtcggaa aagagtttta gagagttata aatttttgag gagtgatagt aggaaattgc 7320 

ggtttttagg atttagtgag ggagttgagt aaatttttta gcgggtagag acggagatag 7380 

gtaggtagac gaagaagata aagagttagt tagatggaaa agttagggag aaaaatagag 7440 

atagtagtag agtcgcgggt tattagggtt agtttttgaa ggtttgtttt taattaggtg 7500 

tttttgtttt aggaattgag gttgcgattg cgtttattaa atggttatat atttgagatt 7560 

ggtgataatc gtatttattt gttattaatt tgttgcgtaa ttttgggtaa tttttttttt 7620 

tacggtaggg taggagagtt gtacggaatg gtttttagag ttcgatttta attttatttt 7680 

tatatttttt agttgtttat aaaggtagag tagttaagat ataagtaggt tttgcggata 7740 

tttaaagtag tgattttgtt tcggtttagt tttaggaagt tgggttttta tatttcgttt 7800 

attttatttt cgggttttag gtgttagaaa ttttttatta gcggtttcgc gggattttgt 7860 

ttagttgttt cgtgtttttt ggtggttaag gcgtgttatt gtagtgtcgg ttttttgtta 7920 

tttttttttt tttgttcgtt agattttttg gcgttttgtt tacgatttaa ataggagata 7980 

gtgttgattt atttttaagc ggttttttta tatttatatt tgttttatat agatgaggtt 8040 

tttcggatag tttttgttta gaagtttagg tggatgtttg agacggtagt tagatatttg 8100 

gaagttattt taaaagttta gggatatggg tattcgggag ataggaggcg gaaagattat 8160 

ttgagtttag gagtttgagg ttgtagtgag ttatgatcgt gttattgtat tttagtttgg 8220 

acgatagagt aagattttgt tttaattaat taattaatgt aagtttagag ataagggtag 8280 

gagaaaatta ggggatagaa gtgagggatt ggttattgtt aagttttgtt tatttatttt 8340 

ttttttttag gtcgtttagg ggaaagagaa atggttttat tgtgggggtt ataggtttag 8400 

tttaagtttt gatattgttt ttggtttatt gtgtgattgt cgattttggt tgagtggtgt 8460 

tacgaatttg agtttaaacg tttttagtag aaggattttt tttattatgt ttcgttttat 8520 

aaaataagtt ttgaggtatt ttgttttagt aaaatgtatt aaataattta ttttttttgt 8580 

tgtattaatt ggatttataa gtgtttttcg gtagtatgat tagtatgcga ggatttacgt 8640 

tattatattg ggtttaggtt attttggatt tgttatttat aatgggcgtg tgtatatttt 8700 

ttgttggggg gtgtagattg aaattttgaa gtattcgttg tgcgttaggt gtttttcgta 8760 

tattatttta tttaatttat ataataattt taggagttgg aataagatga ttttatttta 8820 

tagaggagaa tattgagatt taaagtttgt taaatttata taattagtaa ggggtgggga 8880 

ggtatgacga atataggtgg ttataacgtt aaattttttt tagaagtttt aattattttt 8940 

cgataatttt gagagtttta atattttatt tttatttaga ttagttttag tttttttgtt 9000 

tatttttttt ttattttata tttagtttat ttgttttatt ttttaatttt agtaggcgtt 9060 

agaagtttat ttttttttta tttttgattt tggtattttt tttttttgtt tttaaatgtt 9120 

ttgtggattg gatttttttt aatttttgtt tatttattta tttgagatat agatttattg 9180 

tcgttaaggt tggagtatag tggtattatt atagtttatt gtagttttat gtgatttttt 9240 

tatattaatt tttttagtaa ttggggttat aggtgtgcgt tattgtgcgt atttttttgt 9300 

attttttgtg gtgatggagt tttgttattt tgtttagacg ggttttgaat ttttgagttt 9360 

aagtaatttt tttgtttcgg ttttttaaag tgttgggata ttaacgtgag ttattatgtt 9420 

tagtttagtt cggattaaat ataaatataa aagagattgg gtatggtggt ttatatttgt 9480 

aattttagta ttttgggagg ttgaggtagg tagattattt gaagttaaga gtttaagatt 9540 

agtttgttta atatggtaaa attttatttt tattaaaaat ataaaaatta gttaggcgtg 9600 

gtggtataag tttgtagttt tagttattta gggaggttga gttaggaaaa tcgtttgaat 9660 

ttgggaggta gaggttgtag tgattcgaga ttatgttatt gtattttagt ttgggtgaaa 9720 

gagtaagatt tttttttaaa aataaattaa aaatgaatat gttatttaat ttaatatttt 9780 

tattgttaaa tatttatttt atgttgttat ttttttatta gtatgtaggt atataaatag 9840 

atatttgtaa gaaggaagaa atggatataa gaaaaatttt tattaataaa ggattgggtt 9900 

gggtatagtg gtttatgttt gtaattttaa tattttggga ggttagggta ggaggattat 9960 

ttaagtttag gagtttaaga ttagtatggg taatataatg a ~ 10001 

<210> 227 
<211> 10001 
<212> DNA 

<213> Artificial sequence 

<220> : 1 > i 

<223> chemically treated genomic DNA (Homo sapiens) 

'('... * _ > r , , , . 

<400> 227 > • ; » ■ i • ■ v - . • ■ ■ .: ,. , , *- 
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ttattatatt gtttatgttg gttttgaatt tttaggttta agtgattttt ttgttttggt 60 

tttttaaagt gttgggatta taggtatgag ttattatgtt tagtttaatt ttttattgat 120 

ggaaattttt tttatgttta tttttttttt tttgtaaata tttgtttgta tatttatata 180 

ttggtggaaa aataataata taaggtaaat gtttggtagt agaaatatta ggttaagtgg 240 

tatgtttatt tttaatttgt ttttgagaaa ggattttgtt tttttattta ggttggagtg 300 

tagtggtatg atttcgggtt attgtaattt ttgtttttta ggtttaagcg atttttttgg 360 

tttagttttt ttgagtagtt gggattatag gtttgtgtta ttacgtttgg ttaatttttg 420 

tatttttagt agagatgggg ttttgttatg ttaggtaggt tggttttgaa tttttggttt 480 

taggtgattt gtttgtttta gttttttaaa atgttgggat tataggtgtg agttattatg 540 

tttagttttt tttatattta tatttaattc gggttaggtt gggtatggtg gtttacgttg 600 

gtattttagt attttgggag gtcgaggtag gaggattgtt tgagtttaag agtttaagat 660 

tcgtttgggt aagatggtaa aattttatta ttataaaaga tgtaaaaaga tgcgtatagt 720 

ggcgtatatt tatagtttta gttattgagg aggttaatgt gggaggatta tatgaggttg 780 

tagtgagttg tgatggtgtt attgtatttt agrtttggcg atagtgagtt tatgttttaa 840 

ataagtaagt aaataaaaat taaaaagaat ttagtttata gggtatttga aggtaagagg 900 

aaaagatgtt agaattagag atggggagaa gatgggtttt tggcgtttgt tgaggttgag 960 

aaatgagata gataggttga gtgtggggtg gagagaggat gggtagagag attgaggttg 1020 

gtttgaatgg aaatgaaatg ttagggtttt tagggttatc ggggaataat tggagttttt 1080 

aggaaaggtt taacgttgtg attatttgtg ttcgttatgt ttttttattt tttattaatt 1140 

gtgtgaattt ggtagatttt gagttttagt gttttttttt gtgaagtggg gttattttat 1200 

tttaattttt gggattgttg tgtgaattaa atggggtaat gtacggagag tatttgacgt 1260 

atagcgagtg ttttaaaatt ttagtttgta ttttttagta aaggatatgt atacgtttat 1320 

tgtgagtgat aaatttagga tgatttgaat ttaatgtgat aacgtgggtt ttcgtatgtt 1380 

ggttatgttg tcgggagata tttatggatt taattagtat aataggggaa ataaattatt 1440 

taatgtattt tgttaagata gaatatttta gaatttattt tgtggggcgg ggtataataa 1500 

agggggtttt tttgttgaaa acgtttaagt ttaggttcgt ggtattattt aattaaggtc 1560 

gatagttata tagtaagtta gaggtaatgt taggatttaa attaaatttg tggtttttat 1620 

aatgaggtta tttttttttt ttttgaacgg tttggggaaa gggggtgggt gggtagaatt 1680 

tggtagtggt taatttttta tttttgtttt ttggtttttt tttgttttta tttttaggtt 1740 

tgtattgatt gattgattga gatagggttt tgttttgtcg tttaggttgg agtgtagtgg 1800 

tacgattatg gtttattgta gttttaaatt tttaggttta agtggttttt tcgtttttta 1860 

tttttcgagt atttatattt ttaggttttt aaaatggttt ttaggtattt ggttgtcgtt 1920 

ttagatattt atttgggttt ttgggtaggg attgttcggg aaattttatt tatgtgaagt 1980 

aggtgtgggt gtaggaaggt cgtttggaaa tgaattagta ttgttttttg tttgagtcgt 2040 

aagtagggcg ttagagggtt tggcggataa gaaagggagg atgataggag gtcggtattg 2100 

taatgatacg ttttagttat tagagggtac gaagtagttg ggtaaaattt cgcggggtcg 2160 

ttggtggaaa atttttggta tttggagttc ggagatgggg tggacggaat gtgaggattt 2220 

agttttttga ggttgggtcg gggtagagtt attgttttgg atgttcgtag ggtttgtttg 2280 

tgttttgatt attttgtttt tgtagatagt tggagaatgt gagagtggga ttgggatcgg 2340 

attttagggg ttatttcgta taattttttt gttttgtcgt gggggaggga gttgtttaag 2400 

gttacgtagt aagttagtgg taaatgaata cgattattat tagttttagg tatatggtta 2460 

tttgatgggc gtagtcgtag ttttagtttt tgagatagag atatttgatt aaggataggt 2520 

ttttaggagt tgattttagt gattcgcggt tttgttgttg tttttgtttt tttttttggt 2580 

ttttttattt gattgatttt ttgttttttt cgtttgtttg tttgttttcg tttttgttcg 2640 

ttggggggtt tgtttaattt ttttattggg ttttgggagt cgtagttttt tgttgttatt 2700 

ttttagggat ttgtagtttt ttgaagtttt ttttcgattc gttgtttcgg ttttattttt 2760 

gggatttaga ggagaggtga ttatttcgta gtatagttag tggtgtgatt ttacggggtg 2820 

agaaggattt ttttgttttt aagtattttt ttagtgattt ttgttgttat gtggtagtcg 2880 

taagtattgg ttggtatttg gtgtgggcga gatttttatt ttatgtagaa atgagtaaga 2940 

*<=ggtgagtt tattatgtgg gggtgaggtt gagagaaaat aagtatatag gtgatttagt 3000 

taaaattaga attttttaag tatatacgaa aagggtaaaa ggggcgtttt gtataggata 3060 

gaataggtag atattgaatt cggttgggtt ttgggaaggt tttttgtagt ggtttttgaa 3120 

gggogggttg gattttagta ggatagaggg tatgggtatg tgtgggtacg ttttgaatag 3180 

aggggttagc gtaagtcgag ggttttggtt atattagttg tatgtgtcgg tgtgtttaag 3240 

ggatacgtag tagtaggtcg agtttggagc gttttattgt taggtttttt aaaaattttt 3300 

aattttaatt taattttatt ttatttttat tttaagtttt ggtatatatg tgtagaatgt 3360 

ggtttgttat ataggtatat atgtgttatg gtggtttgtt gtatttatta atttattatt 3420 

taggttttaa gtttcgtatg tattaggtat tagttttaat gttttttttt tttttgtttt 3480 

tatttttttt ttcgtaattg tttataggtt ttggtatgtg gtgttttttt ttttgtgttt 3540 

atatgttttt attgtttaat ttttatttat gagtgagaat atatcgtttg gttttaaggg 3600 

atagttatgg ggatgtattg tagtttttga gtagggaagg ttttgtggag gtttttagtt 3660 

aaaaggaaag aatggttgtg aaaatcgatg tattgcgttt ttttgttttt tatttttagt 3720 

gtgaagggtt tttatttcga gttttatttg aagtaggttt cgatgggaag ataagtagta 3780 

tgaggggttt aagtattgag gggagtaagg gatattcggt ggttgtgtta aggtgtagaa 3840 

gaggatattg ggggttttaa ggtttgattt tatgtatatt gtttaggttg gtttttaagt 3900 

tatacggtga tcgttaggaa gggattagtt tgtttttagt ttgatttttt tgttatgtta 3960 

ttatttaaag ttttttttgg tagggtttgt ttggggtttt tgtgttagtg ttttttttgt 4020 
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taggttgggt tggggttttt atttattgtt ttgggattgt tgttgttttg gttttggggg 4080 

aggagggtgt gtcgttgagt tattgtttgg gtatttgggt ttggaatttc gggtgagtta 4140 

tttagggttg aggtagaggg gttgggggag gggaagaagt tattcgatag ttggagtagg 4200 

gaggggagtt ggggttatag gaagggcggt gttttgatgt ttagacgggt cgggatagat 4260 

aaagggttaa ggaggaaggg gttttgggag ggggtagttt ttttttgggt tgggttttga 4320 

atggttatag tgtttgtttt ttttcgggtt tggggaggat atgtgtgtgg ggggtagtga 4380 

gagagggttg tggttgaggg ttgtgtttta ggtttggatt ttggtttggg aagttgttta 4440 

gttggtgttt ttagttttgg gtagggatgt atttttattt atttatttag tttttaagtt 4500 

ggagaagagg aggttaaagt ttttttgttt agtttttaat tatttgggat ttttttatgt 4560 

tttttataga ttgtggttta gtttaattgc ggttgtgtgt agagtaattt tattttttat 4620 

tgttttttta tttttttaga tattttttta tatagaggga tttttttagg tatttttaag 4680 

tatatttagt tattttatta ttttattaag aggtattttg gtgttggtta ttaaaagtta 4740 

ttttatttta tttatgtttt gaagttagtt ttgttttttt ttttttgatg ttttttagtt 4800 

gttttttttg gtttttagtt ttttaagggt ggttttaggt tgtttttttt ttatatatat 4860 

aggcgtatgt atgtatacga gtattggatt atgaagtttt agcgtgtgtt tatagttttt 4920 

tatataggag tgggttgtga tttataggta tgttatgaga atgaggtttg gtattagttt 4980 

ttaggtttta gagtaggggt tgtttttttt tatttcggtt taggatgttt agtttttacg 5040 

atatttttta ttttttattg tggtttgggt gggtttaggg gttgtttttg atttggttta 5100 

gagttttttt ttagttggtg gtggagttgg tattttttgg gagggagggg gttgggaggg 5160 

aatgagtggg aatggtaaga ggttagggtt tggtgggatt aggttgaggt aggtttggtt 5220 

tttttaaaat gttaagttgg gggttagtgg ggtttatata taaattttta ttttgggagt 5280 

ttggttgttt tgtttttttt tttgggtttg tttttgttat ttggtttggt gagtattttt 5340 

gttttgttga gggtagggtg gggaggattt tcgtgggttt ttgtttttgt ttttatagtt 5400 

tttttatttt agtttttttg gtgggattaa tttgggtttt tttgggtttt tttttttgga 5460 

agaatttttt gtgaagtgtt gaagtgttga ttgaagggtt tttttttttt tttttttttt 5520 

ttgagatgga gtttcgtttt gtcgtttagg ttggagtata gtggtgtgat tttagtttat 5580 

tgtaaatttt tttttttagg tttacgttat ttttttgttt tagtttttcg agtagttggg 5640 

attgtaggcg tttattatta tgttcggtta atttttttgt atttttagta gagatggggt 5700 

tttattatgt tagttaggat ggtttcgatt ttttgatttc gtgatttatt tatttcggtt 5760 

ttttaaagtg ttgggattat aggagtaagt tatcgcgttc ggtcgattga agggtttttt 5820 

tttaggtttt tttgtgaggt tttagtgtag gggttgtttt gaggtttttt tttggatatt 5880 

ttagtttagg ggtttttttt tgggggttta ggtttaggag taggaggtgt gtatgtgggc 5940 

gttgttgtaa aaagaatttt gagatttttt tttttttttt tttttttttt tgtaaagttt 6000 

tggattttag taggattaag gtgtaagagg taggggtttt aagattttgt ttgggttatg 6060 

gttttaagta gtaaagtttt gttttttgtt tcggtgaagg tagggttggt atgatgggtt 6120 

tagggtatgt tttgtttttg gtatagtttt tttggtttta ttttgaaatt tgtttaattt 6180 

ttttaggttg gtttgagtat ttttagaggt tttgtcgttg aggtttgttt tattttgatt 6240 

ttaaggtaat gaatatttta tatttttaat tttaatttta ataggatttt ttttggtgga 6300 

gagaatgtta agttgttttt attttattta tgtttttgtt tgtttagagg tttaggggtt 6360 

tttagggtga ggggagatat attttttatt ttttgggagt ttttagtttg agagaggaaa 6420 

tatttttgtt taagggagtt tttagttaga tggtagagag agatgttttt ggttttagga 6480 

gtttcgagtt taaggaggga aacgattttt ttagggagtt ttttgttttt aggttgtagt 6540 

tatggttttt gttagattgt ataggagttt ttatttgtta gtcggtgtat gtggttttgt 6600 

tttttagagt ttgcgtagat gttattaaaa tgggattttg gttattttgt tatttttttt 6660 

ttggtagata ttaaaatggg gagttttgtt tttagggggg tgttttaagt gttattagag 6720 

gaggtttggt gatttttaga tataagggaa gttttagcgt ttgtttttag ggtgagatgg 6780 

aggtatcgtt ttcggtttta gggaattata gtttgagggg gagatgtagt ttttgttttt 6840 

ttatttagag aggggttttg tgaggtggtt tgggggtata gggtagaagt ggattttata 6900 

ggttgagtta aggttttaag agttttagta gtgtatttat tatttggtat ttttgtagtt 6960 

atagatttat gatgtgtagt tttttggagt aggcgttggt tgtgttggtt attatttttt 7020 

ataagtattt ttgttaagag ggcgataagt ttaagttgag taagggggaa atgaaggaat 7080 

ttttgtataa ggagttgttt agttttgtgg gggtgagtgg tataggtttg tgggggaggt 7140 

tttggtgtga gtgtgggggt gtaggttaaa tttttttttt agtttcgggt gtttgtcgat 7200 

gtaggtgtta gggtggggtt tagttttttt tttattttag ttttatggtt ttattggagt 7260 

ggaaatgagg ttcgagtggg agtgtttaat taatggttgt tttttgtagt attttagaga 7320 

attatgtgtt gtgagggttt ttcgagttta tttgtttaat tttgttattg gagatttgag 7380 

aaattagagt ttagaaggga aaagtgattg ttttaagatt atatagtatt ggtacgtttt 7440 

tttttttttt tttttttttt tttttttttt tttgagatgg agtttttttt ttgttgttta 7500 

ggttggagtg taatggtacg atttcggttt attgtaattt ttgtttttag gggttaagta 7560 

atttttttgt tttagttttt tgagtagttg ggattatagg cgtattttat tacgtttagt 7620 

taatttttgt atttttagta gagatagggt tttattatat tggttaggtt ggtttcgaat 7680 

ttttgatttc gtgatttatt tgtttcggtt ttttaagtga tttttgtatt tttagtagag 7740 

acggggtttt attatattgg ttaggttggt ttcgaatttt tgattttagg tgatttgttt 7800 

ttttcggttt ttgaaagtgt tgggtttata ggcgtgagtt atcgtgttcg gatttttttt 7860 

tttttttttt ttttttgtgg tggggggata agattttatt ttgttattta ggttggatta 7920 

tagtttattg taatttcgaa tttttgggtt taagtaattt ttttaagtag ttggaattat 7980 

aggagtattg ttattatgtt tggttaattt ttatttttgt agagatggag ttttgttatg 8040 

jttgtttagg t tgggtttgaa tttttgggtt taagtaattt ttttatttcg gttttttaaa 8100 
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gtattggaat tatagatgtg aggtattgtg tttgattttt ttttattttt atatgttaaa 8160 

ttaagaaagt atgtagggat agaaaagttt agttaagata tatagtttgg gatattttgt 8220 

ggagaaatgt atcgattttt aatttgtttt ttattttttt tatattgatt tattggtgat 8280 

ttttaaagtt aggtgttagg ttttgaatat atgaggtagg tttttttttt ttttttttaa 8340 

ttttgttttt gtggttggtt aaattttttt aattatttcg gttagtatta aaaaagtgtt 8400 

ttttagttgg gtgtagtggt ttatgtttgt aatttttata gtgtgggagg ttaaggtagg 8460 

aggatttttt aagtttagga gttcgattag tttgggtaat atagtaagat tttattttta 8520 

taaaaataaa aataaaaatt ggttaggtat ggtggtatac gtttgtagtt ttagttattt 8580 

999tggattg tttgagttta ggagtttaag gttatagtga gttataatat agtttgggtg 8640 

ataaagtgag attattttgg ttgttaagat tagtttaggt aatatagcga ggttttgttt 8700 

ttattaaaaa taaataatta gttgggtgtg gtggtatgag tttgtggttt tagttatttg 8760 

ggaggttgag gtgggaggat tgttggagtt taggaggttg aggttgtagt gagttgtgat 8820 

tgtgttattg tattttagtt tgggtaatag agtaagattt tgttttaaaa aataaaaagt 8880 

gttttttttt gaatttattt ggttggtgtt ggggagtagt aattttcggt ttttttatta 8940 

gtagaatggg gtgatgatat ttatttcgtt gggtttttgt gggattcgag ttgatgtatg 9000 

tttagaggag tatttagtgt tttttttgtg tttaggagga gggtatattg gagatgttta 9060 

ttaatgagta tttgtttttt tttttattta ttgggttttt ttggtagttt ttagggtttt 9120 

ttgtttattt tatatttagt tgtttagtgg ggagggagag ttgggaatta atttgaatgt 9180 

gtgagggttt gggtgtttgg tggagttggg gttggggttg gtttggtgat gagtgtattt 9240 

tttgttattt ttaggagaaa gtggatgagg aggggttgaa gaagttgatg ggtagtttgg 9300 

atgagaatag tgattagtag gtggattttt aggagtatgt tgtttttttg gtatttatta 9360 

ttgttatgtg taatgatttt ttttagggtt gtttagatcg attttgaagt agaatttttg 9420 

attttttgtt atggattttt tgggtttagg attgttgatg tttttgagtt ttgtatttaa 9480 

taaatttttt ttgtttgttg ataatatttt aattgtttag tgatgtttta taattcggtt 9540 

ggtttagttg gagtgttggg agatgagggt tttttggatt ttgttttttt ttgggttttg 9600 

atttttttgg aaattttttt aaggttagag ttatgtttta ggttttaatt ttggaatttt 9660 

aaatattagt aaaaaattgg aaatcgagat aggttgttga tttttatttt gttaaataaa 9720 

gatattaaaa aaggtaaata ttatttatta ttattattat tttataaaaa gataaaattt 9780 

taattttaat aaaggaggaa aggtataatt ttagaataaa ggatagattt taatatttaa 9840 

tattgtaagt aatgagttgt attaggattt atttaagaga tatgaattag tttaatcgat 9900 

ttcgtaaagt tttatttaaa agattttttg ttagatcgag tacggtggtt tatatttgta 9960 

attttagtac gttgggaggt taaggtaggt agattttttg a 10001 

<210> 228 
<211> 4449 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 228 

ttttttgtat atttgagagg taagaggagg tgtttgtggt ttttttgata tatacgtgat 60 

atatgtatta ttacggttta tgtgtgtgtt gttgtgtgta gggggtaggg ggtttttcgt 120 

ttttgatggt tttttttggt gggtgttttt cggtttttta ggttttttta tttgttgaac 180 

gtggtagtta agggtttggg aatgatttgt tagattggtt aatgtggaaa gagttggttt 240 

cgtttagttt tgtatttttt tttttttttt tggaatcgtt tggttgtaga gagtaggagt 300 

tttttagagt tttcggggga ttttattttt agtataattg tgtaacgtgt gtttagtata 360 

agagtattag atggagggat aggtgggagg aggaggaatt tagggtgtat ttagtttggg 420 

gtttgttgag ttgaggtttt tagagtaagg acgattaggt tggggggata ttttttttta 480 

atttttgtta agttatagag gggttagtgg gaataaagtg atattttggt ttttgtattt 540 

tttttttgtt gttttatttt ttagttcgtt tagatttgag gttggggagg ggtttttttg 600 

gaggggttag tttatgttat tttttgtaat ttttattata tatgttttta gatattttgt 660 

tataggtttt attttttacg ttaattttta tttgggaaag taaataaacg gaaagttaat 720 

ttgtgttatt tggtcgttgg ggtattcgta gtgagtagtt aagaaatgtt aggggagtcg 780 

gtgtttattt ttatgtttgg ataagagtgc gttttggatt tgcgttttgt tgtatatttt 840 

atcgtttcgt tgtatatttt atcgttttat tgtatatttt attgttttat tgtatatttt 900 

atcgttttat tgtatatttt attgttttat tgtatatttt attgttttat tgtatatttt 960 

attatttttg ttttttgtat ttagttgttt atttgtattt tagtttagga agtttagaag 1020 

atgtagaatt tttgcgagag tttagggtga aacgttgcgt tttattttta aagaaaggaa 1080 

aattatttat attttttaaa agaatgaata gtatagatta atatcgattt tttttaattt 1140 

ttaggttaat tttgagtagt taaagttaga gtagttaatt tgtgttgtga gtcgaggtat 1200 

agttgtagaa gcgtgtttga ggtgttcggt ggaggtggta gtcgagtttt gggattaatt 1260 

atcgtgttgg ggacggtatc gcgttaggat gtaggtagat ttttgtagaa gtgtttaaaa 1320 

tttatatttt tttatagggg tgagggggag ggagaaagag atgttttagt gaggataaat 1380 

attttttttt atatttaaat aattatagag tttttatttt aaagtattta taggtatatt 1440 

ttttagaaaa tatgaattgt tagtcgggta cggtggttta cgtttgtaat tttagttttt 1500 
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tgggaggttg 
tggtgaaatt 
atttgtaatt 
gttaagatcg 
aaaaaaaaaa 
ttttaggtaa 
gtgaagagaa 
tttttggatt 
gattttgtgt 
tttttaatta 
gacgtcgatg 
gtagttttta 
acggggggtt 
gttgaatttt 
gtaattgttt 
atattttgat 
tttgtatttt 
ttttatttaa 
gtatggtaaa 
gttattaata 
ggaggttgaa 
gtgaagtttt 
taattttagt 
tgtagtgagt 
ttaaaaaaaa 
gagataacgt 
tcggagaggt 
ttattaggtg 
ggaggtggaa 
cgatagaggg 
ggaggaagat 
ggttttaata 
tttattagga 
ttttcggttt 
ggtttaggtg 
attagggatt 
gtgtttagtt 
ttatcgcgac 
tttcgtggtg 
agatttttag 
ttgtcgtatg 
tataattttt 
tttgagtttt 
attttatagt 
ggtttggggg 
ttttttcggg 
agtttaatgt 
tagtagagat 
ttttttttag 
ttttatttt 



aggcgggtag 
tcgtttatat 
ttagttatta 
tattattgta 
aaaaagaata 
gttagaaagc 
gtcgggggga 
tgaatgtagt 
aaaggtatag 
attttaaatt 
gtattaggat 
tttgtatatt 
attgtatacg 
ttgatgttat 
agtgaggcgg 
gtgtaaatat 
tttaggtaat 
tatataaata 
agttaggtat 
agagtttaaa 
ggaggtgaat 
atttttatta 
tatttgggag 
taagatcgta 
aaaaaaaaga 
aaggttatcg 
taatgataag 
aagatggagt 
atttattgta 
tataagttgt 
acgtggatag 
ttagagtttt 
tttggtggtt 
ttgtttttgt 

agggcgtttt 

gtttcgagat 
ttcgttgttt 
gtgaaggtga 
agggaggatt 
tttcggtgtt 
atggaaataa 
tagattttag 
gagttagagg 
atagggggta 
tggttagtac 
ttttaaagtt 
tggttgtttt 
taagaggtgg 
gtttttttgg 



attatttgag 
taaaaatata 
ggaagttgag 
ttttagttta 
tatgaaatgt 
gttattaata 
aattaggttt 
ttgatttagg 
ttggtgatgt 
tttagaattt 
agaagtatta 
gggagggcgt 
tttttgaaag 
tatgtatata 
atataaaatt 
tacgtttagg 
ttatttattt 
tttattgagt 
tgtgttaggt 
tatttgggtt 
tatttgaggt 
aaaatataaa 
gttgaggtag 
ttattgtatt 
gtttaaggat 
ttttagggtg 
taggttgaat 
cgtatgggta 
ggtaaggtga 
gaaatagttt 
atggggttgg 
aggttttgag 
gatagtttgt 
tttggggtat 

aggggaggtc 
gtttttggag 
agtagttttt 
ttatcgttta 
attattttgg 
ttgttttcgg 
atggaaatgg 
ttgtaatagg 
gtgtgtgggg 
gagttggggg 
ggggatttat 
ttttcgtttt 
tttacgttgt 
ttggagggta 
aaatttttgt 
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gttaagagtt 
aaaaaaattt 
gttgaattta 
ggggtaacgg 
ttttagattt 
gtggtttttt 
ttttttaaat 
ttattttatt 
tgattagagt 
atcgtataaa 
ggggatttta 
gatattagga 
tgtataggta 
tttatttagt 
tttaggatat 
gaaaatgtaa 
attttttaat 
attatttgtg 
gagagagatt 
taggtttatg 
taggagttta 
aaattaggcg 
gagaattatt 
ttagtttgag 
ttgatggagg 
tttagggtaa 
aaagtagggg 
aaggttattt 
gaggatcggg 
gaagtagggt 
ttgggggttg 
tttcgcggaa 
agtagggggt 
aggaggggag 
gagaggggtt 
aaagtgtttt 
atgggaattt 
ttttcgggat 
gggtcggggg 
tagattttgt 
tttttatata 
attatggatt 
tggggagggg 
ttggggtggg 
ttagttcgtt 
ttgttatggg 
ttaagtttat 

gtgggggtac 

tttggaaacg 



txt 
taagatcggt 
agttgggcgt 
ggaggtagag 
agcgagattt 
cgttatgttt 
ttaggttttt 
tttttagttt 
gtattattat 
ttttgtagtt 
aaggttatat 
cgaacggtgt 
aaattataat 
agaagtaaag 
tttttttttt 
gagagggaga 
ggtgttttag 
tttaataaat 
tgtatgagaa 
tagaaattaa 
tttgtaattt 
agattagttt 
ggtattgtgg 
tgaatttagg 
tgatagagta 
agaaaggtaa 
ttacgggggt 
ggttttttgt 
tagagattta 
tggggtggtt 
agtgaggaaa 
tcgtaggatt 
ttaaaggttt 
gaaagaggag 
gaattrtagtt 
gtttttttgt 
tggtttgttg 
tgtttttttg 
attgtggggg 
ggtttttttt 
tttgtttatt 
cgaatgtatt 
tgagtgattc 
agtttgttac 
ggtaagggcg 
ttgtatatcg 
tttgggggtt 
tttttaggtt 
ggatagtatt 
tagaaagttt 



ttggttaata 
agtggtatat 
attgtagtga 
tattttaaaa 
ttttttttta 
ggttagagat 
tgttttttta 
tggcggtcgt 
ttaaatgatt 
tttttggagg 
cgtcgaaata 
tttgtttttt 
taagttgtgg 
aatgatatta 
cgtggttttt 
gtttgtggat 
gattattaaa 
gtgggagtta 
aattagagaa 
tagtattttg 
gattaaaatg 
tatacgtttg 
aggtagaggt 
agattttatt 
gaatatgtgc 
agggtatttt 
aggaggaggt 
ggtgtgttta 
taggaggagt 
gggatttaga 
ttatgtaaga 
tagaaagtaa 
tttagagtat 
tggttatatt 
ttgttttgtt 
ggaaggattc 
gggtttggat 
ttaagagagg 
ttagggagga 
cgttttgttt 
aaattgtaat 
gtaaagacgt 
ggtttttgtg 
taggtagatg 
aggttatttt 
tttttattgt 
cggtatttta 
attgggcgtt 
ttttttttgt 



<210> 229 
<211> 4449 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 229 

aaggtggggg 
tgaagaggga 
tttttgttgt 
tattgggttg 
tcgagagagg 
ttttaggttt 
ttgtgggatt 



1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4449 



taggagaaag 
gcgtttagtg 

ggggtatcgg 

tagtgggaag 
aagtggtttc 
atttgtttgc 
atagaggtcg 



ggttttttac 
gtgttgttcg 
gtttgagggg 
atttttaagt 
ggtgtataga 
gtttttgttt 
tggtaaattt 



gtttttaaag taagggtttt 

tgtttttatt gttttttagt 

tgggtttggg tagcgtagaa 

ttatggtagg gagcggggga 

acgaattggg tgggttttcg 

ttattttagt ttttagtttt 

ttttttttat tttatatatt 
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tagagaggtt 
tattttttga 
gagtagttag 
gttttggaat 
tgttggttat 
gttttttgtg 
ttttggttta 



60 
120 
180 
240 
300 
360 
420 



seq_prot. txt 

aggtttagag cgtttttgcg ggttatttag gtttatgatt ttgttataat tgaaatttag 480 

aaaattgtga ttatagttta gtgtattcgt gtgtggaaat tatttttatt tatttttatt 540 

atgcgataaa gataaagcgg gtgggtaaga tagagtttgt cggaggtaga gtatcggggt 600 

tggaaatttt tttttttgag gaggaaattt tttcgatttt taggatgatg attttttttt 660 

attacggggt tttttttgat ttttatagtg tttcgggggt gggcgatgat tatttttacg 720 

tcgcgatgga tttagatttt aggagggtaa ggtttttatg gaagttgttg ggtagcggga 780 

gttgaatacg gatttttttt agtaagttag gaatattttt tttaaagata tttcgaggta 840 

gtttttgata gtaaagtaga taagagaata gtttttttcg gttttttttg gggcgttttt 900 

atttgagtta gtgtggttag attgagtttt ttttttttta tgttttaagg tagggatagg 960 

gatcggaggg tgttttgggt ttttttttta ttttttgttg taggttgtta attattagat 1020 

tttaataggt tgttttttga gatttttgat ttcgcggagt ttagagtttg aagttttggt 1080 

gttagaattt tttgtataag attttgcggt agtttttagt tagttttatt tgtttacgtg 1140 

tttttttttt ttagattttt ttttttattg ttttgtttta agttgtttta tagtttgtat 1200 

tttttgtcgg ttttttttag attattttat tcggtttttt tattttattt gtaatgggtt 1260 

tttatttttt gaatatattt gggtttttgg aatggttttt gtttatgcgg ttttattttt 1320 

atttggtgaa tttttttttg tagggagttt ttttgttttg tttaatttgt ttgttattgg 1380 

ttttttcggg gagtgtttta ttttcgtggt tattttgggt attttgggac gatggttttg 1440 

cgttgtttcg tatatgtttt tgtttttttt ttttattaga tttttagatt tttttttttt 1500 

tttttttgag atggagtttt gttttgttat ttaggttgga gtgtaatggt gcgattttgg 1560 

tttattataa tttttgtttt ttgggtttaa gtgatttttt tgttttagtt ttttaagtag 1620 

ttgggattat agacgtgtgt tataatgttc gtttaatttt ttgtattttt agtagagatg 1680 

gggttttatt attttggtta ggttggtttt gaatttttga ttttaagtga tttatttttt 1740 

ttagtttttt aaagtgttgg gattataggt atgagtttgg gtttagatat ttagattttt 1800 

attaatgatt tttttggttt taatttttgg gtttttttta tttggtatag tgtttggttt 1860 

ttgttatgtt agtttttatt ttttatgtat ataaatggtg tttagtaaat atttatgtat 1920 

tgagtaaaat ttaataatta tttgttgaaa ttaaaaagtg aataaataag ttatttagaa 1980 

agatgtaaag tttataaatt tggggtattt tgtatttttt ttgagcgtaa tgtttgtata 2040 

ttaggatgtg aggattacgt ttttttttta tgttttgagg gttttatatt cgttttattg 2100 

gatagttgtt gatgttattg gagaaggaag ttggatgggt gtgtgtatga taatattaag 2160 

gaatttagtt tataatttat tttgtttttt atttgtgtat ttttagagac gtgtatagtg 2220 

gtttttcgtg aaagatagaa ttgtggtttt tttggtgtta cgttttttta gtgtgtaaat 2280 

aagggttgtt gtttcgacga tatcgttcgt ggggtttttt ggtgttttta ttttaatatt 2340 

atcgacgttt ttttagaagg tatggttttt ttatacgatg ggttttgaag atttagaatt 2400 

agttagaaaa gttatttaag attatagagg ttttgattag tattattagt tatgttttta 2460 

tatagagtta cggtcgttag tggtggtgta atggggtagt ttgagttagg ttgtatttag 2520 

gtttaggaat agaaaggtag ggttaaggga tttgggaaga aatttgattt tttttcggtt 2580 

tttttttata tttttaatta aaagtttggg aagagttatt gttggtaacg ttttttagtt 2640 

tgtttaggat agagggggaa ggtatgacga aatttgaaga tattttatgt attttttttt 2700 

tttttttttt tttgaaatgg agtttcgttt cgttgttttt gagttggagt gtaatggtgc 2760' 

gattttggtt tattgtaatt tttgtttttt gagtttaatt ttagtttttt agtagttgag 2820 

attataggtg tgtgttatta cgtttagtta aatttttttt gtatttttag tatagacggg 2880 

gttttattat gttggttaga tcggttttga atttttgatt ttaggtgatt tgttcgtttt 2940 

agttttttag agagttggga ttataggcgt gagttatcgt gttcggttga tagtttatgt 3000 

tttttaaaga atgtgtttat ggatatttta aagtaaaaat tttgtaattg tttaaatgtg 3060 

aaagaaaatg tttattttta ttaaagtatt tttttttttt ttttttttat ttttgtagag 3120 

gagtgtgaat tttagatatt tttgtaggga tttgtttgta ttttgacgcg gtgtcgtttt 3180 

tagtacggtg attagtttta gagttcggtt gttattttta tcggatattt tagatacgtt 3240 

tttgtagttg tgtttcggtt tataatatag attgattgtt ttgattttga ttatttaaaa 3300 

ttggtttaaa aattaaaaga gatcgatatt aatttgtgtt gtttattttt ttaaagaata 3360 

tgaatgattt tttttttttt gaaagtgaag cgtagcgttt tattttgggt tttcgtagag 3420 

gttttgtatt ttttgggttt tttgagttgg gatataagtg ggtagttgag tgtagaaagt 3480 

agggatggtg gggtgtatag taggatagtg gggtgtgtag taggatagtg gggtgtgtag 3540 

taggacggtg gggtgtatag taggatagtg gggtgtgtag taggacggtg gggtgtgtag 3600 

cgggacggtg gggtgtgtag taggacgtaa gtttaagacg tatttttgtt taggtatgaa 3660 

aatggatatc gatttttttg gtatttttta attatttatt gcggatgttt tagcgattaa 3720 

gtgatataag ttagtttttc gtttatttgt ttttttaaat agaaattggc gtaggagatg 3780 

aaatttgtag tagaatgttt gaaagtatgt gtaataaaaa ttgtagaggg tggtatggat 3840 

tgattttttt aggaaaattt ttttttaatt ttagatttga acgaattaga aaatagaata 3900 

gtagaggagg gatatagaag ttaaaatgtt attttatttt tattggtttt tttgtagttt 3960 

ggtaaggatt gggagagggt gtttttttaa tttagtcgtt tttgttttgg ggattttagt 4020 

ttagtaagtt ttaagttaaa tgtattttgg gttttttttt tttttatttg tttttttatt 4080 

tggtgttttt gtgttgggta tacgttgtat agttatattg gagatgggat ttttcgggag 4140 

ttttggaaag tttttgtttt ttgtagttag acggttttag ggaggaagag ggaggtgtag 4200 

aattgagcgg ggttagtttt ttttatatta attagtttgg taaattattt ttaaattttt 4260 

ggttattacg tttagtagat aaagaggttt gagaggtcgg gagatattta ttaagaagaa 4320 

ttattaggaa cgggaagttt tttgtttttt gtatatagta gtatatatat gagtcgtaat 4380 

aatatatatg ttacgtatgt gttagggggg ttatagatat ttttttttat tttttagatg 4440 

tgtaaggaa 1** 1 1 1 * «"■ ' t.- ; f i : Kt 4449 
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seq_prot.txt 

<210> 230 
<211> 4296 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 230 

agaaatttat atattttttt atttttaaga aaaggaagta gtggtgatat tattttcggt 60 

taaaatgtag tttgtttata ttttaaatat tcggtaataa tgtttttttt agagttatga 120 

agtaattaga ataattaaat ataatttttt tttaaatttt tattgtttga tttatttatt 180 

tatttaataa atatttatta aatatcgatt atgtgtttga tgtttaggga tataatagta 240 

agtggaggga aagatatata atatttgttt ttaagaaatt tggagttgag tggaggatag 300 

aaatataaat taaagaatga tataaataat tataaagtta tagttgttaa aagaaaagta 360 

tatggtgtta agagaacgtg taatataaga tttatttatg gaggtgaggg aaagtttgtt 420 

tattaaagaa gttatgattt aatttacgaa gattaggagt tggttgggtg aagaaaaaaa 480 

ggttagagga aggaagttta tattggggaa ggttttaagt ataaagggta ggaggattat 540 

agaggtatat ttacgaaatt tggagaaggt ttttagtaag taaggagaag ttaaatgaaa 600 

gtttacggga gagttggagg tttgaagata cgtttaagga tttggttttt attttttttt 660 

tattttaaga gtagtgggaa gttattaaat gattttaatt agagggttgg tataattagt 720 

tttgtatttt gaaaagttga atttagtttt cgtttgagaa attgagtgaa agagtttaga 780 

acggtcgtgg ttgagggtga ttcgtgggag atttttatat aagttatggt agtggtatgg 840 

gttggtggta gaagagggaa tagggagaag atttggaatt taattttttt ttattgataa 900 

agttatttta gttttggtaa ggtaattaat tggtgggaaa gaagatgttt agtttttttg 960 

attttattgt attttttgta tttttaatat gagtattggg aagtggtaaa atatttagag 1020 

gtagtttggg tgttaggtgg agtatgagtt aaaattttag gatgaagtaa atgaatattt 1080 

agaatgatag gaaagarttg ggagttgggt ttgggggagg gttatttatt tttatttttt 1140 

ggagattttg gtataaattt ttgtttttgt aatttttttt ttaggtaaag gaatttatta 1200 

aatgaattgt tagaagattt attgattaga gggttgtata gaattatatt tttgagagtg 1260 

ggaagtaggt tgattatata gtttattatt taattaggat atatttgaaa gagaaagggg 1320 

gttttattaa tatttaaatt ataaaatatg tatattagga atgttttggg taaatttggt 1380 

tgttttagta agaaaggaaa tttgaaagtt tatattgttt tgtttttatg ttatttcgtt 1440 

tgtatatgag agggtaagta tttttttttt ttatttgtat taagggaata aaagtataag 1500 

tatttaggtg atttttaatt tatttttaat tttatagttt ttgttatatt ttatatattt 1560 

tgaaaattat attttttatt attattattt cgtgataggt gattatttat aattatttat 1620 

tgatttagtt tcgggaagag gcggtgtaaa atgggacgtt ttatttaggt gtttattaga 1680 

aatgtagaat ttttgtttgt tttttagatt tattgaatta gaatttgtat ttttaaataa 1740 

gatttttagg tgattaatat gtatattaaa atttgagaaa aatttttaga tttcgattta 1800 

aagaaaaata ttttataatt tgatagtgta tgtatatata tatatgtata tagatataat 1860 

tgaagtataa atttaatgaa gtagaattta tcgttattat tttatttggg aaagaaatgt 1920 

gttcgcgatt taatagattg gagtatttat ttttggattt taatttgtaa tttgaaaacg 1980 

tatttttaaa gtatttagga gtaatttgaa gaaagttgag gggaggcggt agatgttttg 2040 

atttattagg gaaaacgtgg acgttttttg ttgttatttt gtgaattgtg tgtatttagt 2100 

tatttttgag taaatatttg gagcgaggaa tttttgagtg gtgtgggagg gcggtgaggg 2160 

gtagttgaaa gtcggttaaa gttttcggag gggttggttt aggaaatatg attggtagtt 2220 

acgagagagt taggggttgg acgtcgagga gagggagaag gttttcgggc ggagagaggt 2280 

tttgtttagt tgttggcgag gagttttttg tttttttcgt agcgttgagt tgaagttgag 2340 

tgagttattc gcgcgtacgg agcgacgata ttttcgcgcg tgtattcgtt cgggatagga 2400 

gtcggatttt tgtgtagttt ttttcggtcg tcgggggttt tttcgcgttt cgtcggtttt 2460 

taggtttttt tttggttggc gagcgggcgt tatatttggt tcgtatattt gcgttgtcgg 2520 

ttcggcgcgg ggttcggaga gggcgcggcg cggaggcgta gttaggggtt cgggaaggcg 2580 

tcgttcgttg cgttgggggt tcggtttatg acgagtagcg gggtttgtta tgggtcgggg 2640 

gttgtttagg ggtttgtggt cgttgtatat cgttttgtgg acgcgtatcg ttagtacgat 2700 

tttatcgtac gtttagaagt cgggtgagtg gtttttagtt cgggttcggc ggggcgtcgg 2760 

gggttttttt ggggttttcg ttttttcgtt gcgtttgata gtcgggttcg gtaattcggt 2820 

tttcgggcgg aaacgaggaa agtttttttc gcgatattta cgtagttcga ttttcgtagt 2880 

tgtagggatt gtgagttttt tttgaaaaag agaaggaaag tttagttgta aggggcgcgg 2940 

ggtacgtttg gttttttttg tgcgagtagg aaaggcgttg tgttggtcgc gttcgaggcg 3000 

agtttttatt ttcggaaagg gaagtttgag aagttggtta tttgaaggcg gtcggggagt 3060 

agcggttcgg agcggtagtt tgagttgtta aagtagttag cgtatttggg ttatttcgtt 3120 

tatggcgatg tcgcgtgttt atttagtttt ttttgagtta ttcgtttaaa aggttagttt 3180 

tttagttttt agtttttgga gacggttacg ttttttttta ggtcggtttt ttggttcgga 3240 

gtttttggaa ataagtttta agaaaataat cgatttttaa aagaaagtta gttggtttta 3300 

ttgacgtttt ggtatggatg gatagggagt ggagatgttt aggtgaaatc gagaattttt 3360 

tattgaatgt ttattgggtg ttagaggttg tagattttgt ttggaataag atagttttgt 3420 
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ttttagggag 
ttaattaggt 
aaaaaatata 
atataatatt 
agtgtaaaat 
tttgttatta 
tttaagcgat 
cgtttagtta 
tttcgatttt 
gcgtgagtita 
tatttgtagt 
ttttagtttt 
atgtttaata 
tggagatgat 
ggaggtattt 



ttgatggttt 
tatttatttg 
tatatatatt 
aatatatata 
ttttgagatt 
ggttggagtg 
ttttttgttt 
atttttgtat 
ttgatttcgt 
tcgcgttcgg 
tgtggtttgg 
tttatttgta 
ttttacgtat 
ttatgtagat 
attttattta 



tatgtaatta 
atcgttgttt 
atatatatat 
tgtatgtagg 
ttaaaagtta 
tagtggcgta 
tagtttttcg 
ttttagtaga 
gattcgttta 
ttaaaagtta 
gtaatcgagt 
aaatggttag 
cgtgtttttt 
tttggatgaa 
tttagggata 



seq_prot.txt 
tcgtttgttg gaaatcgaag 
ttgaaggttt ttaaggaaaa 
aaatatatat ataattatat 



aatgggggtt 
gatttttttt 
atttcggttt 
agtagttggg 
gacggggttt 
ttttggtttt 
gattttttgt 
atttgggtaa 
tttgaatttg 
tttttttttt 
atgtagagga 
tgggta 



tttaattatt 
ttttggagac 
attgtaattt 
attataggcg 
tattatattg 
ttaaagtgtt 
agttatgatt 
gttatagagt 
ggtaatttta 
ttttggtttt 
gttgatagtt 



ggttaaaatt 
aataaaaata 
atataaatat 
tgttatggaa 
ggagtttcgt 
tgtttttggg 
cgtgttatta 
gttagggtgg 
gggattatag 
taaataatag 
tttttttggg 
tattattttt 
tttggtgatt 
tatattattc 



<210> 231 
<211> 4296 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 231 



tgtttatgtt 
ttatatttta 
ggaggggagg 
tttaagttgg 
ttaaatattc 
aaagtttaat 
ttaaggtggg 
ttcgttttta 
gttattcggg 
cgagattgcg 
aaaatttaat 
tttatttttg 
atatttatat 
ttttagagat 
aaacgataat 
aatttatagt 
atttttattt 
aaatcgatta 
ggaaagcgtg 
ttagaaagag 
ttgttttggt 
taatttttta 
gttttttttg 
tttttttttt 
tgtcgcgggg 
aagcgtagcg 
ggggattatt 
aggacgatgt 
gttcgttata 
ttttcgcgtc 
gatgtggcgt 
tttcggcggt 

cgggggtgtc 

gggaaatagg 
tttttttttt 
tagttttttc 
taggagtttt 
taataataga 
gtttttttta 
ttaggagtga 
taacggtaaa 
tgtgtatata 



tttggataaa 
tttaaagttt 
ggtacgatac 
ttattttata 
ggttatttag 
ttttggtcgg 
cggattacga 
ttaaaaatat 
aggttaaggt 
ttattgtatt 
ttttgaagtt 
tatgtatata 
atatataata 
agcgattagg 
tgtatagaat 
ttttagtatt 
tttatttatt 
tttttttgaa 
gtcgttttta 
ttgggtgggt 
agtttaggtt 
aatttttttt 
ttcgtataaa 
tttaagaaaa 
gaaatttttt 

gagaggcggg 
tattcgattt 
gtagcggtta 
gatcgagttt 
gcgttttttt 
tcgttcgtta 
cgagggaagt 
gtcgtttcgt 
aaatttttcg 
cgacgtttag 
gagagttttg 
tcgttttaag 
aaacgtttac 
gattgttttt 
atattttaat 
ttttatttta 
ttgttaagtt 



taaaatgagt 
atatgggtta 
gtaagatgtt 
gatgaggaaa 
attatagttg 
gcgcggtggt 
ggttaagaga 
aaaaattagt 
aggagaatcg 
ttagtttggt 
ttagaggttt 
tatattaata 
tatatatata 
tgggtagttt 
tattaatttt 
tagtaggtat 
tatgttaagg 
atttgttttt 
ggagttaagg 
acgcggtatc 
gtcgtttcgg 
ttcgggggtg 

ggggattaaa 

atttataatt 
tcgttttcgt 
gattttagga 
ttgaacgtgc 
taggtttttg 
ttagcgtagc 
cggatttcgc 
gttaggaggg 
tgtataggag 
gcgcgcgagt 
ttaatagttg 
tttttagttt 
gtcgattttt 
tatttattta 
gtttttttta 
aggtgtttta 
ttattgagtc 
ttaaatttgt 
gtaaaatgtt 



gttttcgggt 
tttttaagtt 
aaatatgaag 
ttaaggttta 
tagatattgt 
ttacgtttgt 
tcgagattat 
tgggcgtggt 
tttgaattta 
gatagagcga 
tatatttttt 
ttatatatat 
tttttttatt 
agttaggatt 
ttgaagatag 
ttagtgaggg 
cgttagtgga 
aaaaatttcg 
attgaggagt 
gttatgggcg 
gtcgttgttt 
ggggttcgtt 
cgtgtttcgc 
tttgtagtta 
tcgggggtcg 
agattttcgg 
ggtgggatcg 
agtagttttc 
ggacggcgtt 
gtcgggtcgg 
ggtttggagg 
ttcggttttt 
gatttattta 
ggtaggattt 
tttcgtagtt 
agttgttttt 
agaatgatta 
gtagattaga 
gagatgcgtt 
gcgagtatat 
gttttagttg 
tttttttagg 



gatgtggatt 
attagaagag 
atgatgtaaa 
gaaaaagttt 
tatttggatt 
aattttagta 
tttggttaat 
ggtacgcgtt 
gaggtaaagt 
gatttcgttt 
ataataaata 
ttatatatat 
tttatttttt 
ttaatttttc 
gattattttg 
attttcggtt 
gttagttaat 
ggttaagaag 
tggttttttg 
gagtggttta 
ttcggtcgtt 
tcgaacgcgg 
gttttttgta 
cgggagtcgg 
ggttgtcggg 
cgtttcgtcg 
tgttggcgat 
gatttatggt 
ttttcggatt 
tagcgtagat 
tcggcgaggc 
gtttcgagcg 
attttaattt 
tttttcgttc 
gttaattatg 
tatcgttttt 
agtgtatata 
atatttgtcg 
tttaaattgt 
ttttttttta 
tgtttatatg 
tcgaagttta 



gttaattttt 
ttaggggaaa 
attatttaga 
tatgatttgt 
atagttgtaa 
ttttgggagg 
atggtgaaat 
tgtaatttta 
tgtagtgagt 
ttaaaaaaaa 
gttaagaatt 
aattatatat 
ttttaaaaat 
ggtttttaat 
ttttaggtaa 
ttatttgagt 
ttttttttgg 
tcggtttgag 
aacgggtggt 
ggtgcgttgg 
tttagataat 
ttaatataac 
attgaatttt 
gttgcgtgag 
ttcgattgtt 
agttcgggtt 
acgcgtttat 
agatttcgtt 
tttggttgcg 
gtgcgggtta 

gcggggaggt 
ggtgtacgcg 
agcgttgcgg 
gagagttttt 
ttttttagat 
ttatattatt 
gtttataaag 
ttttttttta 
aagttgagat 
aataaaatag 
tatgtatgta 
gaggtttttt 



3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4296 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
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ttaagtttta atgtatatat tgattatttg gaaattttat ttaaaaatgt agattttaat 2580 

ttagtaggtt taggaggtag gtagagattt tgtattttta atgagtattt ggatagagcg 2640 

ttttattttg tatcgttttt tttcgggatt gagttagtga gtaattgtaa atgattattt 2700 

attacgaagt gatagtggtg ggaaatgtaa tttttagaat gtatagagta tagtagaaat 2760 

tgtaaaatta aaagtgggtt gggagttatt tgaatgtttg tgtttttatt tttttaatgt 2820 

aggtgaagaa agagaatatt tattttttta tgtgtaaacg gggtaatatg ggagtagaat 2880 

agtataaatt tttaaatttt tttttttgtt agggtaatta gatttgttta agatattttt 2940 

ggtgtatatg ttttgtagtt taaatattaa tagaattttt tttttttttt agatatgttt 3000 

tgattggata ataaattatg tgattaattt attttttatt tttaaagata tgattttgta 3060 

tagttttttg gttagtagat tttttagtaa tttatttaat gaattttttt atttgagagg 3120 

aagattgtag aggtaagggt ttgtgttagg gtttttaggg aataaaggta aatagttttt 3180 

ttttaaattt aatttttaaa ttttttttgt tattttaagt gtttatttgt tttattttgg 3240 

aattttaatt tatgttttat ttagtattta agttgttttt ggatgttttg ttatttttta 3300 

gtatttatgt tgaagatgta gaaagtgtag tgaaattagg agggttaggt attttttttt 3360 

ttattaatta attgttttgt taaagttgga gtgattttgt taatggagga agattgagtt 3420 

ttaaattttt tttttatttt tttttttgtt attagtttat gttattgtta tggtttgtgt 3480 

aggagttttt tacgagttat ttttagttac ggtcgttttg ggttttttta tttagttttt 3540 

taaacgagag ttgaatttag ttttttaaaa tataaaatta gttatattaa ttttttgatt 3600 

aaaattattt aatggttttt tattgttttt gagataaaga gaagataaaa attagatttt 3660 

tgaacgtgtt tttaagtttt taattttttc gtaaattttt atttggtttt tttttgttta 3720 

ttgaaagttt tttttaaatt tcgtgaatat gtttttgtaa tttttttatt ttttatgttt 3780 

agagtttttt ttagtgtgga tttttttttt ttgatttttt tttttttatt tagttaattt 3840 

ttggttttcg tggattgaat tataattttt ttgatgggta agtttttttt tatttttatg 3900 

agtagatttt gtattatacg tttttttggt attatatgtt ttttttttaa tagttgtaat 3960 

tttataatta tttgtgttat tttttaattt atatttttat tttttattta attttaaatt 4020 

ttttgagggt aggtattatg tatttttttt tttatttatt attgtgtttt taagtattaa 4080 

atatataatc gatgtttaat aaatatttgt tggatgaatg aatgaattag gtaataaaga 4140 

tttagaagaa aattgtattt gattgtttta gttgttttat aattttggag aaaatattgt 4200 

tatcgaatat ttgagatata aatagattat attttagtcg agaatgatgt tattattatt 4260 

tttttttttt aagaatggag aagtatgtgg gttttt 4296 

<210> 232 
<211> 4001 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 232 

acgtttagtt aatttttgta ttttttagta gagacgggga tttattacgt tggttacgtt 60 

ggtttggaat ttttattttt aagtaattcg ttcgtttcgg ttttttaaag tgtaggcgtg 120 

agttatagcg tttagtttga ttttatttta tatgaagttt tttaataggt aaaatggtta 180 

tggagattaa aataaaggtg gggtcgggaa tcgattggga agagacgtga tgaaacgttt 240 

ttgggacgat gaaaagggtt tgtgatttgg taggtattac ggagcggtta ggggttaaaa 300 

tttatttttt tgtgtatttg ttgtgtgtat tggcgttgtg tgtaaatgtt atttcgattt 360 

aggaaaaaga tgacgtaagt acggtataaa gtggtcggta cgcggtaggt gtatgggaag 420 

aaattgcgga atgaaataat cgcgagttaa gagatggggt agcgggagaa atgaattcga 480 

gtttcgtttt ttattaggaa gaatcggttc gggtcggagg gttgtacgga ggattatacg 540 

gacgtttgcg ggttcgtttt tttcgtttta cgacgtttag tttgcgtttg gaattggaat 600 

ggtttagttt aaagttagat aataggtaga ttgttttttc gataaattat taaacgattt 660 

attattgtat ttttttaaaa tttgattttt agacgtattt attttttttt tttttttttc 720 

gggaagatga gatatattta tttttgaaaa tattttcggg tttgtttttt gtatattttt 780 

tttttttttt gttttacgtt atggtagcgt tcgtttaggt tgtaggcgat tcgcggggtg 840 

gggtatatta tttaaagaag gggagggatt gaggtttgta ttaaaataaa tatttttgtt 900 

tttgtaaagg ttataattaa gtaatttaga aaaagaaatg taggcggaga atagtagttt 960 

ttttttgtta agtaagagga atcggtttaa aggatatttt tttttttttt tttttttttt 1020 

tatcgggtga atagtgagtt gtttcggtaa aaagaaatcg gaaatgttgt tgtaagaggt 1080 

agaaatgtaa atgtggagtt aaataataat agggttgtcg ggttttttag attgcgacgg 1140 

tttttttcgg tttggcgggt aaatttttgg tttagtattt tttattttta cgattgatag 1200 

tttttaattg gatttttttt atttagcgga gtcgggggtt gtttggaaag atcgttttag 1260 

gaaggataaa ggttcggaag ttgtgggatt ttagtagttt gggtttttcg gattattttt 1320 

aaatgattat ttcggaatgg agttttagtt tttattagga tgttatgggt tttaaaatat 1380 

atagttatga gtttttaatg tttcgagatt taaaagtttt agattttaat gttttgtgta 1440 

ttttttattt tagggatttt ttacgtttag tatcgggtgg atgtgtaaag aagtacgttt 1500 

taggtcggtt taaggttttt taaagtttta tttttttgtt taggcgttta attttgagtt 1560 

cggatggttt taatattttt attatttata tttaggtttt ttaataatgt aatttttatg 1620 
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atgatttttt tagttaagtt tttattttat ttatttttaa attcgttaag tttttattgt 1680 

tttattttta gttttagcga ttttttcgag ttgaaaatat acggagtcga gagttcgtga 1740 

tttagagagg atttattaag tttagttagg agtttattta atttagggaa gcgtgttatc 1800 

gtcgtggaaa gtacgttttt agttcgaacg taaagtgttt tcggagttta gtagttattt 1860 

gttttttgga cggtggtttt agatttttga gaagtttaaa atttttagcg ttagttttga 1920 

gtatatggga ggggaaaatt ttaattttat taatttttgc gaggtttttg gtataaagtt 1980 

ggatagtcgt tatgataagt aagggtaagt aatrtcgtttg tcggaggaag taaaggaaar 2040 

ggagttgggg aggagggtgt agagttagga ttttcgtcga tttggtgtcg tagatattaa 2100 

tattttgggg tggaaaattt tgtaagttag agttgtgagg gtagaattgg tggaaattat 2160 

tttggaggaa ttttgtattg tgttaaatat gaagggtgga aggaagaaag tttttgcgtt 2220 

tgtttttagt tggatttttt ttttttatta gttaaaatgt tattttttag gaaggttttt 2280 

cgtaatatta tattttaacg ttttttttta gatattttat attatattat tttatttaat 2340 

tttttttata atttttatta ttttgataag atttatttgt ttattgtttt tagtatatgg 2400 

aaacgtaagt tttatgagga tatagaattt ttttattatt ttatttattg ttgtattttt 2460 

gagtgtttat attagtgttg ggtagtaagt aagagttcga taataaatat tttttgaatg 2520 

agggagatag gtttgaagtt tggagaatga gatgtagaag aggtgtaaga tttgttgcgt 2580 

tttttgtagg cggcgggggg gcggtgtagg tgttttaaga attatcgcgg gattcggtag 2640 

ggggagcgta ggcgtttttc gttaagatag aagcgtttag attataattt ttagtagtta 2700 

cgaggagttt tagggtttga tgggaacggg aaatttttta attttttacg tttcggtttc 2760 

gcgggtttcg tgggtcgttc gcgaaatttg attcgggatg cggcggttta atcggaaggt 2820 

ggatcgaaat ttcgcgatag taagaggttc gtagcgattc gcggtgttaa ggaatatagt 2880 

gtttttaaaa gaattggcgt tcgttgttcg tttttttttt cgggagtttt ttgtttattt 2940 

ttagaagagg agggaagtat aggtgggttt ttttagtttt gcgtcggatt tttgagaatt 3000 

tcgaagttat tttggttgag gttaattttc gttgtgtttt ttttgtagta tgaagatttt 3060 

ggagatttaa tcgttagttt cggattgttg ttttttagat taggatttag ttttagttta 3120 

tttttttttt tacgtttttt cgatgaataa aaatgcggat tttgaattga tgttatcgtt 3180 

tttcgaaagg ggggattcgt ttcggttgtt tttagatttt gtggttggtt tagttgtgtt 3240 

ttaggagtta cgggaggggg atttagtttt tttttttatt tttttggaaa tagagtttgg 3300 

tttttttagt gagttgagtt ttcgaatcga ggagtaagaa ttttttgaaa atataagttt 3360 

ttttgtagaa gaagtaaacg ggagtttttt tgaagaagaa gcgaacgggt tagagttggg 3420 

gtttggaaaa gttatggaag atatttttgg ggaattcgtt gtagaggacg agggagatac 3480 

gtaagtggtg atggtagtgg agtgtggagt ttggggagat gaagtgtgag gtcgatttgt 3540 

tttttggttt tgagatttat ttttttaggt ttttgttttt tttgtttggc gatttaggtt 3600 

tattgttttt tatttttttt ttagcgtttg gaattatagt tttttttagt tgtttcgatt 3660 

ttttagtggt ttttggttag agtttagtat ttaatttgag aattttttga aaggttgtaa 3720 

gtggt^aagga taataacggg gtagggagtt gattattttc gagattttta ttgtaaaata 3780 

gtttagtttt ttaggagagg gatgttttag agttgggaga agcggtacgt agttttttta 3840 

gattgagttt atattttatt tagtagtttg tgttttttat tttttttaag gatttagggg 3900 

ggtttattta ttttagaggt aggtttagtt ttagttttat atttgaaaag tataggtttg 3960 

gttagttttt taattttttt ttgtttttag ggtttttgac g 4001 

<210> 233 
<211> 4001 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400>-233 

cgttaggagt tttagaaata ggggagagtt agaaagttgg ttagatttat gttttttaag 60 

tgtagggtta gggttgagtt tgtttttggg gtaggtaagt ttttttgaat ttttgaggga 120 

agtagaagat ataaattgtt agataaaatg taagtttagt ttaaaagggt tacgtgtcgt 180 

tttttttagt tttggggtat ttttttttta gaaaattgga ttgttttata gtgaaaattt 240 

cgggggtggt tagttttttg tttcgttgtt atttttatta tttatagttt tttaagaagt 300 

ttttaggttg ggtgttgaat tttgattagg aattattgag aaatcgaggt agttgggaga 360 

agttgtagtt ttaagcgttg aaaggaagat gggggataat aaatttgggt cgttaagtaa 420 

agggggtaga ggtttggaga agtgggtttt aggattagag gatagatcga ttttatattt 480 

tantttttta gattttatat tttattgtta ttattattta cgtgtttttt tcgttttttg 540 

tagcgggttt tttagaggta ttttttatgg tttttttaga ttttaatttt ggttcgttcg 600 

tttttttttt agaaaggttt tcgtttgttt tttttgtagg aaggtttgta tttttagaaa 660 

gtttttgttt ttcgattcga ggatttaatt tattagggga attaaatttt gtttttaggg 720 

gagtggagag agaaattggg tttttttttc gtagtttttg ggatatagtt gagttagtta 780 

taggatttgg ggataatcgg ggcggatttt ttttttcggg aggcggtggt atrtagtttag 840 

agttcgtatt tttatttatc ggggaagcgt ggggagaagg atgggttgga gttgggtttt 900 

ggtttgaagg atagtagttc ggagttaacg gttgagtttt taaagttttt atattgtaga 960 

ggaagtatag cggagattag ttttagttag gatggtttcg aagtttttag ggattcgacg 1020 
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tagagttaaa gaaatttatt tgtgtttttt tttttttttg ggagtaggta gaagattttc 1080 

gggaggagag gcgaatagcg gacgttaatt tttttgaaag tattgtgttt tttagtatcg 1140 

cgggtcgtta cgggtttttt gttgtcgcgg gatttcggtt tatttttcga ttgggtcgtc 1200 

gtatttcgga ttagatttcg cgggcgattt acggaattcg cggagtcggg acgtgaaagg 1260 

ttagaaggtt tttcgttttt attaagtttt agggtttttc gtggttgttg ggagttgtag 1320 

tttgaacgtt tttattttgg cgagaagcgt ttacgttttt tttatcgagt ttcgcggtaa 1380 

tttttaaagt atttgtatcg ttttttcgtc gtttgtagag ggcgtagtag gttttgtatt 1440 

ttttttgtat tttatttttt aggttttaga tttgtttttt ttatttaaaa aatatttatt 1500 

atcgagtttt tatttgttat ttagtattga tataggtatt taggaatata ataatgaata 1560 

agatagtaga aaaattttat atttttataa ggtttacgtt tttatgtatt gaaagtaatg 1620 

aataaataaa ttttattaga gtgataaggg ttgtgaagga gattaaataa gatggtgtga 1680 

tataaagtat ttgggagaaa acgttagggt gtgatattac ggaaagtttt tttaaaaaat 1740 

gatattttaa ttgatgagaa gaaaggattt agttgagagt aaacgtaaaa gttttttttt 1800 

ttttattttt tatatttgat ataatgtagg atttttttaa aatgattttt attaattttg 1860 

tttttatagt tttggtttgt agaatttttt attttaaaat gttagtattt acggtattag 1920 

gtcggcgaga attttgattt tgtatttttt tttttaattt tatttttttt gtttttttcg 1980 

gtaggcggat tatttgtttt tatttgttat ggcgattgtt tagttttgtg ttaggagttt 2040 

cgtaggggtt gatgggattg gggttttttt tttttatgtg tttaagattg gcgttaaaag 2100 

ttttgagttt tttaaaagtt tagagttatc gtttagggag taggtagttg ttgggtttcg 2160 

gggatatttt gcgttcgggt tgggagcgtg ttttttacga cggtgatacg tttttttgga 2220 

ttgggtaagt ttttgattga atttgatgag ttttttttga gttacgggtt ttcggtttcg 2280 

tgtattttta gttcgggaaa atcgttgggg ttgggggtgg ggtagtgggg atttagcgag 2340 

tttgggggtg agtgggatgg aagtttggtt agagggatta ttataggagt tgtattgttg 2400 

ggagatttgg gtgtagatga tggggatgtt aggattattc gaatttaaag ttgaacgttt 2460 

aggtagagga gtggagtttt ggggaatttt gagtcggttt aaagcgtatt tttttgtata 2520 

tttattcggt gttgggcgta gggaattttt gaaataaaag atgtataaag tattgaggtt 2 580 

tgagattttt ggatttcgaa atattgagaa tttatagttg tatattttag agtttatggt 2640 

attttagtga aaattggggt tttatttcga aatgattatt tgggggtgat tcggggagtt 2700 

taagttgtta aggttttata attttcggat ttttgttttt tttggagcga tttttttagg 2760 

tagttttcgg tttcgttaga tggagaaaat ttaattgaag gttgttagtc gtggaagtga 2820 

gaagtgttaa attaggggtt tgttcgttag gtcgaggagg atcgtcgtaa tttgagaggt 2880 

tcggtagttt tgttattgtt tggttttata tttatatttt tgttttttgt agtagtattt 2940 

tcggtttttt tttgtcggag tagtttatta tttattcgat gagaggggag gagagagaga 3000 

gaaaatgttt tttaggtcgg ttttttttat ttggtagagg gaggttgtta tttttcgttt 3060 

gtattttttt ttttggatta tttagttatg gtttttgtaa aggtaggggt atttgttttg 3120 

atgtaaattt taattttttt ttttttttga atggtgtgtt ttatttcgcg ggtcgtttgt 3180 

aatttaggcg gacgttatta tggcgtgaga tagggaggga aagaagtgtg tagaaggtaa 3240 

gttcggaggt atttttaaga atgagtatat tttatttttt cggaggaaaa aaaaaaagaa 3300 

tgggtacgtt tgagaattaa attttgaaag agtgtaatga tgggtcgttt gataatttgt 3360 

cggaaaaata atttatttgt tatttagttt tgggttaggt tattttagtt ttagacgtag 3420 

gttgaacgtc gtgaagcgga aggggcgggt tcgtaggcgt tcgtgtggtt tttcgtgtag 3480 

tttttcggtt cgagtcggtt ttttttggta ggaggcggaa ttcgaattta tttttttcgt 3540 

tgttttattt tttagttcgc ggttgtttta tttcgtagtt tttttttatg tatttgtcgc 3600 

gtatcggtta ttttgtgtcg tatttacgtt attttttttt taaatcgagg tggtatttat 3660 

atatagcgtt agtgtatata gtaagtgtat aggaagatga gttttggttt ttaatcgttt 3720 

cgtgatgttt attaagttat agattttttt tatcgtttta gaaacgtttt attacgtttt 3780 

tttttagtcg attttcgatt ttatttttat tttgattttt ataattattt tgtttgttgg 3840 

agaattttat atagaatgga attaggttgg gcgttgtggt ttacgtttgt attttgggag 3900 

gtcgaggcgg gcggattatt tgaggatagg agttttagat tagcgtggtt aacgtggtga 3960 

attttcgttt ttattaaaaa atataaaaat tagttgggcg t 4001 

<210> 234 
<211> 4607 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 234 

tgttatttat ggaacgaaga ttttaaattt tagttatgag gataattatt tttttatttg 60 

gggatagaat attagtattt aaattattta ttcggtatgt ggtagaggag aagagaatta 120 

gaggagaagt agagatgata aagtagttat attatttatt agtttatagg taatagaatt 180 

attaatttgt tttttgtgat aaagtaataa taaagagtcg atatttttta tatttttatt 240 

tgtgtgtagt cggtattgtc gatatttatt tttttaaggg tattttttta gattttttat 300 

ttgtttacgg tatttttttg ggtttcgtat ttgttcgggt atttgtcggg gttttgtatt 360 

tgttcggttt tta tat ttgt ttggggtatt tgtttgggtt tcgtatttgt ttggggtatt 420 
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tgtttgggtt 
tttttgggtt 
tgttcgggtt 
tacgcggtcg 
ggtttaggtt 
tggagagttt 
cgttcgaaaa 
tcgaggtcgg 
cgcggtattg 
tttttcgatt 
tatttttagt 
tttataataa 
aaaaattatt 
tgtatatatg 
tatgtttatt 
atcgtgtatg 
gagttcgatt 
aaggagagaa 
tcgtttatta 
tttgggggtt 
ttaggggcgt 
gtttttttta 
ttttagggtt 
gttgagagtc 
cgcggacgtt 
ggtgtcggtt 
ggtttttatt 
ggattttttt 
gttcgttcgc 
gggtagagga 
gcgtcgttat 
tggtgtacgt 
Qcggcgggtg 
gtagagaata 
ttagagttcg 
ggagtgttcg 
tgggagagta 
ataggagaag 
tttggggcgg 
attttttgtt 
cggttaggag 
gggggaaacg 
cgcggtcggg 
ggcggtttcg 
ggtagagcgc 
aggagatttt 
aaaaggcgtt 
ataattcgtt 
gttcgcgcgt 
ggacgtcgat 
tcggtttcgc 
gaagttggga 
tttttcggag 
gtttagtttg 
atagtttcgt 
tcgagggtcg 
tggaaatcgt 
taatttcgag 
aaggatttgg 
tcgagggata 
gagcggtttt 
attttacgtt 
ttggatttgg 
gataagttta 
taattagtaa 
tttttaaaga 
tattaatata 
ttgtttagtt 



tcgtatttgt 
tcgtatttgt 
tcgtatttgt 
ttttcggttt 
ttcgtcgttt 
gaatatttcg 
ttcgcgggat 
aagttagaag 
tggggtttgt 
agggattttc 
tagggttgtt 
atttagttat 
cgttagtttg 
tatatatata 
tataaatata 
taatgtataa 
tttttatttg 
cgaatttgtc 
ttcgcgcgta 
tagagggagc 
gtttcgtttt 
tttttggttt 
ttttagatcg 
ggttagggtt 
ttaatttttt 
ttttttgttg 
tcgtgcgttt 
cgttaggcgc 
gaagttacgg 
tatcggttcg 
tgagagtcgt 
ggtcgagggt 
tcgttggtcg 
attttaagcg 
agtttatagg 
cgttttgggt 
gtttttagag 
tgggtggtaa 
cgttttttat 
ttttggattt 
agattcgggt 
taggcgtcgg 
tttggttcgc 
tcgggttttg 
gttcgggatt 
ggtttcgtcg 
aacgttcgcg 
taggggtcgg 
tcgcgaaggg 
gttcggggtt 
gggcgggggt 
ggtgagggga 
ttggatcggt 
cgaggagacg 

aggggtttcg 
ttgggagggt 
tagtaagttt 
gttttagttt 

gcgggggttt 
gttttttgtg 
aggtagagaa 
tttttagttt 
ggagcgatga 
tgtagtatgt 
gggtgtaggg 
tgttgttatt 
atgttttggt 
ttagttttgg 



ttggggtatt 
ttggggtatt 
tcgggtatcg 
tgttcgtcgg 
attaggtatt 
gagaagttta 
ttttgggcgc 
gcggaagtga 
agtttttgtg 
gattcgagaa 
gataaaaatg 
aagtatgttt 
aaatttaaat 
tatttatatt 
tttttaataa 
gtatttattt 
tcgtttggtt 
gttgattggg 
tttgggttgg 
gagtttttgc 
tatttttatt 
tcgtaggtcg 
cgtagatttt 
ttgttcgtat 
cggtcgaatg 
ttaaaattag 
agtaaatagt 
ggattcgttg 
gttttattga 
gagttagatt 
tcggaattcg 
tttggaagtt 
gtagagagtt 
tatcgacgtt 
cggcggtcgg 
gatttttgga 
gtttttcgcg 
gttttgtttg 
ttttgttttt 
taagcgtttc 
gttaggaaag 
gtatagagtc 
gggaggggtt 
taggagcgac 
tcgatttgga 
cgattcggtg 
gtcgtttatt 
gtagttcgtt 
gacgtagcga 
gcgacggttg 
tggagcgtag 
gagcgatcgg 
ttttgggatt 
cgggttttgt 

ggagggtttt 

aatttcggga 
tttttcgttc 
ttttgaggag 
tgatttatgg 
gttttgagtt 
agttcgtgaa 
ttgggtcgtt 
ttatttttgt 
atacgtttgt 
aaaagtaatg 
tgtttattta 
ttttgttgtt 
gggaggagta 



seq_prot.txt 
ttttcgggtt ttttatttgt 
tgtttagata ttgtatttgt 
tatttgtagg atttttaagt 
ggacgttggt atcgaggatg 
tgtcgttcgg ggagaatttg 
tggtcgtcgt ttgtcgcggg 
gtagtaggtt gtaatagtcg 
attgtagttt attagcgtcg 
tcgtagggtt ttaaaggaaa 
ttttatttta aaggtcggga 
tagaaagtat agtaaaattt 
ttaaatattg tacgggatat 
ttaattgagc gatttgtgtg 
tatatgtaaa tgtatgttta 
gtaatacggt gtttgtcgta 
cgtttgtttg gggttttgtt 
tttgttttac gttttagtgt 
tagagcgagc gcgtggatcg 
tatcgggcga agaatcgtgc 
gcgggcgttc ggttcgtagg 
tcggatttcg gttttttttt 
ttagtagttc gcgttaggtt 
gatattttcg tttggttttg 
tttcgggcgt ttagtttcgg 
gatggtggtg cgcgcgcgtt 
atttaaattt ttgtatggga 
gggtgagtta tgaagatgtg 
cggttagaga atttagtttg 
cgcgattttt taagacgtgg 
acgggtttta taagtattag 
tttagtatgt cgggtttttt 
tcgatggttt aggaggagta 
tcggtttgat ttagcgtagg 
cgcgagtttt ttttaaatat 
gggagggagt agggtgttgg 
aggacgtggg gtttaaattt 
ggattttttg tcgggcggga 
gcggaaagta gtcgtttttt 
cgttttttat ttttgttttt 
gcgcgtcgag gagtttagcg 
atgggtcgtt tgggggatag 
ggtatcggcg tttttagttt 
ttggcgtcgg atttgtttcg 
gcgcgttaaa aggcggcggg 
cgcggttagt tggagaggcg 
gttcgcgttg ttttttcgcg 
tttcgcggcg tttttttttt 
ttgttttttc gttcgcgtat 
aatcggggtt cgcgttaggt 
taggtaggag gtttagggtc 
cgttgggtag gtatttgggt 
ggacgagttg ggataaggcg 
tggcgttcgc gagaggttgg 
ttttagcgtc ggtcgttttt 
ttttttgttg tttttttttt 
agaggtcggg gtgcggggcg 
gcgcgttttt ttcgatttgt 
ttagggttag gttttttttt 
agttggttat taagcggttt 
tgtttgtcga gggtttttgg 
gaaatggttc gggtcggttt 
tttttttgta gtttttgttt 
ttagtttgta ttttggtttg 
tatataaata ggggatagat 
tattttaaat ttttgattag 
ttgtttggat atttggaaat 
ttgttttttg ttgttgttat 
aataaagcgc gtggtttttg 
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ttagggtatt 
ttcgggtatt 
tgtttttatt 
ttttgttcgt 
tagagtaagt 
cgttattttg 
acgtcggttt 
tcggttttcg 
cgtttacgtt 
ggtttttgag 
gaattttaga 
gttaatacgg 
tttgcgtgtg 
tatgtaaata 
tatatattat 
tgtttttgtt 
tattgagatt 
cggttatcgt 
gggtttggga 
tttcgtaggt 
ttagatagcg 
tcgtcggcgt 
ggttttggga 
gtttgttttt 
ttatttcggc 
ttcgtttttg 
cgagttagtc 
cgttagttcg 

gggttattat 

attataagta 
agttagggtt 
ggcgggcggg 
tttggtgcgc 
cgaacgggat 
tcgtcgttcg 
cggttggggt 
tcgtggtttt 
tttttttggg 
cgtcggttat 
ttagtggcgg 
tagggagttc 
tgtcgaagat 
gttttgcgtg 
aaggaggcgg 
gagcgtcggg 
cgtcgggtta 
tcgcgtttat 
tcgttcggag 
tagtcgggac 

ggggggcggt 

tcgtagtttc 
atataggggt 
agcggttaga 
ggcgttaaag 
atttcgggtt 
cgggtgtagg 
agggttgtgt 
ggatagggag 
cgatggtttt 
tttgttttcg 
ggagggagat 
ttttcggggt 
gacgttagga 
agacgatttt 
aggttatttg 
ggtttaggtt 
tgttgtttat 
gtatttattg 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 



seq_prot.txt 

agcgttgagt tatttttttt tggatttatt cggggaaaga ttaaaaagta ttttattaag 4560 

aataggatac ggtgtttgaa atgttgttat atatgaatgt atgtatt 4607 

<210> 235 
<211> 4607 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 235 

aatgtatata tttatatatg gtaatatttt aaatatcgtg ttttgttttt aatgaaatgt 60 

tttttaattt tttttcgaat aaatttaaag aggggtggtt tagcgtttag taagtgttag 120 

aggttacgcg ttttgtttat ttttttttta gagttaaggt tgaataaata aatagtaatg 180 

atagtaataa aaaataaaat aataaaaatt aaagtattgt gttaatgagt ttgagttatt 240 

tttaaatgtt tagatagtga ataaatagat ggtagtattt ttggaagtaa atgatttttg 300 

gttagaagtt tggggtgtat tgtttttttt tgtatttttg ttggttgaag atcgtttatt 360 

tgttttttgt ttatgtagta gacgtgtgta tattatatgg gtttattttt tagcgtttag 420 

gttaaaatat aagttgagta aaggtaatta tcgtttttta agtttaagtt tcgggaaagg 480 

taggggttgt aggaggaggc ggtttaggag ttaagggggc gtgaggtgtt tttttttagg 540 

tcggttcggg ttattttttt acgggttttt tttatttggg atcgtttcgg agataggtta 600 

ggagttttcg atagataaat ttagagttat agggagttgt ttttcgggaa attatcgaaa 660 

tcgtttagta attaatttta tgggttaagg ttttcgttta gatttttttt tttgtttaga 720 

ggggggtttg gttttggttt tttagggaag ttgaggtttc gggattggta tagttttgta 780 

ggtcggaggg agcgcgcggg cggggaggag tttgttggcg atttttattt gtattcgcgt 840 

ttcgtatttc ggtttttttt cggggttatt tttttaacgg ttttcggagt tcggggtgga 900 

gaggggatag taggaggagg gttttttcgg aatttttgcg gggttgtttt tggcgttaaa 960 

ggcggtcggc gttgagggta ggattcgcgt tttttcgtag gttagatttt ggtcgtttta 1020 

gtttttcgcg agcgttaagt tttaggggtc gatttaattt cgagggagtt tttgtgtcgt 1080 

tttgttttag ttcgttttcg atcgtttttt ttttattttt tagtttcgga gttgcgagtt 1140 

taggtgtttg tttagcgttg cgttttagtt ttcgttcgcg gagtcgaatc gtttttcggt 1200 

tttgggtttt ttatttgtag tcgtcgtagt ttcgggtatc ggcgttcgtt tcggttggtt 1260 

tggcgcgggt ttcggtttcg ttgcgttttt ttcgcgggcg cgcgagtttt cgggcgggtg 1320 

cgcgggcggg gaggtagggc gggttgttcg gtttttaggc gggttatatg ggcgcgggga 1380 

ggggaggcgt cgcggggagt aggcggtcgc gggcgttagc gtttttttag ttcggcgcgc 1440 

gggaaggtag cgcgggttat cgagtcgcgg cggggttaag gttttttttc ggcgtttcgt 1500 

ttttttagtt ggtcgcgttt aagtcggggt ttcgggcgcg ttttgtttcg tttttttttc 1560 

gtcgtttttt ggcgcgcgtc gtttttgtag agttcggcga ggtcgtttac gtagggtcga 1620 

agtaggttcg gcgttagggt ttttttcgcg ggttagattc gatcgcgatt ttcggtagag 1680 

ttggggacgt cggtgtcgat tttgtgttcg acgtttgcgt ttttttcgga ttttttgttg 1740 

ttttttagac ggtttatttt ttttgatatt cgggtttttt ttggtcgtcg ttattagcgt 1800 

tgggtttttc ggcgcgcggg gcgtttggaa tttaaggggt aggggatgtg gtcggcgggg 1860 

aataggggtg aggggcgggg aataggggtg aggggcgtcg ttttaggttt aggaggaggg 1920 

gaacggttgt ttttcgttaa gtagggtttg ttatttattt tttttgtgga gttacggttt 1980 

cgttcggtag aggatttcgc ggagagtttt tgggggttgt ttttttaatt ttagtcggag 2040 

tttgggtttt acgttttttt aggggttatt taggacgcga atattttcgg gcggcggtta 2100 

gtattttgtt ttttttttcg gtcgtcgttt gtgggttcgg gttttggatt tcgttcggtg 2160 

tttggaagga gttcgcgggc gtcggtgcgt ttggagttgt tttttgcgcg tattagattt 2220 

gcgttaggtt aggtcgaagt tttttatcgg ttagcgatat tcgtcgtttc gttcgtttgt 2280 

tttttttagg ttatcggggt ttttagggtt ttcgattacg tatattaggt tttggttagg 2340 

ggattcgata tgttgggcga gtttcgagcg gtttttagtg gcggcgttgt ttatggtttg 2400 

atgtttatgg ggttcgtgat ttggtttcga atcgatgttt tttgtttatg gtgattttta 2460 

cgttttggaa agtcgcgtta gtgaagttcg tggtttcgcg agcgagtcgg gttggcgtag 2520 

attgggtttt ttggtcgtag cgggttcgcg tttgacgggg agggttcggt tgattcgtat 2580 

atttttatgg tttatttatt gtttgttggg cgtacggggt ggggatttaa agacgaattt 2640 

tatgtagagg tttgggttta gttttggtaa tagaaaaggt cggtatcgtc ggagtaggac 2700 

gcgcgcgtat tattatttat tcggtcgggg aggttggggc gttcgcgggg agtagattcg 2760 

aggttgggcg ttcggaggta cgagtaggat tttggtcggt ttttagtttt tagaatttag 2820 

ggttaggcgg ggatgttagg gtttgcgcga tttggggggt tttagaggcg tcggcggggt 2880 

ttggcgcgga ttattagcgg tttgcgggag ttaggggtgg agggggtcgt tgtttaggga 2940 

agaggatcgg ggttcggagt gggggtgaga acgaggtacg tttttgagtt tgcgaaattt 3000 

gcggatcgag cgttcgcgta ggagttcgtt ttttttgggt ttttaggttt tagattcgta 3060 

cgatttttcg ttcggtgtta gtttagatgc gcgcgggtga tgaacgggcg gtggtcgcga 3120 

tttacgegtt cgttttgttt agttagcggt aaattcgttt tttttttgat tttagtggta 3180 

ttggagcgtg aggtaagggt taggcggtag gtagaggggt cggatttagt aaaagtaaat 3240 

ata^ttttaa_gta[aacgaaa taaatattta tatattatat atacgatata atatatgtgc A 3 P^ 
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gatagatatc 
atatatattt 
ataggtcgtt 
tttcgtgtaa 
ttttattgtg 
cggtttttga 
tttttaaagt 
cgttgatagg 
ttgttgtagt 
gcggtaggcg 
gtttttttcg 
tttcggtgtt 

tggggatttt 
ggtgtagtgt 
ggtggagggt 
ggtgcggggt 
ggtaggtgtt 
ggaggtgttt 
gagatgtcgg 
gtaaattggt 
ttttattata 
agttgttttt 



gtattattta 
atatataaat 
taattaaatt 
tatttgggaa 
ttttttatat 
ggtaaggttt 
tttgcggtat 
ttgtagttta 
ttgttgcgcg 
gcggttatga 
gacggtaagt 
agcgttttcg 
gtaggtgcgg 
ttgagtaagt 
tcggggaggt 
ttaggtaggt 
cgggtaggtg 
ttggaaaggt 
ttttttgttg 
aagtggtgtg 
tgtcgaataa 
atggttggga 



ttaaagatat 
ataaatatat 
tgaattttag 
tatatttata 
ttttattagt 
tcgggtcgga 
aagaattata 
ttttcgtttt 
tttaggggtt 
attttttcga 
atttggtgag 
gcgggtaggg 
tgttcggata 
gttttagata 
gttttaagta 
attttaagta 
cgggatttag 
gggtatcggt 
ttattttgtt 
gttattttgt 
atgatttaaa 
tttgaggttt 



seq_prot . 
atttgtaggt 
atatatgtat 
attagcgagt 
attagatttg 
agttttagtt 
aatttttggt 
agttttataa 
ttgattttcg 
tcgcgggttt 
ggtatttaag 
cggcggggat 
tcgggggcgg 
ggtgcggggt 
ggtgcgggat 
ggtgcggggt 
gatatggggg 
gaagatatcg 
agtgtcgatt 
ataaaaggta 
tatttttatt 



txt 

aaatatatat 
gtgtatatat 
aattttttcg 
ttgtgaattt 
ggaggtgttt 
cggaagaaac 
tgtcgcgcgg 
gtttcggagg 
tcgggcgtag 
ttttttagtt 
ttgggttacg 
tcgcgtgggt 
tcgggtaggt 
ttaggaagat 
ttaggtaggt 
tcggataggt 
tagataggtg 
gtatataggt 
aattgatgat 
ttttttttaa 



tattagtatt ttatttttag 
tcgttttatg ggtggta 



ttatatgtaa 
acgtagatat 
tattagtatg 
gaaatttaaa 
aaaggttttt 
gtgggcgttt 
aagtcggcgg 
tcggcgtcgg 
ggtggcgttc 
tattttgtaa 
ggtaggatat 
ggaggtagtt 
attcggggta 
attttagata 
attttaagta 
gtaggatttc 

gggggtttgg 

gagggtatag 
tttgttgttt 
tttttttttt 
gtaaggaagt 



<210> 236 
<211> 4453 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 236 



3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4607 



ttggttttga 
gtaagtatag 
taatttgtat 
aggagttttt 
ggtggattta 
ggaaagggtt 
gagcgagttt 
agatgggggt 
cgtgtgtgtg 
gtagttattt 
tttatttgaa 
taaaaaatgt 
taagagtgta 
ttttaatttt 
ttagaattgt 
aattgtttat 
agtttatgtt 
taatttttat 
ttggttttgt 
aataagttcg 
aatagacgta 
gtatgtttcg 

gggggtttga 

gattagagga 
tttttttttt 
ggaggaagta 
ttttcgttgg 
tcgggtcgtt 
tcggggcggg 
gtttcggagt 
gtagttatcg 
ttttcgagcg 
tgtacggagt 
ggagttcgga 
ttttgcgttg 



agtttatagt 
aagttttttt 
tttttttagt 
ttttttttta 
gatttaaggt 
taaagtcgtt 
tattgggttt 
gatagaatta 
tgtgtgtgta 
atatatggtt 
tagaataaat 
ataattattt 
aaaggataaa 
tttatttaga 
ttatatgttt 
tatggaaatt 
ttaggatttt 
taagttttgt 
tttttttttt 
gttttgtttt 
tagagggagt 
tatatttgtc 
ggtagtttta 
ttttaggagg 
tgtaattcgg 
cggagaattt 
agatcgcgtt 
gagcgttgta 
ttttgatata 
tcgggttagg 
gtgagtgtcg 
tttgtttttt 
taaggcgttt 
gtttagagag 
tagtcgttcg 



atcgttgatt 
agaagatagt 
aggtagttgg 
tttttttggt 
ttagatttgt 
agtggattgt 
aaaatgattt 
tagaattttt 
tatataaatg 
tgtttttttg 
aaggttttat 
ttttggaatg 
tagtgaaaaa 
ggtgagaata 
ttattgtttt 
ttttaaaaga 
gtttttataa 
taaaaacgga 
ttgatagtat 
atttattggt 
ttttataggt 
gcgttgatga 
ggtaagttgg 
ttttagagtt 
ggattttttg 
ataagttttt 
ttttttaaat 
agatagggga 
gagtaggcgt 
gtttatttgt 
cggttttgag 
tcgttttggg 
cgtttcgggc 
gagagagata 
tttggtttgt 



tagtttgttt 
gggttatttg 
tattttgagt 
attttttttg 
aaggtaggaa 
tgggatttag 
taagttttgg 
gttatgtttt 
tttgtttatt 
gaggataatt 
gatgaagtaa 
ggttatatag 
taaatttttt 
gaatttttat 
tatttttatt 
ttcgttaata 
ttttgtaata 
ttttaaatta 
tattaaataa 
tgtgttggtg atatttgggg 
aggggttttt ttgtttgtgt 
agattttata gttttattag 
ggtttagcgg ggagaagttg 
gggcgaggta gagagttttt 
tattggggta ggttttcggt 
cgatttttta gtttagacgt 
ttttgtgagc gttgcggaag 
gggagtcggg cgggagaggg 
cgcgggtcgt agtatagtgc 
tttcgtagcg tcggttcgcg 
attttcgggt cggatgcgcg 
ttgttcgggt tttttgggtt 
gtttttcgcg ggtgtcgatt 
gttggggagt ttggttatcg 
tttttcgttt tttcgttttt 
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tttggaaggt 
ttttttaaaa 
tttcggttgg 
tttttttttt 
aatgttgtag 

tttttttttt 
tttttgatat 
tttaagtgtg 
tttaggtagg 
ttatttgata 
aatattaaat 
agatgtgttt 
ttttattttg 
attttttaga 
tttgtttgta 
ttttaatagg 
ttatattacg 
agttgtaaat 
atttttttat 



tggtagttta 
gttgaaaggt 
ggtagagtaa 
gttattttta 
gtttaggttg 
tttattaaga 
taggggaaag 
tttagagatg 
aagggtggat 
aataattgtt 
atatatgtat 
tttaaaatgt 
ttttttagtt 
atttttatag 
taaataaatg 
aagtattaat 
atatttaatt 
ttttagtaat 
tgtcgaaagt 
attgttattg 
ttttgggaga 
ttgcgggtaa 
tagaagaatt 
tgtgcgtttt 
taggtgtatg 
tgttgggttt 
tacgcggggt 
aggggcggcg 
ggagatcgta 
tttttttgtc 
gcggttttag 
tttcggcggt 
taggttgttc 
cgggtatttt 
tgttttgatt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 



tttttttttt 

tggcgcgttt 

tttgattgat 

ggagttgggg 

tagggtggta 

aggtcgtcgg 

ttattttttt 

aggcgagagg 

agtttttgga 

tgaattttat 

ttgggaaggt 

tattattttt 

ttaaatggag 

ttaaagaaat 

agagaaattt 

tggaggtagg 

ttacgaaata 

ttggtaggtt 

gtttgtaatt 

aagattagtt 

ggtatggtag 

tgaatttggg 

gatagagtaa 

tttttatatt 

gaatggttat 

ttggaatttt 

gttgggtttg 

ttttttatag 

tataagaqaa 

tttgggttgg 

cgattttggt 

gattgcgtag 

tttatatgtt 

atgttttttt 

tttttttttt 

taagattatt 

taggaggtat 

ttgggtgatt 

ttttgtttta 

ttttttattt 



ttgtagagtc 

gtttttttgc 

gttgtttaag 

tttttcggat 

gcggaggttt 

gattgtttta 

tggttggaaa 

gagagggaag 

gtatgggatt 

taagttttat 

tttaggggat 

attttttttc 

gaatatgtgt 

tcggagggta 

ggggataggg 

gtgggtgata 

gtggttatat 

tgaaacgata 

ttagtatttt 

tggataatat 

tggatgtttg 

aggtagagat 

gattttattt 

tttttatttt 

ttttatcgag 

gttattgttt 

gggaaatata 

ttttttagaa 

gtgcggtttt 

aatgatattt 

gttatgtgta 

atggtgagta 

tttttatttt 

ttatttgttt 

ttagaagaat 

gggggagaat 
cgggggggtt 

agcgttatat 
ttttaagtat 
gtt 



gtcgtttagc 
gttttggtcg 
gatttttgat 
tttatttata 
gagaattacg 
gtttgcgggt 
tttatggttt 

agtgggtttt 

tttgatgaag 

gtgagtattt 

attttttttt 

gtttatttta 

gtttaataag 

gtattgtgaa 

agggatgggt 

tttttatttt 

aaatgtgagg 

tttttaaaat 

gagaggttaa 

ggtgtaattt 

tagttttagt 

tttagtgagt 

taaaaaaaaa 

tttgtttttt 

gaaaggttag 

tttagtaaac 

attattgtag 

attttgttat 

tggttgagtt 

tttatttttt 

ggtgggttta 

ttattgattt 

attataggag 

ttttaagata 

taaaatttta 

ttattattat 

ttgttattta 

attgttttat 

attttttttt 



seq_prot. 
gtttcgattt 
tgagcgattt 
tagtattagg 
gtagggttag 
ggggttggta 
atttggtaga 
tttatttgag 
gggattgggg 
ttttttttcg 
atttttataa 
tgttttttgt 
tttttgtttt 
tattttttta 
ataggtatgg 
gggaggtaag 
tatgtgataa 

tggggttgga 

gttcgttggt 
ggtgagtgga 
tgtttttatt 
tatttgggag 
tgagattata 
aaataaaagt 
tgtagataag 
tattgatatt 
gtattatgtt 
gtgaggtggg 
tatttttttt 
ttttttgagg 
tgtgttgtta 
aagttgtttg 
gttgatgata 
gattgaggag 
tttttttgtt 
gtgtggttaa 
cgagaattag 
cgtgtgtgga 
gtacggtttt 
tttttttagt 



txt 

cgttattatg 

taaagtgagt 

ggagaggagg 

atttttttta 

ttggttggta 

tgaagtttgt 

aattagatac 

ttagtttatt 

aatttttttt 

tagttggttg 

tgtagggttg 

ttttagcgaa 

atatttattg 

ggatttttat 

agtaggtagg 

gtataaatat 

aggagatttt 

agtcgggtat 

ttatttgagg 

aaaaatgtaa 

gttgaggtag 

ttattgtatt 

tagttggaat 

ttaaaaattt 

atgggtcggt 

tatagatttg 

ggtaataagg 

ttttagaggg 

ggaggaggta 

ggaatttaga 

tttaagagtg 

gtggggtgga 

gtggggggtg 

tgttttttag 

aagattttga 

ttttggtttg 

ggtagtttta 

gggttttttt 

aaagtgtttc 



agagttttgt 

gcgtttttgt 

ggttgtttag 

ggaaatggga 

agggaggaag 

ttgggttaat 

gaatagggtg 

tttattttgg 

agggtagtaa 

tatagataag 

cgttattttt 

ttgtgattgt 

gtgtaattgt 

tgtaattggg 

agttaggagt 

atatatacgt 

gtttagtttt 

ggtggtttac 

ttaggagttt 

aaattagttt 

gagaattgtt 

ttaattgggc 

gttttttttt 

gttatgaggg 

tttgtttgtt 

atgtttttta 

attaaaagtt 

ttggttatag 

gggaaggttt 

taatcggagg 

tatggtgtat 

aggggataaa 

ttcgagaggg 

gaaaaaagtt 

ggtttcgttt 

cggttattta 

ttagtttttg 

ttttcgattt 

gttttatttt 



<210> 237 
<211> 4453 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 237 

ggtagatgag 
gggtggggta 
gttgattatt 
tcggtgtttt 
tttaataatt 
tggaggagag 
taggagaaag 
ggggatatgt 
tatttgcgta 
agtattaggg 
attttaattt 
tatttttttt 
aagttgtagg 
ttttaggttt 
ggtagagttt 
aaagtgatta 
gagaatatga 



ggagaaatga 
gaagagtcga 
tagtaagggt 
ttgtagatgg 
ttaaagcggg 
gagggttttt 
tatttttttc 
aagtttgttt 
gttatgtatt 
tcgttttcgg 
agagggtttt 
gtgttatggt 
gagggttttt 
agttgaagag 
tagggtaggt 
ttttttttat 
gaaagagaag 



ggcggaatat 
agaggagaaa 
tgatgaggtt 
tcgtaaatta 
gttttagagt 
tttttggagg 
gggtattttt 
ttttttattt 
atgtattttt 
ttgtttgggt 
ttttgttttt 
tagttttttg 
ggtttttgtt 
tattagattt 
agggtcggtt 
agtaggtttt 
aatattttaa 



tttgttggga 
tttagggtcg 
gtttttatat 

gggttggttt 

tttttggtta 
ataaatagag 
tattttttta 
tattgttatt 
ggataagtag 
ttttggtaat 
tttttttagg 
ggagaagggg 
gtttttattt 
gtgggtatgg 
tatggtgtta 
tgatttattt 
ttaattttta 



aggagaaagg 
tatatgaagt 
acgtaggtga 
tcgatggtgg 
tattgaaatt 
ggatgtttta 
gtttttttgt 
agtaggttag 
ttttaggttt 
atagaaagat 
gaagatttaa 
atggtaataa 
tatttgtagt 
tacgtttgtt 
gtgttggttt 
ataaggggat 
tttttttttt 
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gatgtgtttg 
agtgtgtggc 
tagagttttt 
tgaatttttt 
ttaatttttt 
gggaggtagg 
gatggaataa 
tgatgtttat 
atttgtatat 
aggggatgtt 
ttagaggtcg 
ggtttttggg 
aattatgttt 
gaaggatagt 
tttttcggta 
aggaggatga 
_ttgagatgga_ 



2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4453 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020. 



seq_prot.txt 

gttttgtttt gtcgtttagt tggagtgtag tggtgtgatt ttagtttatt gagatttttg 1080 

ttttttaggt ttaagtaatt tttttgtttt agttttttaa gtagttggga ttataggtat 1140 

ttattattat gttaggttga tttttgtatt tttagtagag gtagggttat attatgttgt 1200 

ttaggttggt tttgaatttt tgattttaaa tgatttattt attttgattt tttaaaatgt 1260 

tgggattata agcgtgagtt attatgttcg gttgttaacg gatattttaa agatgtcgtt 1320 

ttagatttgt tagaagattg gatagggttt ttttttaatt ttattttata tttgtgtggt 1380 

tattgtttcg tgagcgtgtg tgtgtgttta tgtttgttat atagggatga agatgttatt 1440 

tattttattt ttagttttta atttttgttt gtttttgttt tttatttatt tttttttgtt 1500 

tttaaatttt ttttttagtt gtagtggaga tttttatatt tattttatag tgttgttttt 1560 

cgaatttttt tgggtagttg tattagtgaa tgttggagaa gtatttgttg gatatatatg 1620 

tttttttatt tagatagtta tagttcgttg gagagaataa aggtggggta agcgaggggg 1680 

agtggaagtg gtaaggggtg gcgtagtttt gtagtagagg gtagggaggg gatgtttttt 1740 

gaagtttttt taatttgttt gtgtagttaa ttgttgtagg ggtggatatt tatatggaat 1800 

ttgatgaagt ttattgttgt tttggaagag attcgggagg aggttttatt aaaggtttta 1860 

tgttttaggg attttagggt gagggtaaat tggttttaat tttaaaattt attttttttt 1920 

ttttttttcg ttttattttg ttcgtattta gtttttaaat ggaagattat gggtttttag 1980 

ttaggagaaa tggattgatt taagtaagtt ttatttatta gatgttcgta ggttggggta 2040 

gtttcggcgg tttttttttt ttttgttagt tagtgttaat tttcgtggtt tttaagtttt 2100 

cgttgttatt ttgttttatt ttttggggag agtttggttt tgttgtggat ggaattcgga 2160 

ggattttagt tttttgagta gttttttttt ttttttggtg ttgattagag gtttttgggt 2220 

agtattagtt aaagtaagag cgtatttatt ttggagtcgt ttacgattag gacgtagaga 2280 

agtaggcgcg ttagtagggt ttttatggtg gcgaggtcgg ggcgttagac ggcggttttg 2340 

tzaaaggaagg agaagttagg gtaagaggcg gaggaacggg aaggtaggtt aggcgggcga 2400 

ttgtagcgta ggggagatgt tcgcggtgat taggtttttt agttgttttt tttttttttg 2460 

ggtttcggat ttcgggtagt ttggatcggt attcgcgggg gacgttcggg acggggcgtt 2520 

ttgatttcgt gtagtcgtcg gggagtttag ggagttcggg tagtttaggg cgggggaggt 2580 

agacgttcgg gagttggggt cgtcgcgtat tcggttcggg gattttagga tcgcggtatt 2640 

tatcggtggt tgcggtagga gggcgcgagt cggcgttgcg gggataggtg gattttggtt 2700 

cgggtttcgg ggttgcggtt ttcgtattgt gttgcgattc gcggcgtttg ttttatatta 2760 

gggttcgttt cggcgtxgtt tttttttttt tcgttcggtt tttttttttg ttttgtagcg 2820 

tttagcgatt cggatttcgc gtgttttcgt aacgtttata aagatttggg ggaagcgcga 2880 

tttttagcgg aggggattta atagcgtttg gattgaggaa tcgagaggtt tgtaaatttt 2940 

tcgtgttttt ttttatgtat ttggtcgggg gtttgtttta gtgtaaggag ttttcgaatt 3000 

gtagagagga gagaaggcgt ataggagatt ttttatttcg tttagttttg aagttttttg 3060 

gggtttttta attagttttt ttgtaatttt ttttcgttgg gttttaattt gtttaagatt 3120 

gttttagatt tttttgttcg tagttgatgg agttgtgaag tttttattaa cgcgataaat 3180 

gtacgagata tattttttta gaagtataga tagaaaaatt tttgtttgta ggggtttttt 3240 

ttgtgcgttt gtttagtggt agtttttaga tattattaat ataattagtg gatggaataa 3300 

agtcgggttt attgttttcg gtagtaaggg ggtttgtttg atggtgttat tagaggggga 3360 

aaggtaaggt tagattattg aaaatttgta gtttggttta aagttcgttt ttgatagggt 3420 

ttgataagga ttgggttagg tgtcgtgata tgatgttata ggattgtggg aataaagttt 3480 

tagggtataa attgttggtg ttttttattg aagtgttaac gggttttttg ggaagttttt 3540 

ataatgagta atttatttat ttgtgtaggt aagaataaaa gtaaagataa tggaaatatg 3600 

tagatagttt taattgtgga ggttttggag ggtgtggaag ttttgttttt atttttgagt 3660 

agaggaattg ggagattgga ggataaaata agaggaagat ttatttttta ttgtttgttt 3720 

ttttatattt ttaatatttt aaaaagtata tttttgtata gtttatttta aaaagataat 3780 

tatgtatttt ttaatgtatg tgtatttagt gttttatttt attatagagt tttgtttatt 3840 

ttatttagat agaaataatt gtttattaaa taaaattgtt ttttagaaaa atagattatg 3900 

tgtaaatgat tgtatttatt ttttttgttt gaggataagt agatatttgt gtatatatat 3960 

atatatatat acgtattttt gggtatattt ggaggaatat agtagggatt ttgtgatttt 4020 

gttattttta tttttttttt ttagtgttag gaattagggt ttggggttat tttgaattta 4080 

gtaggattcg tttttttagt gggaaggagg aggttgagtt ttagtaattt attagcggtt 4140 

ttgggttttt ttttagttta ggtttgtagt atttttttgt tttgtaaatt tggattttgg 4200 

gtttgggttt atttgagagt gatagaagga aggtagggag agtgttagga aggtaggaag 4260 

gaggaaggtt tttttgtttt gttttagtcg agggtttagg gtgttagttg tttgttgggg 4320 

aaagtataag ttagtttttt agtttttggg aggtaggtga tttattgttt tttggagaga 4380 

tttttgtgtt tgttgagttg ttagtttttt aggggataga ttgagttagc gatgttatag 4440 

gttttagagt taa 4453 

<210> 238 1 
<211> 6001 " i 

<212> DNA 

<213> Artificial Sequence « 
<220> ' i • * ■ 

<223> chemically treated genomic DNA (Homo sapiens)^- ».•«.. 

<400> 238 » \ i't ' < *• < . ; i, f- 
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seq_prot.txt 

aaatttatag gtgttggtgt tatagaggaa gagattattt tttttgttat tataatttta 60 

aatttaaaga gaatatttta gttattaaaa tattttatag atttttaata aaaaaaagta 120 

ttattttgaa gttttaaaat atttgtttta aattttaaat ttaataatta tagttgtatt 180 

gtaaggttat tgtttattga taattttaaa atttagttaa gtgattaata ttcgttttta 240 

ttrttttataa ttttttttta gttaattttt ttttagtatt tttttttgat agtgttattt 300 

tttaaagttt gcgttaatat tgatagtggt gaatqaaaat ttaatattta ttttttattt 360 

tttttttatt tttatttttt gtttattagg tggataatat ttatgattat aaaattttat 420 

attttggaaa agagrtagtg atgatttttg aatatttttt tattattttt ttatttttaa 480 

ttgttttttt gttttaacga ttgaaataat tttttatttg aaatgtattt agataaagag 540 

gaaataaagt tttaataata aagataaata ggtatagtgt tttttgtgat ggtttgtttg 600 

gtttaaatga agattgatta tttttaagtt aataggggtg gaagcggggt gttaagtttt 660 

tgataattta tttgtaaaat tagtttattt tttttagttt atgtagtttt ttttaaaata 720 

tttggtaaat atgtaatttt ttgattgtaa atgttaattt tatatttaag ttagttattt 780 

tttaaaataa tgtaagggtt aggaatgaag taaattagtt tgtgttggat tataaagtta 840 

ttaatatttt taaaaattgt ttttgtaggt ttataattat tattataata aagtatttaa 900 

aaagtgatta ggtaatagta aagtgaaatt tattttttta aaaataatat atatgtacgt 960 

atgaattaag aagttataga aatatgttga gtrttattaa aatgttaaat ttagaaattg 1020 

ttaaaaaaga gaataattta ttgatttaaa tttaataggg ttgtatattt taatttgttt 1080 

ttgtaaagga taaattagaa tgatgtataa taattttttt tttggtattt atattagtaa 1140 

taattaggaa ttatataggt ttttattttg agttatagtt ggttattttt ttttttttaa 1200 

agttatatat attttagttt atatatattt ttgaaagata ttttatttag agttagattt 1260 

aattatagta aaattatatt tatagaagat gaaaaattat atatatattt gttaaaatta 1320 

gaatagatta tatttagggg ataatattta ggtatgttaa cggagtttaa aatgttaagg 1380 

aaattatatt ataattttgt ttagtatatt ataggttgtt aaatcgaaat gttatgttag 1440 

ttaggagtgt agtaattttt attttttggt tttatttaat taggaagttt tagtagagcg 1500 

aagtttgtta agcgttcgtc gttagaattt gaaggaattc gagcgagtaa gaagagtgtt 1560 

tgatttattt tatagaagtt tgtttagaaa tggaggagtt agcgtttatt gaagtcggtt 1620 

tcgttttcgg ttcgtttata tggagtttga ttagttttag ttatgtttat ttcggtttgg 1680 

gagattcgta aagtgttttt ttttttaatt tttttgtatt attttgaagt ttagggaagt 1740 

aaagagaggg gtatatttgg attgtaaaat taatgttttt tgtcgtttag gagagaaggg 1800 

aatgagagag agagagagat agatagatag agagagagag agagagagag agagagagag 1860 

agagagagag agagaaattt tattgaaatt tagttttttt agaatttgtg tgatttggtt 1920 

tttaacggga gattagtgcg attttatggt atttttgtta ggaattagcg atttttttgt 1980 

agttattatt tgatttattg ttttttcgtt tatttttttt tataaagtta tttttttttt 2040 

attttagtaa gatttttttt tttaatgatg ataaagtttt tgttttagtg ttttttttag 2100 

gattggtgtt tttttaaaat agtgaattta gaaaattatt tcgtttaata ttttttaaaa 2160 

ttttcgtagt tttaatgtaa gcgtaagtat gtaaaggttt tttgttatat ttgtattttt 2220 

tgtttatttt agaattattt tttattttcg ggtttgtaat agtttttttt gtttttttgg 2280 

atagaggtgg gtggtattag gggtttaggg tagtaggagg tgaggggttg aggaggcgcg 2340 

ttagggtagg ttggtttgtg ttggatacgc gtgttttttt gcggagttaa agggtcgggg 2400 

acgggggttt tggatttatt agagtaattt tagtcggtgg gcgtttggta gttatttaag 2460 

gaggtaggga aagtagcgag ttttatcggg cgggttacga tgagtagtat gacgggtagt 2520 

agtagtagtt agtaaaagtt ttcgtaaagt gtttagttgt tgtattgtcg cggggatttt 2580 

tatagtatta tgattagttc gtgtaatttt gtagtagtaa acggttttcg aggaatatag 2640 

gatcgcgggg gtcgggtagc gggttattga gtatttcgcg gacggcggta gtagaggcgg 2700 

cggcggtggt agtggtattc ggcggggaag tagtagttaa attcgcgtat gatttcgaga 2760 

gttttagtaa tatttaggga ttgggtttag tttcggagcg agagggtcgt tcgttgagaa 2820 

gttgcgtcgg agacgcggga agttgttgtt ataaggaggg agttttggga agtcggagga 2880 

taggaggaga cgggagttta ggggtagacg agtggagttc gaggaggtag ggtggaggga 2940 

gagttaaggc gtttcgtagt tcggtagtcg tttttcgagt tttgtcgttc gtattttttt 3000 

ggcgtttggg aagtagtagg tttttagttc gttcggggtt acgtgggaag aggtagtcgg 3060 

gttttgattg gtggagtagg atgtaggttt cgggagggag gggtcgacga ggaggtgtaa 3120 

ggatgtaagg aggaggcggt cgcggaagtt atagatgggt tcgttcgtta ggcgttggtt 3180 

cgagtggggt taggcgggga tggtttaaat gagaagttcg ggttttaggg tgggttattc 3240 

gtatatttat atattattcg ttttattttt cgttttagga cgttttttat cgaaggcggg 3300 

gttcggatta gcgttttttt ttcgcgcgtg atttcgggtc gcgagtgcgg gtcgcggttg 3360 

ggtggcgttt tttcgagttg gagatggtgg gggcggaggt gttagaggag tagtagtagt 3420 

agggtagaga ggggcgagtc ggcgcgggag agggcgtttt gttggcgatc ggcgttttag 3480 

cgtgcgggag cgcgtcgttt aggttgtagg gggatgtagg ttgggaatgt cgcggcggag 3540 

aggttaggga cgttttttta gggatttata ggaaagaggg tgagaggcga tggtgttaga 3600 

atcgtttttg tcgatttgga agtaatagta gtatttttta taagagcgtg taattttaag 3660 

gttgttcgtc gaggtagttt agttatttcg gtaggcgttt tttttttttt tttttttttt 3720 

tttttttttt tttaggtttt tcgtagtttc gatttagttt aagcgttcgt aggtttgaat 3780 

tttttttttt attattcgtt tttttttagt tcgtagttta ttagtgtgtt tatttgggag 3840 

gtgcggttag atgtgtttgg aaggttagat tggtcgggat aagtggtttg agagaaagag 3900 

aaaggttttt ttgtatacgt cgcgggtggg ttgtcgggag tatcggtcgg gtagcggcgt 3960 

tcgggaaggg gagagcgggt tttatttgtt ggtttaggya_gtgattttgc gttttttatt _4020 
" " seite 144 " " " 



seq_prot.txt 

cgggtttttg tcggatggtc ggtgatttgg ggcgacgaga gaaggtttaa ttcggtagga 4080 

gtttttggtt ttgcgcgttt tttttatttt ttttagcggg aagggtaaac ggtatagcgg 4140 

gattcgtttt tcgtttgttg tattttttag gtagttagat atatttttta gtttaatgga 4200 

attttagtcg ttagtaacgg gattaagagt tttcggggat aagggtggag aggaatattt 4260 

tttttttatg atcggggtta ttattgtagt tttagtgttt tggatgtttt atagggaaga 4320 

gttttttttt tggtgtgtga ttatttagtg atttttgttt ttgtttttgt ttattttttt 4380 

ttcgtttttt ttttttattt ttttttgtta tttttttttt tttttttttt ttcgttttta 4440 

aaagttttcg gatttttttt ttttttattt aaattttttt tttgtgtttt tttttttgtg 4500 

ttttttgaat ttaggagagt atttgataat atttaatagg taattagtgt ttatttttaa 4560 

ttatttaaaa gaggtattta tatattttga aaacgggatt atttattttt tgtagatatt 4620 

agtagaaaaa taaattgtat tcgagtaatt tttttaagta ttttaatttt taattttttt 4680 

ttattttttt gttttttaat tttttttttg agagatgtga tcgtgtagta ttttagtgtt 4740 

ttaacgaaat tttttttttt tttttgtgtg aaatttattt ttttatttta tattttcgtt 4800 

ttcgttcgag attgtttttt ttttttttta tttttaaaga tttttgaatt ttagtgtttt 4860 

ttatttttgg taattaagta gtagatttta gtattttagt cggtggtatt tcgtttttta 4920 

tcgacgaaga ttttattaaa atagattaat tagattagac gttggaggta ttagaaaatc 4980 

ggtttttaga tagagtagtt aaatttttta aggaaataga atatttatta gatagagttg 5040 

ttaattaata ttgtaaaata aggaattaga aatttttttc gttataggtt tttagtagag 5100 

aaggtaatat aaatatagat taagatttaa taattttata gtagagaatg agaatatgtt 5160 

attttttata gtaaggttgg tgtggtaatt aattaggttt atgaaaataa gttatgtttg 5220 

aaattaaagg taaagttttt aaaagtgttt atgtagtaat tatgataatg aaataggatt 5280 

tgttaggatt ttagagtttg gttatgtaag tagaatttta gagaattttt tagtagagga 5340 

aaattgtttt tgaatttttt gttaagtaaa tttttggtat attttttaat aatatatgtt 5400 

ttttttaaga cgttttgtta aaagtaagtt aaaattttaa aggagttaat tattggttgt 5460 

aattggttaa taaatgcggt tgtttttata gaggtttttt aaattattaa atagtttgaa 5520 

gtaaagtttt tttaatggga atgttgtaat tttgttgtat ttattttgta tttagtgtta 5580 

tagtgttatt aagaaataaa ttttgaaatt ggtaagtatt attaagtggt agaagaatat 5640 

tatttattga gtagagaatt gtattattga atatgtaaat aaaaatatat atattattta 5700 

gatttgttat taggtattaa agaagtagat aagattgtat tagtaattgg attagtgttt 5760 

taattttttt ttagtaaggt aaaattagtt tatttattag aattaaattt aagtttatga 5820 

attgtatttt gtattgcgta ttatatgatt gttagtaata tgatataatt atattatgta 5880 

tttgtaaaat ttttatttta aaatattata ttatatttat ttttaatttt tttgagttag 5940 

aatattttat ttgtggtata tatattttag aattgatgta gaggagtaga gtttagttgt 6000 

* 6001 

<210> 239 
<211> 6001 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 239 

aataattgga ttttgttttt ttgtattaat tttagagtgt atatgttata aataaagtgt 60 

tttagtttaa gaagattgaa agtaaatatg gtatagtatt ttaaaataag aattttgtaa 120 

atatatggta tgattgtgtt atattattag taattatatg atacgtaatg taaagtatag 180 

tttatagatt taaatttaat tttaataagt aaattgattt tgttttgttg gggaaaagtt 240 

aaagtattaa tttaattgtt aatgtagttt tgtttatttt tttggtattt agtgataagt 300 

ttaaataatg tatatatttt tatttatata tttagtaata taattttttg tttaatgagt 360 

gatgtttttt tgttatttgg tggtgtttgt tagttttaga atttgttttt tggtggtatt 420 

ataatattaa gtatagagta agtgtaataa aattgtagta tttttattga aaaggttttg 480 

ttttaaattg tttaataatt taaaggattt ttgtggaagt aatcgtattt gttaattagt 540 

tataattagt aattaatttt tttggagttt taatttattt ttggtaaaac gttttaggaa 600 

gagtatatat tattagaaag tatgttaaaa atttatttag tagaaaattt aaaaatagtt 660 

tttttttgtt aagaggtttt ttaaaatttt atttatatag ttaaattttg aaattttagt 720 

aggttttgtt ttattattat aattattgta taaatatttt taaggatttt gtttttagtt 780 

ttaagtatga tttatjtttta taagtttgat tagttattat attagttttg ttatggaaaa 840 

tgatatgttt ttattttttg ttgtagagtt gttaaatttt gatttatatt tatgttgttt 900 

tttttgttga aagtttgtag cgaaagaaat ttttaatttt ttgttttgta atattagttg . 960 

gtagttttat ttaatgggta ttttgttttt ttaaagaatt tagttgtttt gtttagaagt 1020 

cgattttttg atgtttttaa cgtttggttt aattgatttg ttttaatgga gttttcgtcg ,1080 

gtgaggagcg agatgttatc gattagaatg ttgggatttg ttgtttaatt gttaggagtg 1140 

agagatattg agatttagaa atttttggag gtgggagggg agagggatag tttcggacgg 1200 

aggcggaga ( t gtaagataaa gggatggatt ttatatagga aaaaaaaaaa gatttcgttg : 1260 

^ggtattgag gtgttgtacg attatatttt t£aa;aggaga agttaaaaag taaggaagtg 1320 

gaaggaggjtt ggaggttaaa gtatttaaaa ggattattcg ggtataattt gpttttttijt 1380 
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tggtgtttgt aaaggataga tagtttcgtt tttaaagtat atgaatgttt tttttaagtg 1440 

attgggaatg gatattaatt gtttgttaaa tgttattaaa tgttttttta aatttagggg 1500 

atatagaaag aggggtataa aaggagaatt taaatagaaa aagggaggat tcggaggttt 1560 

ttgaaagcgg ggggagaaga aggaggaggg ataatagaga ggaatagaga aggagagcgg 1620 

agagaagata aataaaaata aaaataggaa ttattgaata attatatatt aaaaagaaag 1680 

ttttttttta tggggtattt aaaatattga gattgtaata gtgatttcgg ttatggaaga 1740 

aagatgtttt tttttatttt tgttttcgaa agtttttggt ttcgttattg gcgattaaaa 1800 

ttttattagg ttaaagagtg tgtttaattg tttgaagaat gtagtagacg gaaggcgggt 1860 

ttcgttatgt cgtttgtttt tttcgttgga gagaatgaaa gaaacgcgta gagttagaga 1920 

tttttgtcga gttagatttt ttttcgtcgt tttaggttat cggttattcg gtaaagattc 1980 

gagtaaggaa cgtagggtta ttgtttgggt taataaatgg agttcgtttt ttttttttcg 2040 

gacgtcgttg ttcggtcgat gttttcggta atttattcgc ggcgtatgta gaggagtttt 2100 

tttttttttt ttagattatt tgtttcgatt aatttgattt tttaaatata tttgatcgta 2160 

ttttttaggt ggatatatta ataggttacg ggttggagag gagcgggtga tgaggagagg 2220 

gatttaaatt tgcgaacgtt tgggttgggt cggagttgcg gggggtttgg gaggagagag 2280 

gggagaagag agaaggaagg agagcgtttg tcgggatggt tgagttgttt cggcgagtag 2340 

ttttggggtt gtacgttttt gtgggagatg ttgttgttgt ttttaggtcg gtaagagcgg 2400 

ttttaatatt atcgtttttt attttttttt ttgtaaattt ttagagaaac gtttttggtt 2460 

ttttcgtcgc gatattttta gtttgtattt ttttatagtt taggcggcgc gttttcgtac 2520 

gttggagcgt cggtcgttag taggacgttt tttttcgcgt cgattcgttt ttttttgttt 2580 

tgttgttgtt gtttttttga tattttcgtt tttattzattt utagttcgga gagacgttat 2640 

ttagtcgcgg ttcgtattcg cggttcgggg ttacgcgcgg aagaggggcg ttagttcgga 2700 

tttcgttttc ggtagggggc gttttggagc ggagagtgag gcgaatggta tatgagtgtg 2760 

cgggtagttt attttgaagt tcgagttttt tatttgagtt attttcgttt agttttattc 2820 

gggttagcgt ttggcgagcg agtttatttg tggttttcgc ggtcgttttt tttttgtatt 2880 

tttgtatttt ttcgtcgatt tttttttzttc gggatttgta ttttgtttta ttaattagag 2940 

ttcgattgtt: tttttttacg tgatttcggg cgggttgagg atttgttgtt ttttaaacgt 3000 

tagagggatg cgggcggtag agttcgagag gcggttgtcg ggttgcgggg cgttttgatt 3060 

ttttttttat tttgtttttt cgggttttat tcgtttgttt ttggattttc gttttttttt 3120 

gtttttcggt tttttagagt ttttttttta tggtagtagt ttttcgcgtt ttcggcgtag 3180 

ttttttagcg gacgattttt tcgtttcggg gttgagttta gtttttggat gttgttgaaa 3240 

ttttcgagat tatgcgcggg tttggtrgtt gttttttcgt cgggtgttat tgttatcgtc 3300 

gtcgtttttg ttgtcgtcgt tcgcgggatg tttagtagtt cgttgttcgg ttttcgcgat 3360 

tttgtgtttt tcggaagtcg tttgttgttzg tagagttgta cgaattagtt atggtgttgt 3420 

gggagttttc gcggtagtgt agtagttgga tattttgcga gggtttttgt tggttgttgt 3480 

tgttgttcgt tatgttattt atcgtagttc gttcggtgaa gttcgttgtt ttttttautt 3540 

ttttaagtga ttgttaaacg tttatcggtt ggaattgttt tggtaagttt agaattttcg 3600 

ttttcgattt tttaatttcg tagaagaata cgcgtattta gtatagatta gtttatttta 3660 

gcgcgtuttt ttagtttttt: attttttatt gttttagatt tttaatatta tttattttta 3720 

tttagagaaa taaggggaat tgttgtaggt tcgggggtga ggggtggttt tgggatgggt 3780 

agaaagtgta ggtgtagtag gaaatttttg tatgtttgcg tttatattgg agttgcgagg 3840 

attttgagaa atattaaacg ggatggtttt ttgggtttat tgttttgaaa gagtattaat 3900 

rttaggggaa atattgaaat agaagttttg ttattattaa agaaaaaagt tttattagga 3960 

tgaggaagaa ataattttat gagaaagaat gagcgagaaa gtaataaatt aaatggtgat 4020 

tgtaggggaa tcgttgattt ttggtaaagg tgttatgagg tcgtattggt ttttcgttga 4080 

agattaggtt atatagattt tagaggagtt gggttttaat agaatttttt tttttttttt 4140 

tttttttttt tttttttttt tttttttttt tttatttatt tatttttttt ttttttttat 4200 

tttttttttt tttaggcggt aaaagatatt ggttttgtag tttagatatg tttttttttt 4260 

tgttttttta agttttaagg tagtataggg gagttgagaa aaagaatatt ttgcgggttt 4320 

tttaggtcgg agtgggtatg attgaggttg gttaggtttt atgtaggcga gtcgagggcg 4380 

gaatcgattt tagtgggcgt tgattttttt atttttggat aggtttttgt ggagtgggtt 4440 

aggtattttt tttgttcgtt cgggtttttt tagattttga cggcgaacgt ttggtaggtt 4500 

tcgttttgtt gaagtttttt aattaaatag ggttagagga tgggagttgt tgtattttta 4560 

gttggtatag tatttcggtt tgatagtttg tagtatattg ggtagaattg tggtgtaatt 4620 

tttttggtat tttaaatttc gttaatatgt ttgggtattg ttttttaggt gtggtttgtt 4680 

ttagttttag taagtgtgta tgtaattttt tattttttgt gaatataatt ttgttgtagt 4740 

taaatttggt tttgaataaa gtgtttttta aagatgtata taagttgaag tgtatgtaat 4800 

tttagagagg agggaatgat taattgtaat ttagggtgaa agtttgtata gtttttagtt 4860 

attattgatg taaatgttaa aaggaaaatt attatgtatt attttaattt attttttata 4920 

aagataagtt gagatatgta attttattag atttgggtta atagattgtt tttttttttg 4980 

gtagttttta aatttggtat tttaataaaa tttaatatgt ttttataatt ttttgattta 5040 

tgcgtatatg tgtgttgttt ttgaaagaat aagttttatt ttgttattgt ttaattattt 5100 

tttagatgtt ttattatggt aataattatg agtttgtaaa aataattttt ggaaatgttg 5160 

atggttttgt agtttaatat agattggttt gttttatttt tagtttttgt attgttttag 5220 

gaaataatta atttaaatgt gaagttgata tttgtaatta agaaattata tatttattag 5280 

atattttaaa ggggattgta taaattaaag agaataaatt ggttttgtag ataggttgtt 5340 

aagaatttgg tatttcgttt ttatttttgt taatttagag gtgattaatt tttatttgag 5400 

--ttaaatagat^tattatagaa-aatarttgt-g* -ttgttfeat-tt -ttattattga- ggttttgttt 5460 
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tttttttgtt 
attaaagatg 
tgtggagttt 
aaaataggaa 
aaaatagtat 
atggaaacga 
tagtatagtt 
gtgttttttt 
ttgaggttat 
t 



tggatatatt 
gggaaatggt 
tgtggttata 
gtaaatgtta 
tattagaaaa 
atattgatta 
atagttgttg 
ttgttaaaaa 
gatgatagag 



ttaaataagg 
aaaagggtat 
aatattgttt 
agtttttatt 
ggatattaaa 
tttaattaga 
gatttaaaat 
tttgtaagat 
aaaatgattt 



seq_prot. 
ggttgtttta 
ttagagatta 
atttaatgag 
tattattgtt 
ggagaattga 
ttttgaggtt 
ttaggataag 
gttttaatga 
ttttttttgt 



txt 
gtcgttgaag 
ttattagttt 
taaaaaataa 
agtattaacg 
ttagaaaaga 
attagtagat 
tattttaaag 
ttggagtgtt 
gatattaata 



taaaagaata 
ttttttaaaa 
aaataaaaaa 
taagttttaa 
attgtggaaa 
agtgattttg 
ttttaaagta 
ttttttgaat 
tttgtaaatt 



<210> 240 
<211> 7001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 240 

aatgtaatgg 
gtgtggaagg 
ttggagagat 
gaatggtgat 
ttggagatat 
tagtaagtta 
gagttgggag 
gtatattttt 
aagagaatgt 
ttttgattgt 
tttttatgtt 
tttttttttt 
tggttattgt 
tttaatattt 
gtgtttaaat 
ggtattatta 
tataaatttt 
gtgattagag 
gggtgttatg 
gttagtatta 
gttattggga 
attagtttaa 
tacgtatttt 
gggtttggtt 
ttaaaatatt 
gtttagagat 
tttattgtta 
atttatatat 
tgtttttttt 
gtaggggtaa 
ttgttttttg 
tgtgtagtag 
tgtttttcga 
ttatatattg 
tttttgattt 
aggtttgttt 
tggttagtgt 
tgtgtttttt 
taggtttata 
ttgiaaattg 
gtaatgattg 
ttatttgttg 
gtatrattgtt 
aaatagttaa 
atttcgtt&g 
gtattagagt 
attatagacg 



aaaaagagag 
taagggaaaa 
agtgatgttg 
ttatatttga 
atttaacgga 
cggttggaac 
ttcgagtgaa 
ttatttatgt 
tggagagaaa 
aagtttttga 
aatagtgaat 
tttttttttg 
tgatttatag 
atttatattt 
attgtaatat 
atatttttat 
ttagtaattg 
ttatttaata 
tgttttggga 
tggtcgagag 
aatgagagat 
aataaaggta 
ttatatgttt 
aacgaagtgg 
tattttaaag 
tgggggttag 
aattatattt 
ttttgattgg 
tttgtttggg 
aggggttggg 
tgagtagata 
tttagttgcg 
atttttgtta 
agttattcgt 
ttataagttt 
tgattttttt 
ttgtatttta 
ttttttttta 
ttttttaaga 
aaagtgtgat 
ttggttataa 
ataatttata 
taataatggt 
aaatgtaatt 
atattttgtt; 
ttkttaggjtt 
tatgatatat 



attgtaaagt 
ttagttttcg 
ggtatagata 
ataagtttgt 
ggagttagat 
gtgtgtgttt 
gtttttgtta 
ttattaagtt 
gtggttaaga 
ataattaaag 
aattgtagat 
aatatttttg 
gaggtattat 
agtatttatt 
tgggggtttt 
ggttttttat 
tatagtgttt 
tttaaggtgg 
ttttgagcgt 
aatagttgag 
ttcgttttta 
tttattttta 
atagaaagta 
aggagtatta 
tttggttttt 
ggttagtaag 
tttcggaaaa 
tattttattt 
gtggtagtag 
gttttttttt 
gtaagttttt 
tgtttgtcgg 
ttttacgtat 
atatgcgagt 
atggaatatt 
ttttgtagtt 
gtagggagat 
ggatttgtag 
gttttatgaa 
ttatggaaat 
agggaaatat 
gttattaatg 
ttatttttgt 
tttagtaata 
gggtgtttgs) 
gtagtaaagg 
tttatttatt 



tagaaggttt 
aagaagatag 

cggggaagtt 

aatgtttagt 
taatttttat 
agtaggagag 
aaggtagaag 
tagggataag 
aaattgtttt 
tttgtgagga 
tttttttttt 
tttttttttg 
tgttattaat 
tttaatattg 
ttaaagtata 
tttttttttg 
tagggttaga 
tgattaatgt 
gggtatttta 
aaagtggtta 
attacggtta 
ttttatgttt 
gttaattgag 
tattttagtt 
tggtgtaatc 
gtatttgatt 
attatttatg 
tagtcgtaag 
aaagtatagg 
tttagtatag 
ttttattttt 
gaggggttgt 
aaatatattt 



aggaattgtt 
tgagatttta 
gagaggaata 
agatcgttgg 
ttttttttat 
ggtagggagg 
aggaaagtcg 
gtttattaag 
tattgaattt 
gatagttaat 
tttttttttt 
ggattggttt 
aaagggtaat 
attatggaga 
aaaatatata 
tattggtttt 
gatcggttat 
ttggtaataa 
ggagtatttt 
agaggtggat 
gtgtaattcg 
atattttagg 
ttatttatgg 
ggaaatatat 
ggaggtatgg 
tatatgtatt 
ttttattttg 
attatgattt 
gtatttttta 
ttttttttgg 
tattgttatt 
taagtgtttt 
atatattttt 



atattttttt tttttttttt 
tttggaaaga cgtttttgat 
ttagtatttt gagaaagtaa 
gaggattgtt gttttttatg 
gattttttgt gttatttgta 
gtgttttttg tatgtgtttt 
taaattattt gtaaaaaatt 
ttgcgatgta tttaatgtgt 
ttaaattcga ttttggtttt 
aagaattaga taaaatgtat 
gtaa^tttgg tatttagata 
gatagtaatt aaaataaagt 
aagtttattt aaaagtataa 
trttfifttttt ttaatatgta 
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ttttgattag 
atttgggtgg 
ttatgtttga 
aaaagtgggg 
ttgagagagt 
gaagttaaga 
gcgtagtata 
atgagtttgg 
tttgggttaa 
ttttttattt 
ttttttttgt 
agagtatggg 
agtttttttt 
gagttttttt 
tttgtatgat 
aagagttatt 
ttttggtatt 
agtttttatt 
agtattgcgt 
ttatgtgaac 
aaagtttaaa 
tttttaataa 
aaaggtttgt 
ttttagaatg 
taatgttttt 
ttagaaggtt 
taaatttgat 
atagtaagtt 
gtttttaagg 
ttgtgttata 
tatttagcgt 
gtttattggt 
tttgtttagt 
attttttcgg 
ttagtagggt 
tttatttttt 
ggggtatgtg 
tataatttgg 
aaaaaggtat 
gttttggaaa 
ttttaatttt 
agttatattt 
atttgatata 
gattatgaat 
attgatagtt 
attatttaag 
tatatatata 



5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6001 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 



seq_prot.txt 

tatatatata tatatatata tatatgtgtg tgtgtatgtg cgtgtgtatg tttaattttt 2880 

aatttagtta aaaatttttt tttatttgtt ttttatttgg atatttgatt ttgtatattt 2940 

tagtttaagt gaatcgagaa gatcgagttg taggattaaa ggatagatat gtagaaatgt 3000 

attttaaaaa tttgttagtt ggattagatc gataatgtaa tataattgtt aaagttttgg 3060 

ttcgtgattt gaggttatgt ttggtatgaa aaggttatat tttatattta gttttttgaa 3120 

gttttggttg tataattaat ttgtggaagg tatgaatatt tatgtgcgtt ttaattaaag 3180 

gtttttttga attatttttt atatgagaat ttttaatggg attaagtata gtattgtggt 3240 

ttaatataaa tatataagtt aggttgagag aattttagaa ggttgtggaa gggtttattt 3300 

attttgggag tattttgtag aggaagaaat tgaggttttg gtaggttgta tttttttgat 3360 

ggtaaaatgt agtttttttt atatgtatat tttgaatttt cgtttttttt tttttagatg 3420 

ttttttgtta gtttttttag ttgttaaata tagttgtttg tggttggttg cgtatgtaat 3480 

cgtatatttt attttatttg ttttatttcg gttatagtgt agtttttttt agggttattt 3540 

tatgtatata ttacgtattt ttagttaacg aggaggggga attaaataga aagagagata 3600 

aatagagata tatcggagtt tggtacgggg tatataaggt agtatattag agaaagtcgg 3660 

tttttggatt cgtttttcgc gtttatttta agtttagttt tttttgggtt atttttagta 3720 

gattttcgtg cgttttcgtt ttttggtcgt gaaatttagt ttttatttag tagcgacgat 3780 

aagtaaagta aagtttaggg aagttgtttt ttgggatcgt tttaaatcga gttgtgtttg 3840 

gagtgatgtt taagttaatg ttagggtaag gtaatagttt ttggtcgttt tttagtattt 3900 

ttgtaatgta tatgagttcg ggagattagt atttaaagtt ggaggttcgg gagtttagga 3960 

gttggcggag ggcgttcgtt ttgggattgt atttgttttc gtcgggtcgt tcggttttat 4020 

cggattcgta ggttttcggg gtagggtcgg ggttagagtt cgcgtgtcgg cgggatatgc 4080 

gttgcgtcgt ttttaatttc gggttgtgtt ttttttttag gtggttcgtc ggtttttgag 4140 

ttttttgttt tgcggggata cggtttgtat tttgttcgcg gttacggatt atgattatga 4200 

ttttttatat taaagtattt gggatggttt tattgtatta gatttaaggg aacgagttgg 4260 

agtttttgaa tcgttcgtag tttaagattt ttttggagcg gtttttgggc gaggtgtatt 4320 

tggatagtag taagttcgtc gtgtataatt atttcgaggg cgtcgtttac gagtttaacg 4380 

tcgcggtcgt cgttaacgcg taggtttacg gttagatcgg ttttttttac ggtttcgggt 4440 

ttgaggttgc ggcgttcggt tttaacggtt tggggggttt ttttttattt aatagcgtgt 4500 

tttcgagttc gttgatgtta ttgtattcgt cgtcgtagtt gtcgtttttt ttgtagtttt 4560 

acggttagta ggtgttttat tatttggaga acgagtttag cggttatacg gtgcgcgagg 4620 

tcggttcgtc ggtattttat aggtattcgc gttcgcgtcg ttcgtcgggg tggtcgtcgc 4680 

gttcggtagg agggagggag ggagggaggg agaagggaga gtttagggag ttgcgggagt 4740 

cgcgggacgc gcgattcgag ggtgcgcgta gggagttcgg ggcgcgcggt ttagttcggg 4800 

ggttttgcgt gtagttcgcg ttgcgtttag agttaagttt tttcgtcggg tagttgaaaa 4860 

aaacgtattt tttatttatt tatcgttcgt gcgagaggta gattcgaaag ttcgggtttt 4920 

ttaataaaat atacgttgga aaattagata aagtagtagt tatttgtggg ggaaaatatt 4980 

tttaggtaaa taaatacggg gcgttttgag ttatttggga aggtttcgtt tttggtattt 5040 

aaagttgggg gtgtttggag ttagtagagt ttagtagagt tttatttatt tttttaatgt 5100 

ttttgtttaa tgtgtttttt aaattttttt ttatttagat tatttgattg gaaatatgtt 5160 

agttatgatg atgatttttt gggaagcgat ttttgttatt cgtttttttt ttttttttat 5220 

tttacgtttt ggggttttag agagcgattg ggagttgaat gggtttgatt tcggagttag 5280 

ttggttgagt tcgcgttgga gcggattgtt ggtatgtgat ttttgatagt cggaaatttg 5340 

taggtgtttc gcgagtttaa aataagttat atggaagtat aagtgtttaa aaataatttt 5400 

ttgttagttt agtgataagt ttgttttatt cggggagaat gtttcggagt ggcgtgcggg 5460 

ttagttaggg tttgcgtttc gtagttattg tggaaggagc gcggtcggtt taggatatag 5520 

gagattattt tgtgatttta atggcgaagg ttgtgtgttt ttattttaat tttttttttt 5580 

ataagaattg tttttttttt tttttttttt ttttttattt ttttttgttt agtttttttt 5640 

tttgtttttt gttttttgtt tttttgatgg gtttgtagag ggattaggtg ggcgtttttg 5700 

gtgaatattt ttttaggtgg ttataggata ggtgtatttc ggattgggtt tggaagtttt 5760 

agggcgttat atggttgggt tttgaattag gtatttttta attgtatatt ggtattcgga 5820 

ttggtgtttt tatatttttt tgttttgtaa gtcgtggatt agtttttgtt tagtattttg 5880 

tttttaggga tatttatagt agaaggaagg ggattaaagt gtagtttggt tttagaggat 5940 

attgaagggt agattttggg ggtatttagt gtgtattttt agtcgttttg gagaaattta 6000 

gagtatttta tagttacgta gatttaagtt gtttttattt aaaagataaa taatgaataa 6060 

aatttttaaa ggttggtata ttttaaatta attttatttg ttttaattta gggttaaaat 6120 

agagaaaaag gatttttttt gtttattttt ttttttttaa atggaagaat aaagtatagc 6180 

gattaagttt aattttatat aatatttaaa attgtttgat gtgaaggaag gtattggtat 6240 

gatgtgaatt ttataatttt atgatggatt ttagaaatta tttttttttt tatttaattt 6300 

ttagtttttt tattgtaaat taatgttgtt gaattttaat gggtattaat gagattgttt 6360 

tttggtagat tatttattgt tttgttaata attataaagt gaatttggtt aaatatagag 6420 

gggatcgtat tttatttaaa attgtttatt attttagtga taagtggtat tagtgtaata 6480 

tgttttattt tatatttttt gtattatatg atatttaaat atttttagaa taataaaaaa 6540 

agagataagg aatttaaaaa ttaaaaaaaa aatttgtata aatgggattt tgtgtggaaa 6600 

tttagtttta gaatgatttt ttttgtgttt tatttttcgg attatttttt tttttttgtt 6660 

agaattttgt ttgttattat ttagtaagga aaagaagtat ttatgtaagt tttttatatg 6720 

gatagatatt atttagtatt tttttttttt tagttttttt gtttaaatga ttttgggtat 6780 

aaaggaaagg attgattggg tttttttagg aaattttaag tt*tttaagt agtttttaaa 6840 

agttttgggg ttgaaagtag tgtttttaaa ttgtttgtta tgatttagag ggttatgaat 6900 
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ttagtttagt gagtttagaa tattttttaa aaggattaaa atggaaagga atataataga 6960 

aaatattaga gtgtatggta tttcgtaagg ataagttttg t 7001 

<210> 241 
<211> 7001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 241 

ataaaattta tttttacgaa atattatgta ttttgatatt ttttattata ttttttttta 60 

ttttagtttt tttaaaaaat attttagatt tattaaattg agtttatgat tttttgggtt 120 

atgataagta gtttgaaaat attgttttta gttttaaaat ttttgagaat tatttaagaa 180 

atttaaagtt ttttaaaaga gtttaattaa tttttttttt tatatttaga gttatttaag 240 

tagaaaaatt gagaggggaa aaatattaaa taatatttgt ttatatgaag aatttgtata 300 

gatgtttttt ttttttgttg gataataata ggtagaattt taataaaaga ggaaagataa 360 

ttcgggaaat aaaatatagg aaaaattatt ttaaaattga atttttatat agagttttat 420 

ttgtgtaagt ttttttttta atttttaagt tttttgtttt tttttttatt attttaagag 480 

tgtttgaata ttatgtaatg tagaaagtgt aagatagggt atattatatt gatattattt 540 

attattggga tgatgaataa ttttgaataa gatgcgattt tttttgtatt tgattaggtt 600 

tattttgtaa ttattagtaa ggtagtaaat aatttattaa ggagtagttt tattagtgtt 660 

tattgaaatt tagtagtatt aatttgtaat aaaagaattg aaaattaaat agggaagaaa 720 

atggtttttg gagtttatta taaggttatg gaatttatat tatattagtg ttttttttta 780 

tattaagtag ttttaaatgt tgtgtggaat tagatttaat cgttgtattt tgttttttta 840 

tttaaaaaaa aaaaggtggg tagaagaaat tttttttttt tgttttaatt ttaaattaaa 900 

ataagtaaaa ttaatttgaa atatgttaat ttttaaaagt tttgtttatt gtttgttttt 960 

tgagtaaaga tagtttggat ttgcgtggtt gtgggatgtt ttaaattttt ttaaggcggt 1020 

tgaagatgta tattgaatat ttttagaatt tgttttttag tattttttgg ggttaaattg 1080 

tattttagtt tttttttttt tgttataaat atttttggaa atagaatatt gaataaaaat 1140 

tggtttacgg tttataaggt agaaagatat agggatatta gttcggatat tagtgtatag 1200 

ttgggaaatg tttaatttag gatttagtta tgtggcgttt tgaagttttt aaatttagtt 1260 

cggggtatat ttgttttgtg gttatttagg aaggtgttta ttagaagcgt ttatttaatt 1320 

tttttgtagg tttattagga aaataaaaaa taaaaaataa aaggagaaat tgggtaagag 1380 

aaaatgggag ggagaggaga gggagaaaga ataatttttg tagggaaaaa aattaaaatg 1440 

aggatatata attttcgtta ttgaagttat aaagtggttt tttgtgtttt ggatcggtcg 1500 

cgtttttttt atagtggttg cgaggcgtag attttggttg attcgtacgt tatttcgggg 1560 

tatttttttc gggtgggata ggtttgttat tgggttggta ggagattatt tttaagtatt 1620 

tgtgttttta tatggtttgt tttaaattcg cgggatattt ataaattttc ggttgttaga 1680 

agttatatgt tagtaattcg ttttagcgcg gatttagtta gttaatttcg aaattagatt 1740 

tatttaattt ttaatcgttt tttaaagttt taggacgtgg ggtggggagg aggggaaagc 1800 

gggtgatagg aatcgttttt tagaaagtta ttattatagt tgatatattt ttaattaaat 1860 

agtttagatg aaaggaaatt tggggagtat attaaataaa aatattaaaa ggataaataa 1920 

aattttgttg agttttgtta attttaaata tttttaattt taaatgttaa gagcgagatt 1980 

tttttaagtg atttaaagcg tttcgtgttt atttgtttgg aggtgttttt ttttataaat 2040 

aattgttgtt ttgtttggtt ttttaacgtg tgttttgtta ggaagttcgg gttttcgggt 2100 

ttgtttttcg tacggacggt aagtgggtgg agagtacgtt ttttttagtt gttcggcgag 2160 

agaatttgat tttgaacgta gcgcgggttg tacgtagaat tttcgggttg ggtcgcgcgt 2220 

ttcgggtttt ttgcgcgtat tttcgggtcg cgcgtttcgc ggttttcgta gttttttagg 2280 

tttttttttt tttttttttt tttttttttt ttttgtcggg cgcggcggtt atttcgacgg 2340 

gcggcgcggg cgcgggtatt tgtagaatgt cggcgggtcg gtttcgcgta tcgtgtagtc 2400 

gttgggttcg ttttttaggt agtagggtat ttgttggtcg tggggttgta ggaaaggcga 2460 

tagttgcggc ggcgggtgta gtagtattag cgggttcgga gatacgttgt tgagtggggg 2520 

gaaatttttt aggtcgttgg agtcgaacgt cgtagtttta gattcggggt cgtaggggag 2580 

gtcggtttga tcgtagattt gcgcgttggc ggcggtcgcg gcgttgaatt cgtaggcggc 2640 

gttttcgggg tagttgtata cggcgggttt gttgttgttt aggtatattt cgtttagggg 2700 

tegttttagg gggattttga gttgcggacg gtttaggggt tttagttcgt ttttttggat 2760 

ttgatgtagt agggttattt tagatgtttt ggtgtggagg gttatggtta tggttcgtgg 2820 

tcgcgggtag ggtgtagatc gtgttttcgt agggtagaag gtttagaaat cggcgggtta 2880 

tttggaaaaa gagtatagtt cgaggttaga ggcgacgtag cgtatgtttc gtcgatacgc 2940 

gagttttggt ttcggttttg tttcgggagt ttgcgggttc ggtgaagtcg ggcgattcga 3000 

cgggagtaag tgtagtttta ggacgaacgt ttttcgttag tttttgggtt ttcgggtttt 3060 

taattttaag tattggtttt tcgagtttat atgtattata aaggtgttgg aggacggtta 3120 

gggattgttg ttttgttttg atattggttt aaatattatt ttaggtataa ttcgatttgg 3180 

agcgatttta aagagtagtt tttttgaatt ttattttatt tgtcgtcgtt gttggataga 3240 

ggttgagttt tacggttagg gggcgggggc gtacgaggat ttgttaaagg tggtttaggg 3300 
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aagattgggt 
tgttttatgt 
tttttttttt 
tatattgtaa 
atagatagtt 
ggagatttag 
ttaggatttt 
tttttgagat 
ttttattagg 

gggtgtttat 

aatgtgattt 
gttatattgt 
ttttagtttt 
tttagatgaa 
gtatatatat 
aggagataaa 
tgaataaatt 
taattgttat 
gttaagttta 
atttaatttt 
atcgagttta 
gtgtatcgta 
tagatgattt 
gtaaaaagta 
tataaagaat 
tagtaatttt 
taaaatgtta 
gtttttttag 
ggaaggaatg 
tggatgtgtt 

ggtagttttt 

Qgggagtgag 
gttgtattgg 
ttttatgttt 
ttttacggtt 
atatgggtgg 
ggattaaatg 
cgattgtatt 
tagttgaaat 
gtttagttga 
tgaatataag 
ttgatcgtga 
ttgattattt 
ttagagtgtt 
gatattagtt 
ttttggtttt 
ataaaaaaga 
ttgtgttttg 
ttagtattga 
tgttggtgat 
ttagaaaaaa 
aaggaaaggg 
ttttttatag 
aaaggtagtt 
ttttgttttt 
ttttttgttt 
ttttttttgt 
ggtaaagatt 
tgttgggtat 
taattttttc 
attgtttttt 
taagtttttt 



ttaaaataaa 

gtttcgtgtt 

tcgttggtta 

tcgagatagg 

atatttagta 

ggtatatata 

agtttttttt 

ttttttagtt 

aatttttatg 

gttttttata 

ttttatatta 

cggtttggtt 

ataattcgat 

aaataaatag 

atatatatat 

aaataggtga 

ttttttgttg 

tttaaatatt 

ttgttattaa 

tataaaaata 

gtattaatga 

gatatttttt 

aatttttata 



ttttataggg 
tttataagtt 
tatttttttg 
aagttataga 
aaatgtttta 
tgttcgtatg 
tgtgcgtgga 
ttcggtagat 
gggagatttg 
ggaggagaaa 
tttgttgtta 
aaaatagaat 
ttttttcgaa 
ttttattggt 
aggaaattag 
gtaatatttt 
ttgttttttg 
ataaaggtag 
ttgggaacga 
ttttaattat 
tacgtttagg 
attattttgg 
gggatattat 
aatagaaagt 
aaagattttt 
aaataaatat 



cgcgaaagac 
agatttcgat 
gaaatacgta 
gtagatagaa 
gttgggggaa 
taggaagagt 
tttgtaaaat 
tgatttgtgt 
tgaaggatga 
ggttggttat 
aatataattt 
tagttaatag 
tttttcggtt 
aaaaaagttt 
atgtgtatat 
agtatattat 
tagtttaata 
taatagagta 
gagttatatt 
gattattgtt 
ttatagattg 
tttatggtta 
aattatattt 



agtggtgttt 
gtaagagtat 
gatttgtaat 
gttttgattg 
tttttaatta 
gggtttggtg 
ttggtaaaaa 
tggatatata 
gatttggttt 
tgtaggtttg 
gtgtttgtgt 
tcgagggttg 
agttttataa 



tttttaaaaa 
ttgaaagaaa 
ttaggatata 
gagagaaatt 

tgggtttgta 

tgcgagtggt 

atggtaggga 

acgtagttgg 

ttgtttgttt 

ttttagtttt 

ttttaagtaa 

gttagttaaa 

agaatataat 

tttagttttt 

attttggaat 

tttatttcgt 

taaatatgtg 

atatttttgt 

ggttttttat 

tttttcggtt 

gttttaggat 

atattaaatg 

gtagttattg 

tataaaaatg 

agtattgtag 

tggatataaa 

tttgtaggtt 

ttagggaggg 

tgtttattat 

tttagagatt 

tttttttttt 

agtatgggtg 

ttttattcgg 

cgttttagtc 

tttcgttgaa 

tttaaatatg 

ttagtattat 

attttttttt 

tttttttttt 



seq_prot. 
ggatttaggg 
atatttttgt 
gtgtgtatat 
tggggtgtgc 
ttgatagggg 
tgtattttgt 
gtttttaaag 
gtttatgttg 
tttagaaaaa 
gtaattaaaa 
taggttacga 
atttttaaaa 
tatttgggtt 
ttaattgaat 
atatatatat 
gcgtttataa 
gattttgata 
tttgacgaag 
tttgattatt 
ggataatatg 
ttagtaaata 
gtaattatta 
ttaattttta 
atgtgaattt 
aaggagatat 
gatattagtt 
aaaataagtt 
gaagttaagg 
ttagtgtgtg 
ttcgggaagt 
gttattgtat 
atagggagta 
tttgttttta 
gaggaaaaat 
agtgtatgga 
ttaataataa 
agtttttgaa 
aaatgttttg 
taattaaatt 



txt 

gtcggttttt 
ttgttttttt 
aggatgattt 
ggttgtatac 
gtatttgagg 
tattaggaga 
tagatagatt 
gattatagta 
tttttggtta 
ttttagaaaa 
attaaagttt 
tgtatttttg 
aggatatgta 
taaaagttaa 
atatatatat 
ttttggatag 
taattattaa 
tgtttatggt 
ttatattaag 
taaatgtagt 
aagggttaaa 
ttttttaaag 
aatgtttttt 
gttaaattat 
atatatattt 



aaaaatacgt 

tttaaattga 

tttttagtgg 

atagtattaa 

atatgatatt 

attttaatta 

agagatttat 

ttaatgatgt 

tgtttgagta 

taaatattga 

aataatggtt 

aggagagagg 

tgatatagga 

tataattaaa 

agtatttttt 

ggaaagtata 

gtttttagtt 

gtgatttatt 

tgtgttttta 

agttattatt 

tgttttttta 

gttttttata 

tttattgtat 



agaaaggtaa 
tattttgttg 
gtcgagagag 
aattaggtag 
agttagtagg 
agcgttggat 
gtgtggtata 
tttttggagg 
aggtttgttg 
tattaagttt 
aagttttttg 
tagaggtatt 
gtattttagg 
tataaatttt 
attattaaaa 
ttttaggttt 
cgtttatatg 
tacgtaatat 
taatggaggt 
taatgttagg 
gagtggtttt 
tattatgtaa 
taggagagtt 
aaagaaagat 
atttatgttt 
aataggggaa 
agaataagaa 
gttagtttaa 
tttaaattta 
ttgtgttacg 
tttttggttt 
ggttttttta 
gttttatttt 
tttaaatatg 
gttatttaga 
tttaattaag 



tttaatgtgt 
ttttgtttga 
tggggaggat 

gtagttagtt 
ggaagggggc 
atgtaatttg 
tttttataat 
ttgtatttgg 
gggcgtatat 
ttgaatataa 
tggtaattat 
tatgtttatt 
gaattaaata 
atatgtatac 
atatatatta 
tttatatttt 
tattttgttt 
ttatttaaat 
tatatatttt 
tgaagttaaa 
aatatattag 
taatttttta 
taaaatatat 
atgtaaatgg 
ttatggagaa 
gttgtttttt 
gattaagaac 
tgagaaggaa 
agagagtgtg 
tagggtattt 
gaatggtagt 
gttagagaaa 
ttggaaagta 
tgaattatag 
ataaaatagg 
ggatatatgt 
gttatgtttt 
gatgtgtttt 
tttatgaata 
gtttaggata 
tcgagttgta 
gatttatttt 
tgaggtgttt 
tttgttgtta 
agtggtcggt 
tgagattagt 
atatatgttt 
ttttttatag 
tgttattttt 
tagattagtt 
aggagaagga 
ggttagttgt 
gaagtttagt 
ttttggtgag 
tcgatttttt 
ttttttttat 
ggtgaagaag 
tttagcggtt 
gtgttttttt 
ttaaaatttt 
aggtaatttt 



3360 

3420 

3480 

3S40 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7001 



<210> 242 

<211> 3501 

<212> DNA ( 

<213> Artificial Sequence 
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<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 242 

taattttgaa aattatttgt agatattttg taggttttat tttaggaata atggttattt 60 

tttcgggtag ttgaagtaaa attaagttta atgataagta aatataatta ttattaaaat 120 

tttttattta tgtttgttaa agtaatttaa gtatgatttg agaaggattt tgtattttat 180 

atttgagttt ttgtggatga attgtaattt agtttaatag gtagataaga ttgaaaattt 240 

aatttaggag tatgtgtttt taataatagt tgagttttgg ttaattttag tggttatatt 300 

ttaattattt atatattgtt gagtgtttaa attgtgttta aagaaggtaa aagttaattt 360 

gtaattaatt tagttgtttt tttgttttat ttttaatttt tgtatgttat tttttttttt 420 

ttgtttataa atatgttttg attatgaggt atttttggag tttttgaatt cgttgtgatt 480 

ttggaagttg ttttattcgt aaattattta ttatttaatt aaattgtttt aaatttaatt 540 

ttgttgaagt ttttttttaa taggtttaga aaaaataatg gtaaaaatga atgaaaattt 600 

aataattttg gaagtagaaa aggttggggg ttttaataag tgtaaatagt tttattttta 660 

tatttttttt atggtaatta taatttagta tattatatat atattttttt gtttttcgta 720 

ttttggttta gggtaaagtt ttttaaaata ggtattgtta attagtgtta ttaagaaggt 780 

ttggatgtcg ttttgtggga atattttaaa gaggaatgtt taaaaggaaa agggggatgg 840 

gttgggagaa gggtattagg cgggtatttt aaaattattt ttagggttat aggtttaatt 900 

tatttggttg tggacgttag agtcgttatg gtaagaagga agtaaagttt tttgtaataa 960 

ttaaagtttt tagaagtagc gtgttttatt gtttattagt gcgtcgtgaa gtttggtgtt 1020 

tatttatagg gtttttttta gtattgttta ggtttttcga gtgttttagt atagtagttt 1080 

ggagtttgtt ggtttggtga ttaagatata ttttagggaa tatgttatgt agtggagttt 1140 

ttttttcggt attgtatagt aaaaggaaag ggtcgttggg tgtttgtggg ttttgggtag 1200 

ttatagaagt tatcgcgttg gcggggagga gggggatcga tgcggtttat gtttcgggta 1260 

gttttatttt ttttgtttgc gaagggtttt tgttcggcgg gaggagagag gcgcgtttta 1320 

ttcgggtttt tttatatttg tcgtcgtttg ggtcgatttc gcgggtttcg ttcggcgttt 1380 

tagtcgattt tcgtttagtt tcgggtttat gggcgcggtt agtagggcgg gttagggcgg 1440 

cggggcgcga tattgggagg aagtgcgggt cgtttgttcg ggcgcgttaa ggaagttgtt 1500 

taaaatgagg aagagtcgcg ggttcggcgg ttgaggttat ttcggcggcg gttggagagc 1560 

gaggaggagc gggtggtttc gcgttgcgtt cgttttcgtt ttatttggcg taggtaggtg 1620 

tggtcgcgtt ttttattcgg tcgggatttt ttggtaagga gaggaggtta cggggaacga 1680 

cgcgttgttt ttatgttttt ttttgtttta tttttatcgg tcgaggtaaa agtgttgaaa 1740 

ttatgtgaat aaaatatagg tgggtttcgt tagtttcgtt tttgaattta ttcgcgttcg 1800 

ggatttagaa gttgcgtcgg gagagagggg tttaggtttg ggcggagggg acggaggtta 1860 

gatcgtgcgg aaagtgattc gggtatttta gggcgtttag gtttttaggg agcgcggaaa 1920 

gtgcggtcgc ggttcggttt tcgggagacg cgggattggg attaggtata gcgcgaggaa 1980 

gtcgattttg gagttagaat attttttttt ggttatttat acgaatttat tggaaaatgt 2040 

cgtagtgttt attaaagtta tttaaagtag aaatgtttag acgttttatg agtttagata 2100 

aattttttat tataaaaaga aatagtagtt gtatttaaat aataattttt ttgaattatt 2160 

attaaaattt agtataatta tttttgtgga tatattttta ttgttaagta atttatttag 2220 

ttaatgaatt tggagagtaa gaaagtttta tgtagtaaaa tgtaaatttg agttaagagt 2280 

taagggtgtt tttttttgtt tgtttgtttg tttttggtaa tgtggtttta aaattttaag 2340 

tttatttaaa aattatataa atgtaattta tttttttgtt ggaatgttta aggattagaa 2400 

agataattgg agaagtgaga gtttgaattt ttttttacgt tggaaatagt gttgtaaaat 2460 

attttttgag ttttgtttga tttagtaaag atttagtcga atttaagtag agtttaaagt 2520 

attgtagcgt gtagtaaaaa aaaaaaagag ttgaagatgt tgtgttatat ttgatttttg 2580 

gtattaaaaa taaaaaaagg aattatttaa tttttaagag tttttggaag aaatggaatt 2640 

gattttatta tttgattttt ttgttttagt agggataaat ttattgtttt tatataggta 2700 

gtattttaat taaatgtaat taaatatatt gatgtttttt ttttttttag tgaaagttcg 2760 

agtagtgtat agatagaatt atattttttt aaaaaggttt aaatatatgt aaattattaa 2820 

gtgttttaag tgagaaattt ttgttagttg aaattatttt attaaattat ttttgttttg 2880 

aacggtattt ttgttttaac gtatttagaa aatttttgta ttaaaataga aattgatttg 2940 

tattattttt tgtatttgaa gtgtatttta aagtgtattt gaatgagaga ttataattaa 3000 

attatttgat ttgtgtttta tttttttatt tttttatttt ggatttgaaa aggttggatt 3060 

tgaatattag gaaaaaagag atttttttat tgagggattg tgggaagatt tttttttaag 3120 

tttttgtatt tgaagtgaaa tataaaatag gattttataa tagttttaat aattagaaag 3180 

ttttaagtta ggataaaaat ggggtgtata ttttaggttt ttattttttt tgagagatga 3240 

aattggtaga aacgttattt tattgttaga ttttatttta gaaatgaatt attttgattt 3300 

ttgttaggtt tttatttaat attttaaatg taggtttaat ttttttgttt ttttttaatg 3360 

gaagattttt tagtagaatt ttatttattt agtatattgt ttgtttatgg ttataattta 3420 

atatgtttgt gttaagacgt attttggaat ttaagttttt ttagaatttg gaattaaaat 3480 

ttagaattta aattttagaa t c i ■ 3501 

<210> 243 

<211> 3501 ■ \ - f . . . 

<212> ' DNA - } <- ! • 2 ; -{'n , \\ 
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<213> Artificial Sequence 



seq__prot.txt 



<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 243 

gttttgaaat ttagatttta aattttaatt ttaaattttg aaaaagtttg aattttaaaa 60 

tacgttttgg tataagtata ttggattata attatgggta aataatgtgt taaataagtg 120 

aagttttatt aagaagtttt ttattaaaaa aaagtagaga gattagattt gtatttagaa 180 

tattaaatag gggtttaata aagattaggg tgatttattt ttaaagtgaa atttagtaat 240 

aaaataacgt ttttattaat tttatttttt aggaaagata aagatttaaa gtatatattt 300 

tatttttatt ttagtttaaa attttttagt tattaagatt attataaaat tttattttat 360 

attttatttt aaatatagag gtttaaaaaa aaattttttt ataatttttt agtagaaaaa 420 

tttttttttt tttagtgttt aaatttaatt tttttaaatt taaggtagaa gagtaagagg 480 

ataagatata aattaagtaa tttgattata attttttatt taaatatatt ttaaagtgta 540 

ttttaaatat aggggataat ataaattaat ttttgtttta atgtaagagt tttttaaata 600 

cgttaaaata aaagtgtcgt ttagaataaa aatgatttga taaaataatt ttaattgata 660 

aagatttttt atttaaaata tttaataatt tatatgtatt taagtttttt tgagaaaata 720 

taattttatt tatgtattgt tcggattttt attggggaga aggaggatat taatgtattt 780 

agttgtattt aattaaaatg ttgtttgtgt gaaggtaatg aatttgtttt tattaagata 840 

aagaagttag atggtagaat taattttatt ttttttagga atttttaggg attaaatggt 900 

tttttttttt atttttaata ttaagaatta gatatggtat aatattttta attttttttt 960 

ttttttatta tacgttgtaa tgttttgaat tttatttgag ttcgattgaa tttttgttaa 1020 

gttaggtaaa atttaaaaaa tattttgtaa tattgttttt aacgtaaaaa aagatttaaa 1080 

tttttatttt tttaattgtt tttttagttt ttgaatattt taataaaaga atggattgta 1140 

tttatatagt ttttaggtgg gtttaaggtt ttggagttat attgttaaaa atagataaat 1200 

agataaaaag aaatattttt aatttttaat ttaaatttat attttattat ataaaatttt 1260 

tttgtttttt aagtttattg gttaagtaaa ttatttgata atgaaaatgt atttataaga 1320 

gtaattatgt taaattttag tagtggttta gaagggttgt tgtttgaatg taattgttgt 1380 

ttttttttgt agtaaaagat ttgtttaagt ttataagacg tttggatatt tttattttga 1440 

gtaattttga taaatattgc ggtatttttt agtggattcg tgtaaatggt taaaggaaaa 1500 

tgttttagtt ttaagatcga ttttttcgcg ttgtgtttga ttttaatttc gcgtttttcg 1560 

agggtcgggt cgcgatcgta tttttcgcgt tttttggggg tttgggcgtt ttggggtgtt 1620 

cgggttattt ttcgtacggt ttgattttcg tttttttcgt ttaggtttga gttttttttt 1680 

ttcggcgtag tttttggatt tcgagcgcgg gtaggtttag gagcgaagtt ggcggaattt 1740 

atttgtattt tatttatatg gttttagtat ttttatttcg gtcgatgaag gtagaataag 1800 

aaagggtatg aaagtagcgc gtcgtttttc gtaatttttt ttttttatta gaaagtttcg 1860 

gtcgggtagg ggacgcggtt atatttattt gcgttaggtg aggcgagggc gggcgtagcg 1920 

cggggttatt cgtttttttt cgttttttag tcgtcgtcgg ggtggtttta gtcgtcgggt 1980 

tcgcggtttt tttttatttt gggtaatttt tttaacgcgt tcgggtaggc ggttcgtatt 2040 

tttttttagt gtcgcgtttc gtcgttttgg ttcgttttgt tgatcgcgtt tatgagttcg 2100 

gagttgggcg ggaatcggtt gaagcgtcgg gcgaggttcg cggaatcggt ttaggcggcg 2160 

gtaggtgtag aggagttcgg gtggggcgcg tttttttttt ttcgtcggat aagggttttt 2220 

cgtaggtaga gaaggtgggg ttgttcggga tatggatcgt atcggttttt ttttttttcg 2280 

ttagcgcggt ggtttttgtg attgtttagg atttatagat atttagcggt tttttttttt 2340 

tgttatgtag tgtcggggaa gagattttat tgtatggtat attttttgga gtgtattttg 2400 

gttattaaat taataagttt taagttattg tgttggagta ttcgggaggt ttgggtagtg 2460 

ttgagaggga ttttgtaggt gaatattaga ttttacggcg tattagtggg taatggggta 2520 

cgttgttttt gaaggtttta attattataa aaggttttgt ttttttttta ttatgacggt 2580 

tttgacgttt ataattaaat aaattaaatt tatagtttta agaatggttt tgagatattc 2640 

gtttgatatt ttttttttaa tttatttttt tttttttttt ggatattttt ttttaaaatg 2700 

tttttataaa acggtattta gatttttttg ataatattgg ttggtagtgt ttgttttaaa 2760 

aagttttatt ttaagttaaa atgcgagaag taaaaaaata tatatataat gtgttggatt 2820 

gtaattgtta tggagaaaat atagggatgg gattatttat atttattgga gtttttagtt 2880 

ttttttgttt ttagggttat tggattttta tttatttttg ttattatttt ttttaaattt 2940 

gttaaaagaa aattttagta gaattaaatt taaagtagtt taattgagta atgaatgatt 3000 

tgcgaatggg gtagttttta gaattatagc ggatttagag attttaggga tgttttatgg 3060 

ttagaatata tttatagata aaaaaaggga agtgatatat agaaattgga ggtaaggtag 3120 

agaaataatt ggattggtta taggttggtt tttgtttttt ttgaatatag tttgaatatt 3180 

tagtagtgta tgaatggttg aagtatggtt attgggattg gttaagattt agttattgtt 3240 

aaaggtatat atttttaaat taggttttta attttgtttg tttattaagt taggttatag 3300 

tttatttata aggatttaaa tatagaatat agagtttttt ttagattata tttaggttgt 3360 

tttaataaat ataaatggaa gattttgata atggttatat ttgtttatta ttggatttaa 3420 

ttttgtttta attattcgaa aaaatagtta ttgtttttga gatggagttt gtagaatgtt 3480 

tatagatggt ttttagagtt g 3501 



<210> 244 



sei te 152 



<211> 4216 
<212> DNA 

<213> Artificial Sequence 



seq_prot.txt 



<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 244 



aggtaataat 
ttaaattttt 
aattttttgt 
tagatggttt 
taaattttta 
tattagtgtt 
tattttggga 
gaatatggtg 
gtttgtagtt 
gaggttggag 
ttattttaaa 
cggtaattgt 
gtgtagtggt 
attttagttt 
tttgaatttt 
ttttaagtaa 
gtatttagtt 
ttatatattt 
gggaaattga 
agatgtgaat 
ttagattgtt 
taggttggag 
taattttttt 
gttaattttt 
atttttgatt 
gagttattac 
ttttgagata 
ttatagtttt 
gggttatagg 
ggttttatta 
tggtttttta 
ttttttgaga 
gtttattgta 
tatacgcgtg 
ttgtttggtt 
cgtttaagta 
gaatttattt 
tcgtttacgg 
gttattgtta 
ataaaagcgt 
tagttagtat 
tttttcgcga 
tgtcggagga 
tgtttttcgt 
gtcggtaatt 
ttttggatgt 
tggagattat 
gtagggtcgg 
gtttagtatc 
cgtttttaag 
atcgtttgag 
tttcgatttc 
gaagagagaa 
taagcgggga 
ttttaagtgc 
aaattttgaa 
ttatttttaa 



aattatgatg 
aggttaattt 
taggtaattt 
agtttattat 
tagtatatga 
tttaaagata 
ggtttaggcg 
aaattttatt 
ttagttattt 
tgagtttaga 
ataaataaat 
ttttgtttgt 
attattaggt 
tttgagtatt 
agtagagatg 
tttttttata 
agtaatataa 
ttatgtaatg 
ggtataaaga 
ttaggtagtt 
tttttttttt 
tatagtggtg 
gttttagttt 
gtatttttag 
tttggtgatt 
gtttagttta 
gggttttgtt 
tatttatcgg 
tatgtattat 
tgttgtttag 
aagtgttgag 
gataagattt 
gttttaattt 
tggttttaat 
ttgttttttt 
ttagggtcgg 
gtttttttta 
tttttgaacg 
ttaatagaga 
agtcgagttt 
gatcgagcgt 
ttggtattcg 
gtggtcgtag 
cgttatcgag 
tagtagcggg 
taattatttc 



gaggtggggt 
tttttttttt 



cggtgagttt 
gggcgtgcgg 
gggaaaaata 
ggcgtttttt 
gttaggagtt 
gcgttattat 
aatgcgtttt 
gttttttggc 
gtttgcgtat 
tcgaagaatt 
ttttttttgt 
ttttggttta 
tttaaaggta 



tattaggttt 
attatagtta 
tattattgtg 
atatttaagt 
ttgtattaaa 
gaagatggtt 
ggtggattat 
tgtattaaaa 
aggaggttga 
ttaggttatt 
aaataaataa 
ttgttttgag 
ttattgtagt 
tgggattata 
gggttttatt 
ttagtttttt 
ttttatggga 
tatgattgtt 
atttaatttg 
tggttttaaa 
tttttttttt 
agatttcggt 
ttttagtagt 
tagagatggg 
tttttatttc 
gattgtttta 
ttgtagttta 
ggtttaaagg 
tatgtttagt 
attggtttta 
attataggta 
tgatttgttg 
ttgggtttaa 
tttgtttttg 
taatagaagg 
cgagtgtttt 
acgagagaag 
ttgggggagg 
ttttaaatat 
agcgtttcgt 
cgcgtttttt 
tatagtcgtt 
tggttaggcg 
agtttcgtag 
gtttcggaga 
gtttcggacg 
ttttgttttt 
ttgaaacggg 
ggatttttga 
gttttgtaat 
taagattagt 
aaaaaaaaag 
tttatagagt 
gcggtgttag 
tgtttatttt 
tttcggaagt 
taatttttat 
gcggggtcga 
agtacgagga 



cgggttaaag ttattatttg 
tgagttattt aataatagat 
ggaatatttt agagtggatt 
tatatggttt agtttattgt 
tattgaaggt agttgtaata 
aggcgcggtg gtttacgttt 
ttgaggttag gagtttaaga 
atataaaaat tagttagtgt 
ggtagaagaa ttatttgaat 
gtattttagt ttgggcgata 
ataaaatata gaagatgtat 
atagggtttt gttttgttat 
ttcgattttt ttggtttaag 
ggtttacgtt attatgtttg 
gtgttgttta ggtttgtttt 
aaagtgttaa gattatagat 
ttatttttat atttgttgtt 
attattatta ttatttttat 
tataagttta tttgtttagt 
gtttatacgt ttaataatta 
ttttttgaga tggagtttta 
ttattgtaat ttttgttttt 
tgcgattata ggcgttcgtt 
gttttattat gttggttagg 
ggttttttaa agtgttggga 
tttttgtatt tgtatttatt 
ggttgaagtg tagtggtgta 
atttttttgt tttagttttt 
taatttttaa atattttttg 
aatttttagt tttaagggat 
tgagttatgt atttagtttt 
tttaggttgg agtgtagtgg 
gtattagatt ttttttatta 
ttatttttta gttgtatgtt 
acggttttgg ttacgggtta 
gtcgtggtac ggttttagcg 
gttttagatg agggttgaat 
agtgtatggg gaggggcggt 
cgtttgttaa aaatattcga 
atttttttga gtagacgttt 
tttcgttttt gcggggtttt 
ttttcgatta ggttttcggg 
gtagtagttg gttaggttac 
tggtcgcgtt cgtttatagt 
ttcggtatat tgcggatcgt 
agttgacggt taagattaag 
gtaggggaga ggaggaggtt 
ggtttcgggg gtttggggag 
tttttttgtt taggaattgg 
tttagcgttt tgggaggtcg 
ttgggtaata tagcgagacg 
taaataaaaa tttttttaaa 
ttttttttta tttatagttt 
tttttagtcg ttgagtgggc 
ttagtttcgc gttttgtttc 
ttttgagagt ttagatcggc 
tagtttgtag ttttggttgt 
aaggtagttt tttttttcgt 
gcggtaggac gagtatggtt 
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tattaggtta 60 

atatttaata 120 

tatataaatt 180 

ttttaggtta 240 

tgagggtaag 300 

gtaattttag 360 

ttagtttggc 420 

cgcggtattt 480 

ttgggaggta 540 

gagggagatt 600 

agtaaaaata 660 

gcggattgga 720 

tgtttttttt 780 

gttaatttgt 840 

agttttttgg 900 

gttagttatt 960 

ttttgttgat 1020 

tttttagatg 1080 

gatggaataa 1140 

tattagatta 1200 

ttttgttatt 1260 

tgggtttaag 1320 

attatattta 1380 

ttggttttaa 1440 

ttataggcgt 1500 

tatttattta 1560 

atttagttta 1620 

ggagtagttg 1680 

gtagaagtag 1740 

ttttttgttt 1800 

tttttaaaat 1860 

tgagattata 1920 

tattttattt 1980 

ttaatttaat 2040 

tagttagtaa 2100 

tcgcgttttc 2160 

ttttttcgtt 2220 

ttttaaacgg 2280 

ttggaggagt 2340 

agagtagagt 2400 

agttgggatt 2460 

ttgtttcggt 2520 

gtgcgttttt 2580 

cgcgcgttta 2640 

tggcgcgtgt 2700 

gatggcgtgg 2760 

agtagggcgg 2820 

ttaaacgttg 2880 

gagtgcgggt 2940 

agacgggagg 3000 

cgttttttcg 3060 

gattatcgat 3120 

tatttttaga 3180 

gtgtgcgcgg 3240 

gtttttttta 3300 

gggtacgttt 3360 

ttttaagtag 3420 

agtttgattt 3480 

atatttttcg 3540 



gtgttttacg cggaaatata cgtgagtttt 

ggggtggggt tagggaagag ggtataggga 

tttttttttg tacgtttagg ttgtttttcg 

tgttttttga gggtatattg atcgtggagg 

acgagattat tattttagtt attttcgagt 

taaaattcga tgagattgtc gttaagtaaa 

gttattggtt gtgttttttt cgttatttgt 

tttttaaata aagtttaaag taattatttg 

ggaagtttta ggtatttgtt atttgttggt 

ttgggttatg tgggtatatt ggtgtaggtt 

gatagaggtt ttagggtcgg gcgtagtggt 

ttgaagtagg aggagt 

<210> 245 
<211> 4216 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic 
<400> 245 

attttttttg ttttaatttt ttaaagtgtt 
ttttaagatt tttgttttta tggatgtgag 
tatttatatg gtttagtacg ggtttgagta 
ggtgtttgag gttttttttt ataaatatta 
gaattttatt tgagaaaaat agaagataaa 
aggtatagtt agtggcggta gtaggggtgg 
gttttatcgg attttgtagt ttttgggttt 
gggatggtga tttcgttgga ttgcgtggtt 
gtgtttttag gggataggga ggaggaaatt 
gacgtgtaga gaggaaaggt aagcgttata 
tttttgattt tatttttacg ttatttattt 
ttttcgcgtg aagtatcggg agatgtagtt 
tttgggggga agagggaaat tagattgcgg 
ttagaggttt tattttttgt ttggaagtag 
agggggttgg ggatgggggc gtgttcgtcg 
tttcgattta gagttttggg aggaacggag 
gtaggcgtat ttggagtcgc gtatacgttt 
gggaattttt cgtttatttg gggatggggt 
cgtatttttt ttttttatcg atgattttta 
tggcgcgggg tcggggcggg ggggcgcgtt 
tttggtttta agcgattttt tcgtttcggt 
agcgttttta gaagcgattc gtatttttaa 
tttttcggtg ttgggttaac gtttaatttt 
acgttttcgg ttttgttcgt tttgttagtt 
ttatcggtga tttttattac gttatttttg 
tggttgatat ttagggatac gcgttagcgg 
ttgttgagtt gtcggttgag cgcgcggttg 
atggcggcgg ggggtagggg gcgtacgtag 
gattattttt tcggtagtcg gggtagttcg 
tattagtcgc ggaaggggtt ttagttgggg 
tcggttatgt tggttgattt tgttttggac 
tcggttgcgt ttttatgttt ttttagtcgg 
tattaatggt aatgattcgt ttgagggtcg 
taggggtcgt gggcggggcg aagagggttt 
gaaagtaaat gaattcgaga gcgcgacgtt 
ttttggtgtt taagcgttgt tggttgtggt 
ggatagagtt agataggttg ggttggggta 
gaattatacg cgtgtgagat agaatgtgat 
tgaggttgta gtgagttatg attttattat 
tttgtttttt aaaaaaattt taaaaagggg 
gtattttggg aggttaaggt agaagggttt 
gtaatatagt gagattttat ttttattaaa 
tgtatgtttg tggttttagt tattttagag 
tgagtagagg ttgtggtgag ttggattgta 
aaattttgtt ttaaaataag taaatgaata 



seq_prot.txt 

ggcgttaggt cggggtgggt gggtggcgtg 3600 

tttattcggt gtgtaatgta acgtttgttt 3660 

gtgtggattt tatttaagtt tttttttttt 3720 

tttttatgtt taagttagtt acgtagttta 3780 

cgcgggttta gtttgggggt ttagaagttg 3840 

gttttagttc ggatgtttat ttttgttgtc 3900 

gtgttttttt gatatattta ttttttgttt 3960 

ttattggttt aggttttggt gtttgtggaa 4020 

ttttaggagt tatttttgtt taggttcgtg 4080 

gttggatata ggttgattta tatttataaa 4140 

ttatatttat aattttagta ttttgggggg 4200 

4216 



DNA (Homo sapiens) 



gggattgtag gtatgagtta ttgcgttcgg 60 

ttagtttgtg tttagtaatt tatattagtg 120 

aagatgattt ttgaaagtta gtaaatggta 180 

gggtttgggt tagtgatagg tggttgtttt 240 

tgtattaaaa gaatatatag gtggcggggg 300 

gtattcgggt taaggtttta tttggcggta 360 

ttaagttggg ttcgcgattc gaaggtgatt 420 

agtttgggta tgggggtttt tacggttagt 480 

tgggtggggt ttatatcggg gggtagtttg 540 

ttatatatcg ggtgggtttt tgtgtttttt 600 

atttcgattt ggcgttagga tttacgtgta 660 

atgttcgttt tgtcgttttt cgtgtttgtt 720 

gggaggggat tgtttttcgg tttcgttagg 780 

ttaggattgt aggttggtag ggattaatag 840 

gtttgggttt ttagaaattt tcggaaagtt 900 

tagggcgcgg agttggggag tgagtagtac 960 

atttagcggt tagaaattgg tatcgcgtta 1020 

tgtgggtggg aaggggattt tgtagaaaag 1080 

aaaaaatttt tgtttgtttt ttttttgtaa 1140 

tcgttatgtt gtttaggtta gttttgaatt 1200 

tttttaaagc gttgggatta tagagtagaa 1260 

tttttgagta agggaattag gagttttgtt 1320 

ttaggttttc gggattttcg ttttaatcgt 1380 

tttttttttt ttttgtagga gtaggggggt 1440 

gttttgatcg ttagttcgtt cggggcgaag 1500 

ttcgtagtgt gtcggatttt cgagatttcg 1560 

taggcgggcg cggttattgc ggggttttcg 1620 

tttggttagt tgttgtcgtt taattattgc 1680 

aaggtttggt cgaagaggcg gttatgcggg 1740 

tttcgtagga gcgagaaggg gacgcggcgt 1800 

gtttgtttag aaaagtgcgg ggcgttgggt 1860 

gtatttttag taggcggtgt ttgaggtttt 1920 

ttttttttta tgtatttttt ttttagcgtt 1980 

agtttttatt tggaattttt tttcgttaag 2040 

ggagtcgtgt tacggtaggg tattcgtcgg 2100 

tcgtggttag ggtcgttttt ttgttaagga 2160 

tataattgag aagtggtaga ggtaggattg 2220 

aaaaggagtt tggtgtttga gtttaggagt 2280 

tgtattttag tttaggtaat agattaaagt 2340 

ttgggtgtat ggtttatgtt tgtaatttta 2400 

tttgaggtta ggagtttgag attagtttgg 2460 

aaatatttaa aaattagttg ggtatggtgg 2520 

gttgaagtag gaggattttt tgaatttcgg 2580 

ttattgtatt ttagtttggg ttatagagta 2640 

aatataaata taaaaa taaa gtagtttggg 2700 

Sei te 154 



ttgggcgtgg 

attagaggtt 

aaatataaaa 

tgaggtagga 

attgtatttt 

gaaaaaagta 

gtttgaattt 

tgtgttttag 

gtataaagat 

attattggtt 

ggaggattat 

tttattaaaa 

tttaggaggt 

gatggtgtta 

ataaaaataa 

atttattttg 

ggtttatttt 

agttgggatt 

gggttttatt 

tgggtttttt 

tttagaaata 
atgttgtaga 
taggttagat 
tgtttaataa 
tggtttgagg 
ataattattg 



tggtttacgt 

aggagtttga 

attagttggg 

gaattgtttg 

agtttgggtg 

atttagtaat 

atatttttgt 

tttttttatt 

gtgtaagtta 

gggtgtagtg 

ttgaagttag 

tttagaataa 

tgaggtggga 

ttgtatttta 

ttatcgtgtt 

agatggagtt 

aatttttatt 

ataggtaagt 

atgttcgtta 

aaagtgttag 

ttaatattta 

ggtttgtagt 

tatttaggtt 

ggaatttgtt 

atttggtgat 

ttattt 



ttgtaatttt 
gattagtttg 

tgtggtggcg 
aatttagaag 
atagagtgag 
ttggtgtggt 
tttattatta 
tggaaaatag 
gtaagggata 
gttgatattt 
aaggttggaa 
attagttagg 
ggagtatttg 
gttcgtatga 
tttattgtat 
tttttttgtc 
ttttaggttt 
gtcgcgatat 

ggttggtttt 

gattataggc 
tttttatgtt 
ttaggagtag 
tgtgtaagtt 
gaatgtgttt 
ttaatgtagg 



seq_prot . 
agtattttgg 
gttaatatgg 
ggcgtttgta 
gtagaggttg 
attttatttt 
tgttaggcgt 
agtagatgaa 
agatgataat 
gtaaatatga 
gtaattttag 
taagtttggg 
tatggtggcg 
agttaaggag 
tagaataaga 
attttttatg 
gtttaggttg 
aagtgatttt 
tggttaattt 
gaatttttga 
gtgagttatc 
gtagttgttt 
taggttagat 
tattttgaga 
gttattaagt 
tgatgatttt 



txt 

gagatcgagg 

tgaaatttta 

atcgtagttg 

tagtgagtcg 

aaaaaaaaaa 

gtggattttg 

tttgtgtaag 

gatggtaata 

aggtggtttt 

tattttggga 

taatatagtg 

tggatttgta 

gtcgaggttg 

ttttgtttta 

ttttatttat 

gagtgtagtg 

tttgttttag 

ttgtattttt 

ttttaagtaa 

gcgtttggtt 

ttaatatttg 

tatatggttt 

tgtttttata 

gatttatgat 

ggttcggagt 



tgggaggatt 

tttttattaa 

ttggggaggt 

agattttatt 

aaagaaaaaa 

aagttagatt 

ttagattttt 

gttatgtatt 

ataaaattat 

ggttgatgtg 

agattttatt 

gttttagata 

tagtgagttt 

aaataaataa 

ttatttattt 

gtttgattta 

ttttttgagt 

agtatagatg 

tttattcgtt 

attttttatt 

gtgtagttat 

aggtgtgtag 
atgatgaaat 
tgtaatgggt 
ttggtatatt 



<210> 246 
<211> 11001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 246 

ttaagttaga 
attttatagg 
ttttttatgt 
taaagtatta 
gtttggaagt 
ttgtttttta 
ttagtgataa 
tttttaggat 
gttgtttatt 
tagggttagt 
tagggtgggg 
ttgcgggggt 
tttgtgcgtg 
cggcgtttga 
tttttgttgg 
gtgtaagaat 
ggtgagatgt 
tttaagataa 
tttttacgtt 
ttgggtattg 
ttattttata 
tagggttttt 
tttgtttttg 
ttttgtatgt 
gtttagggtt 
ggtatatggt 
taaaattagg 
taggaggatt 
atttttatta 
aattgtttag 
agttattatc 



tgttttttaa 
tttaatattt 
tattagatgg 
aggtaatttt 
ttttgtttta 
gtataatttt 
ttaagaaatt 
aaagtttagg 
atttaatagt 
tgtttagttt 
tgggaagttt 
tatttattat 
atagtatatg 
tatttgtgtt 
agggtataga 
cgtttttgtt 
tagtttattt 
ttttagttgt 
ttttttttga 
tattttttga 
ggttcgtttt 
ttgaaaaatg 
atgtatatat 
tttgggtgga 
ttagtattga 
taggggttag 
ttgggtgtgg 
atttgatttt 
aaaaaagaaa 
gaggttgagg 
gtgttattgt 



ttatttgtgg 
tggagttaga 
atttttgtat 
ttttggtttt 
gttataatgt 
ttttttaagt 
aagaattatg 
gttttgttat 
tttcgtaagt 
ttgtatgttt 
attttggtcg 
taagaatttt 
taggtgtttg 
tcgttttggg 
gatagggcgt 
ttaaagaaat 
ggggttaggt 
gattttttga 
gtagttaatt 
ttttttgttt 
gtaagatggg 
gtttttgggg 
atatagttgg 
taagggaaag 
atgtagtttt 
gaatagttga 
tggtttatat 
taggagtttg 
aaaaaaaatt 
tgggaggatc 

attttagttt 



ataggttagg 
aagtttttag 
ttttagaaga 
ttgggataat 
ttatatattt 
tttattgtag 
tatttacgtt 
tgggttttgt 
atattgtgaa 
aggtgtatgc 
tgacgtttag 
cgtagtaggg 
atagtatttt 
tttcgtcggg 
attagtttta 
agagaaatta 
tgggtatgta 
gtattaggta 
tattaagttt 
aattttagtt 
ttagtatgga 
tagtttttgg 
tgtttatttt 
atagaattat 
aaggatatta 
gggattttga 
ttgtaatttt 
agattagttt 
agttaggtgt 
gtttgagttc 
ggggaattga 
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tatattttga 
gtaaaaagtt 
ttttttatat 
tttaggtttt 
ttggaattgt 
ttatagttta 
tattttttta 
taggagtcgt 
ggggaagtaa 
gtatatattt 
cgcgtttaaa 
ttttaatgat 
tgggtaggta 
tattttgtaa 
gttgaatttt 
aattttgata 
taaattattg 
tatagttggg 
atgaagaggt 
tgagaaaggt 
tatagggttt 
aaattgattg 
gaatttatta 
ttggtttttt 
tagaagtagg 
agagggattt 
agtattttgg 
gggtaatata 
ggtggtgtgt 
gggagattgt 
gtgagatttt 



gtttaatttt 

tgaagggggt 

taggaaagat 

ggtatgagtg 

tttgtagggt 

ttagttttgt 

gagttatttt 

agtcgtaggg 

tgattagaga 

ttatataggg 

gagtgtaaat 

tttacgtgtt 

aaaggaagtg 

ggagggtggt 

gatgaagttt 

ataggtttta 

tttgcgttta 

tatttgtttt 

tgttgttgat 

taggtgtttt 

ttgaggaatt 

tttttggttt 

ttgtttttgg 

tttgttgttt 

ggtaattgaa 

ttatttaaag 

gaggttaagg 

gtaagatttt 

ttgtagtttt 

agttatagta 

gttttaaaat 



2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4216 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 



seq_prot.txt 

ataaaaaata aaaataggtt gggtacgttg gtttacgttt gtaattttag tattttggga 1920 

ggtcgaggcg ggtggattat ttgaggttag gagtttgaga ttagtttgat taatatggag 1980 

aaatttcgtt tttattaaaa atataaaatt agttaggcgt ggtggtatat gtttgtaatt 2040 

ttagttattt aggaggttga ggtaggagaa ttgtttaaat tcgggaggtg gaggttgtag 2100 

tgaattgaga tcgtgttatc gtattttagt ttgggtaata agagcgaaat tcggttttaa 2160 

aaaaaaaaaa aattagtaaa attatatttt aattgtatat tttgattata gtattttagt 2220 

tgagttggag tgagggtttg ttttggagaa ggtagtttat tttttttttt tgtttcggta 2280 

cggggrtatg atttattgta gggtgagagg agtggagagt ggtgtatatt agtagtttag 2340 

ttattagtgg atagagtagt atttggagtt agttttttat gttttatata tagtgagaaa 2400 

aattattgtg atatgatgtt taattttgat ttaagttgta taaaaggtag ttttaggtta 2460 

ggttttaatt tgttagaggt atataggtag ttttttggtg ggtttttgta tttgtttgtg 2520 

ttgtttggag atttggttta aagatttttt ttttttttga gacgaagttt tattttgtcg 2580 

tttaggttgt agtgtagtgg ttggattttg gtttattgta agttttgttt tttgggttta 2640 

agcgattttt ttgttttaga ttttcgagta gttgggatta taggcgcgtg ttataataat 2700 

attcggttaa tttttgtatt tttagtagag atgggatttt attatattgg ttaggttggt 2760 

tttgaatttt tgattttaag tgattcgttt gtttttattt tttaaagtgt tgggattata 2820 

ggcgtgaatt atcgtattcg atttagagat ttttaattcg attatttatt ttttatttta 2880 

tttagggatt ggatttttgt cggaagggtg gagtgtggga tagggtagtt agggttttga 2940 

atcgattttt tttttttaga tttttttggt tttattgtat tagttttatt ttttgttgac 3000 

gttagatagg ttttagttag aatgcgagtg ttatagatat agttaagttt agcgttgatt 3060 

aatattttgt tttagaagaa tttttataag gttttttgta gaatgatttt gtgtttagtt 3120 

taggagagtt agggtttttt ttgatttcgt tttggagttt ttttaagtat ttaaattatt 3180 

tgatggggat aaatggagag gatagatgag ggagtagggt ggagcgtttt agtagaatgt 3240 

tttttattta gaattcgttg ttattttgta gttagtaagg atgtggggtt aagaattaag 3300 

gttagggttt tataggaaaa aggtaaaggg ggaggggtgg gaatttaagt ttattttttt 3360 

ttttaagtat ttaaaggttt tttggatgga gaagagtatt ggagtaaaaa ttttagtata 3420 

aattttattg gggatagtgg gtaattttgt cgggttagta aaaataaatg gtgtgggttt 3480 

tggaaaatga gggttggagg ttgtgaataa agtagtggat gtgtttgttt agtatattaa 3540 

cgggaagaag tatttagatg ggaggagtat taggggtagg agaaatgtta gatagatttt 3600 

agtgttaggg taagaaggaa gattattttg tttgtagaat agggagggta tagggatggt 3660 

gttaatttgt ttttgtgatg gttttgagtt tttatttaat aatgagaaag tttgtttttt 3720 

tttttttttt tggatgattt aggagttttg ggttgggatg tagtgatttt atttttagtt 3780 

tttttttttt tggtgatgaa tttttttatt tttatttaga aaatagattt ggattagagg 3840 

tattgtatag tttttttagg attttaaagg aggaagagtt tttttttttg tttttaaagt 3900 

tgtttgttgg aagaggattt taatagttat tttagtcgga tgtatagtag gattatggaa 3960 

tttttttttt gtattatagg gatttatttt ttattttatt attgtttata aaaattgatg 4020 

gttttttttt tgagatagag tttcgttttg ttttttaggt tggagtgtag tggtgcgatt 4080 

ttggtttatt gtaatttttg ttttttgggt ttaagtaatt ttttgtttta gttttttaag 4140 

tagttgggat tataggtgtt tgttattata attggttaat tttttgtatt tttagtcgag 4200 

acggggtttt attattttgg ttaggttggt tttgaatttt tgattttatg atttatttat 4260 

ttcggttttt taaagtgttg ggattaaagg tgtgagttat tgtatttggt ttaaaattga 4320 

tgtttttttt ttttttttta atatataatt tgggattttt tagtttttta tttttttttt 4380 

tttttttttt ttttttttga gatagagttt tgttttttta tttaggttgg aatgtagtgg 4440 

tttagtttcg atttattgta atttttgttt tttgggttta agtgatattt ttgttttagt 4500 

ttttttagta gttgggatta taggtatata ttattatggt tagataattt ttttgtattt 4560 

ttagtataga cggggttttg ttatgttggt ttggtaggtt tcgaattttt ggttttaagt 4620 

gatttgtttg ttttggtttt ttaaaatgtt gagattatag gtatgagtta ttaagtttag 4680 

tttttttttt tttttttgag atagagtttt attttgttat ttaggttgga gtgtagtggt 4740 

acgattttgg tttattgtaa tttttgtttt tcggttgaag tgatttagtt ttttaagtag 4800 

ttgggattat agttatatat tattatgttc ggttaatttt tgtatgttta gtagagatag 4860 

ggttttatta tgttggttag gttgatttcg aatttttgat tgtaaatgat ttatttgttt 4920 

tggtttttta aagtattggt attagaggtg tgagttatcg tatttggttt ttttttttat 4980 

ttttgagata gagttttatt ttgttattta ggttggagtg tagtggtacg attttggttt 5040 

attgtaattt ttgtttttta ggtttaagtg atttttttgt tttatttttt taagtagttg 5100 

ggattatagg tgtgtatttt cgtggttagt tttttttttt aattggttag tgttttgtgg 5160 

tttttttatt tttttatagt ggaaaatggt ttaggattga ttgatatgaa gataagttta 5220 

ggggtttata tttaatttaa tttttgtatt taagttttgg gttaagattt tggcgtgttg 5280 

agtattattt attttgtaag gaattttgta aaattttatt tgaagtatta tttataattt 5340 

tatttttttt atttaaataa ggattttcgt tttatttttg ttaggtatat tgagttttat 5400 

agtttttgtt tttttttttt ggtgtttagg tttggttttt tgagtttggt ggttatatta 5460 

atggtatttg gtatatagtt tttcgataat ggggatattt aggaggtttc gagatatttt 5520 

atagttttgg gttagtaatt tggatttttt tttttatttt tttaggtatt ttataattta 5580 

gttttttttt tttttgtggg taaagtgttt ttgaatgttt atggtttaaa ataagatttt 5640 

ttttttattt atttttaaat ttttttttag atttatttta gaggaaggga atagaatttt 5700 

ttatatttta gtagttggtg ataggttaga atagggaaga ggtgjagggtt tagttggttt 5^ 6 2 

tatataggag tgtagatgga ggagtaggat ttttttttgt tttttaagtt tttttaaata 5820 

t^ttttttaa tttttggcga ggattttttt ttttttatat ttttttttag tttttttaag 5880 

gdg^gagtag gagtattcga acgcg ga aat cgaggtgtt^ Qtjtaaattg ttfcggtcggt 5940 
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tttagttata 
tcgtgtttaa 
tggaatcggg 
ttacgtagtt 
ggcggagttt 
tttcggcgcg 
ttcgagtttt 
gaaggcgttt 
agtcgttaat 
ttcgggttag 
tttgttttat 
attttatcgt 
gttagcgttc 
cgggaagtgc 
ggagtatttg 
ttttgtgtcg 
ttcgagaatg 
gcgtttatta 
taatagttta 
gagaggtttg 
ggaagtttta 
ggttatttta 
cggaaggtta 
tttttttttt 
ttatttaggt 
ttaagcgatt 
gtttagttaa 
tttaaatttt 
ggtttgagtt 
tatttgaagg 
ttattttttt 
ttttttattt 
tttagcgatt 
tagtattatt 
tttatttttt 
acgatgaagt 
agatttacgg 
tattaggatg 
tagttttatt 
tagttattgg 
taaatatgtt 
cgttgtaatg 
attttgtcgt 
ttgtgtttaa 
tattacgttt 
taggttggtt 
gggattatag 
attttgtgtt 
atgtatatat 
atgtttaatt 
tagggtttat 
ggattgtaaa 
gtttgttggc 
ttagtatttg 
tttgtttgta 
taattttttt 



ttgggaagat 
ttttaaagat 
atttgtggtt 
agcgatgggg 
tcgcgatttg 
gcgcggggtt 
tcgttataga 
cggttaggga 
ttggtacgtt 
agtttaatga 
^ggcgttagt 
tttaggattt 



gttggataat 
tttgtttgtg 
gttttacgta 
aggagggtgg 
tagcgatcgt 
gtttgcgagt 
tttggttgta 
tcgtttcgtt 
tatcgtttcg 
agtcgcgtcg 
tggggtagga 

ttcggagttt 
gggtaagcgt 
gcgtgcgcgg 
cggagtcggt 
tcgggaaatc 
tttgtattgt 
cgttcgtgtt 
ggtagacgag 
gcggtgaggc 
gggagagggt 
tattgttacg 
taggtagaga 
ttttattggt 
tggagtgtaa 
tttttgtttt 
tttttgtatt 
tgattttagg 
atcgtattcg 
aaaaattaaa 
tagattaaga 
attttagtta 
tgtgtattat 
tgaagtttgt 
gtagtttttg 
ttgggtttag 
ttgaaatatt 
tttaaaagac 
gttatatttg 
agataaaagt 
taaattattt 
ttttagtttt 
taaggttgga 
gtaatttttt 
ggtttatttt 
ttgaattttt 
gtatgagtta 
ttttttagtt 
agatttggga 
tagattttat 
ttttagtagt 
agtatttttg 
gatttttagg 
atttattggg 

gggggtttat 

ttattttaaa 



gttcggttta 
atagaaatta 
cggtttcgcg 
ggttgtttat 
tttttttttc 
tagcgaggtt 
gtagttattg 
tatagtcgta 
gtcgcgtgcg 
taatcgttaa 
acgtcggggc 
tatrttataga 
ataagagtgg 
agtttgggtt 
aatttaggtt 
ggtaattaga 
ggtttgttta 
tcgttttatc 
cgcgcggtag 
ggaggagttt 
cgttgttcgt 
cggattttaa 
ggtttgttta 
aattaaaaaa 
tggcgcgatt 
agttttttga 
tttagtagag 
tgatttattc 
gttttttatt 
tatttttaat 
tattggtttt 

ggtttgtttt 

taaatggaat 
tttttttgat 
tttagttttt 
ttttgattat 
gatttaaatt 
gttttaagtg 
tttagttttt 
ttaaatttat 
tgttttttat 
gttttttttt 
gggtagtggt 
tattttagtt 
tttttatttt 



ggatttcgag 
aagaaattta 
tagatacgag 
cgaaagtaac 
attcgcggcg 
attatttacg 
ggtcgtatta 
ttagttgtcg 
ttagtcgggt 
gtttaggttt 
aagcgcgggt 
agattgggcg 



gattttaggt 
tcgcgttcgg 
gaatttgtga 
tattatttat 
gttataataa 
tgatttaata 
gtgattttag 
ggtgtagaag 
ttagaaaagg 
taaaggtttt 
ggttttggag 
gatagtaggt 
aaattagcgt 

tggagtgcga 
ggatttagtt 
attgcgttgt 
cgcgtattgt 
cg^gtgcgtg 
tgtggggttg 

gcggtttttt 
tttcgatatg 
cgggcggggt 
gcgggaagga 



seq_prot.txt 
ggtttattat aagttatata 
agggggtttc ggtatttagt 
ggtaggtttt cggttaggat 
cgatttagga cgcggtaacg 
gttcgtcggt attttttggt 
cgcgcggtga agtattgttc 
ttcgtcgtgt tgttttaggt 
ttcgttcgtt ttttagtttt 
tgcgtgtacg cgtgttagtg 
gattgaaacg tagatcgtcg 
ggggatacgt acgtttcgtt 
ttttatttat tatagggaac 
tttttggtcg gaggcgaggg 
agtttgggtt cgggttcgtt 
ttttttttag ttttcgcgta 
acgttttttg cgcgcggtat 
ttttcgtttt ttttatacgt 
gcgggttttt agtttaggtt 
cggtagtggt aggtgaattt 
taggtcgggg aaatgtttcg 
taggtttcgt aggttcgatt 
tgttggttat ttgggcgttt 
atagttggat ttttatcgtt 
ataataataa aaaattgcgt 
tcggtttatc gtaattttcg 
gtatttagga ttataggcgt 
acggggtttt attatgttag 
gtttcggttt tttaaagtgt 
gggaacgtat atggaatata 
ttacgtttgt tttgtggttg 
tatatatttt aatttttcgt 
tttttttagg aaattgttcg 
ttagtgtttt attttttatt 
ttttaggggt tatatttttt 
ttgtagattt tgatttaatt 
tggtttggtt tgtttatatt 
tttagattag atttttcgtg 
aatatggtta aaatttaatt 
tttttgtagt aaaaatggtt 
ttttgatttt ttttttgttt 
ttttatggtt attttatttt 
tttttttttt tttttttgag 
acgatttcgg tttattgtaa 
tttcgagtag ttgggattat 
ttagtagata tgaggtttta 
gatttattcg ttttcgtttt 
ttttttgttt attttttgta 
tgtttttttg tatcggatga 
ttataagttt ttaaataggt 
tttggggagt ttttaaaatt 
ggtttgcggt gggatttagg 
ttggtgttta tttaggggag 
gattgttggg tgtgtggttg 
gtgtttgtta aataaagatt 
aaattttttt aggttttttt 
ggggttatga gtgtttgaga 
ataaattttt ttttaagaag 
atttttatat ataagtagtt 
taagggataa attattttcg 
ttcgggagtt gttttcgtcg 
tttttggttg tttttttcgt 
aggtttttgc gtacgacgtt 
gcgggtttcg cgggttggaa 
tacgcggtgt ttcgcgcgat 
ttagcgcgtt tagcgggtgt 
gttcggttgg gttcgtgttt 
tatagggcgg gtttcgattt 
gttgaggaga gt;cgcgtdat 
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gttgtttttt 
atttaggcgg 
tcgcggggag 
gatcggggag 
ttttatttgg 
gagtttcgag 
tatttcgaaa 
gcgcgttcgt 
tgcgcgtgcg 
ggatttagtt 
tttaggaatg 

gggggcgggt 

cgggaaggtg 
tgtagcgggt 
gaattagttt 
ttaggtagtt 
tttcggtttc 
tcggggttcg 
gtaatttgta 
gagattgaag 
tattttagtg 
ggaaatcggt 
tagtatagaa 
tttgtttttg 
ttttttgggt 
tcgttattat 
ttaggttggt 
tgggattata 
tttgtttatt 
ttatttgttt 
ttttattttt 
ggatagggtt 
tttatttttt 
tagttttttt 
tttatatttt 
tatttgtttt 
ttagtatttt 
tttttttttt 
attaggtttt 
ttatttttga 
tttttgagaa 
atagagtttt 
ttttcgtttt 
aggtattcgt 
ttatgttggt 
tcgaagtgtt 
ttttgttata 
gagggttttt 
tagagtagtg 
tattgatgtt 
ttagcgggga 
agtaattttt 
cgtgtatatt 
taataaaatt 
ttataggtgg 
agaggtaagt 
ggttaaggta 
atttttgttt 
cgtatttttt 
^tttttttifg 
tttcgtaggc 
ttagatgaag 
gcggtggtta 
gcgtagcgcg 
tagttttcgt 
cgttggtttt 
tagcgttttt 
ggaa^tttgg 



6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 



seq_prot.txt 

gtgtagggat tgtggggttc gaaggcgggg ttgggcgcgt tttcgtagag tttttttcgt 10080 

tttgtttttt tttttttttt tcgttttttt tttatatttt atttcggacg gttataacga 10140 

cggcgatcgt aaagtattac gcggagatat tcgtgttttt ggaggttagt tttattgtgt 10200 

tagaggaaga gggtttttat attcggtttt ggtttttttg gttcggtttg ttgaagtaat 10260 

atatttggtt tatttattgg gtggggtagg aagtttcgag tttttatttg gggtgaggag 10320 

gagggagatc ggttagtagt tttatcgttc gttttgtttt ttattgcgga gattggggtt 10380 

tcggtagagg ttggatcgtg attttgaggt ttaggggtgt attttgggtg gatttttttg 10440 

gtatgggtgg tcggttttta gtaattgtag tttttatttg gttttgttat tttgggttgt IDSuu 

taggatataa gtttttttat gtttttttta gtgtttgatt tggtattttt tgtaggtagg 10560 

tgggtattga ggatggtaat gtatgtgggg gatgtgggag tagggtttag aggtttaagg 10620 

ttttaggata tttttatttg tagtaatatt atttattttg gtatcgtgag tagcgtttag 10680 

aagtttttgt attgtagtaa gtatagcggg gtcgttttgg agttattgtt tttagtatat 10740 

ttagtttgta ggttttagtt tatttggggg aaagttagga aggtttgatt ggttttggaa 10800 

ggtgggggta ttttatttat atttatgttt tttgtatttt ttttattttt tttgttattt 10860 

ttataggttt tattttcgcg tttgtagtcg taggttttgt tttgaggggt tgaatatatg 10920 

ttggagttgg tgtttggtaa ttgtttgtta tttgtttttg ttttttcgtt ttagtcgttt 10980 

ttagattttt gggatttagg a 11001 

<210> 247 
<211> 11001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 247 

ttttagattt tagaaatttg ggagcggttg gagcgagaaa atagaggtaa gtggtaggta 60 

attgttaagt attagtttta gtatgtgttt agttttttag agtaggattt gcggttgtag 120 

gcgcgaaggt aaggtttgtg gaaatggtag ggagggtgga ggggatgtag gaggtatgga 180 

tgtgggtggg gtgtttttat tttttagggt tagttagatt tttttgattt ttttttaggt 240 

gggttgagat ttataggttg gatgtgttag aggtagtggt tttagagcgg tttcgttgtg 300 

tttattgtag tgtagaggtt tttaagcgtt gtttacgatg ttagaatgag tggtattgtt 360 

gtaggtgagg gtattttaga attttggatt tttaagtttt atttttatat tttttatatg 420 

tattgttatt tttaatattt atttgtttgt agggagtgtt aagttaagta ttgggaaaag 480 

tatggaaaga tttgtgtttt ggtagtttag ggtgatagag ttaaatgagg gttgtagttg 540 

ttgagggtcg attatttatg ttaagggaat ttatttagaa tgtatttttg aattttaaga 600 

ttacggttta gtttttgtcg gagttttagt tttcgtagtg gagagtagag cgggcggtaa 660 

agttgttgat cgattttttt tttttttatt ttaagtgaag gttcgagatt ttttgtttta 720 

tttagtgggt aggttaagtg tgttgtttta gtaaatcgga ttaggagggt tagggtcgga 780 

tgtggggatt tttttttttt agtatagtaa agttggtttt tagaaatacg ggtattttcg 840 

cgtggtgttt tgcggtcgtc gtcgttgtgg tcgttcgggg tggggtgtga ggaggggacg 900 

aaggagggaa ggaagggtaa ggcggggggg gttttgcgag agcgcgttta gtttcgtttt 960 

cgggttttat agtttttgta tttaggtttt tattgcgcgg ttttttttag ttttttttcg 1020 

tcgtttagtt tggattttgg gggaggcgtt gaagtcgggg ttcgttttgt ggtttcgttc 1080 

ggttcgcgtt tgttagcgtt taaagttagc gaagtacggg tttaatcggg ttatgtcggg 1140 

ggagtttgag tttattgagt tgcgggagtt ggtattcgtt gggcgcgttg ggaagggtcg 1200 

tattcggttg gagcgtgtta acgcgttgcg tatcgcgcgg ggtatcgcgt gtaattttat 1260 

acggtagttg gtttttggtc gtggttatcg tttttagttc gcggggttcg ttacgtatac 1320 

gtggtgcgat ttttgtggcg attttatttg gggcgtcgtg cgtaaaggtt tgtagtgcgc 1380 

gcgtgagtag tggtttcgcg cgtttacgag agcggaaggg gtagttaagg ggtagcgtag 1440 

tcgtcgcggg ttaagtcgcg gtagaggggg tcggcgggga tagttttcga ggattaggtt 1500 

cgttattttc gttttatcgt tgaagagtgc gcgaaaatgg tttatttttt gtcgtatttt 1560 

attcgtattt gggttataga tgagtagagg tggttgttta tatgtaaaaa tacgttgatt 1620 

ttaagttttt tatttttaaa atgttttggt tttttttgag aaagggtttg tgtttattgt 1680 

tttcggagtt tattttttta ggtttgtttt tttttaaata tttatgattt tttttagaat 1740 

ttttagggtg aagggaaatt attatttatg ggagggagtt tggaaaaatt tagaattttt 1800 

ggtgggtttt ttgtaagtag gagttttgtt gagtttttat ttagtaaata tttttttttg 1860 

atttagtgaa ttagatgtta aaatatgtac gtagttatat atttagtagt ttttttgtat 1920 

ttttgggaat cgttagtaag taaaggttgt ttttttttgg gtagatatta gttggaatta 1980 

ttaggggtgt ttttatagtt tttttcgtta gtttggattt tatcgtagat ttgttgaatt 2040 

aattgttggg agtggatttt aggtattagt aaattttaaa aattttttaa attattgtaa 2100 

tatggagttt gggttgagta ttattgtttt ggtttattta ggaatttgtg gatggatagt 2160 

gttttaggtt tgtgtgtgta tggagatttt tttattcggt ataagaggat attataaatt 2220 

tagttggggg gagtataaag ttgtgataga atgtaaagaa tgaataaggg gtcgagcgcg 2280 

gtggtttatg tttgtaattt tagtatttcg gaaggcggag gcgggtggat tatttgaggt 2340 

taggagttta agattagttt ggttaatatg gtgaaatttt atgtttatta aaaaataaaa 2400 



aaaaatgagt 
gggagaattg 
ttttagtttt 
taaggttggg 
aggatgattt 
ggtttttgtt 
gtaagtgtag 
cgttttttag 
tagtgtttta 
attgagttta 
gtaggagttg 
agtaagtttt 
ggtagtatat 
ttggttggag 
gttttggttt 
gtttagtttt 
tcgggtgcgg 
atttgagatt 
aaatataaaa 
tgaggtagaa 
attgtatttt 
tgttaatgag 
ttttttgttt 
gcgtagtagt 
ggtttttttt 
cgttttatcg 
gttcgtttgt 
ggatacgaac 
tatagtgtag 
cgatttttcg 
tgtcggtttc 
ttcgcgtacg 
tgcgtttgtt 
ggagtttcgg 
tttttgtttt 
gcggcgcggt 
tcggggcggt 
gggcgaggcg 
ttgtagttaa 
agttcgtagg 
agcggtcgtt 
tttatttttt 
gtgcgtaaga 
ttatagatag 
tattatttag 
ttcgaatgtt 
ttcgttagga 
ttttatttgt 
ttattagttg 
gatttgggaa 
atttatagga 
aggttattaa 
gaattgtgtg 
taggaaaaag 
tttatttaag 
ttttataaaa 



gttgagttga 
tattgtggaa 
ggaggtgtat 
ttgggaggta 
gagtgagatt 
tgttaatgtt 
tttggttaat 
.ggtgtgtgat 
gttgtagtga 
ttrttaaaaaa 
ggaggttaag 

gtaaaatttc 



taggcgtagt 
tttgaatata 
ggcgatagag 
atattgtagc 
gggtatgttt 
tttagtaatt 
tagtgaggtt 
atattttagt 
gtcgtggatt 
gattttatcg 
tagggggtag 
aaatgatgtt 
agatcgttga 
tgggtagagg 
gaggggagta 
ttttttaagt 
tggtttaagt 
aggagtttga 
attagttggg 
gaatcgtttg 
agtttgggtg 
gaaaggggaa 
gtggtttttc 
gtgaggtaat 
ttggggtttt 
ttaggttttt 
ttggattgtt 
gtggtgagcg 
gtattttcga 
gcggtataga 
gtaagtattt 
cgtatttttc 
cggacgttgg 
agcgatgagg 
aatgagataa 
tttggttcgg 
ggttggcggt 
gaggcgtttt 
gaggattcgg 
tcgcgtcggg 
ggagtttcgt 
tggttacgtg 
tttcgatttt 
gttgaatacg 
ttatgattaa 
tttgtttttt 
attgagaagt 
atttttgtat 
ttggaatgtg 
tagataggaa 
agggggaaag 
tttaggattg 
ttagatgtta 
aaatagggat 
taaagaaagt 
tgggtggtgt 
gtgtagattt 
aggtgggaaa 
atttgtaatt 
gaggttgtag 
ttgttttaaa 
ttgggaggtt 
atggtgaaat 
tgtagtttta 
gttaagatcg 
aagaaagaag 
gtaggtagat 
gtttgtattg 



ggcgggtgtt 
ggaggcggag 
tgagattttg 
gtttttaaag 
attagaggta 
gggagtttag 
gaagaaaagg 
gaaggtattg 

tggggtagat 

tgagatatgg 
gaggaaaatt 
gagggggtga 
ggattttgtt 
ggagagtgaa 
gagataggtg 
aaatgggtag 
ttgtagtttn 
gattagtttg 
tatggtggcg 
aatttaggag 
ataaaagtaa 
gttttgtgtt: 
ggtcggtttt 
ttattgagat 
tttttaattt 
ttgtagattg 
gcgggtttcg 

cggggtcgag 

gggttgtttg 

gaggttaatt 
tattcgttgt 
gtattttttc 
tattcgtttt 
ttatttttgg 
gagttagatt 
gcgtacgcgt 
tacggacgcg 
ttttcgaaat 
attcggagtn 
attaggtggg 
ttttttcggt 
gtttttcgcg 
atcgtttaga 
gaaaaagtag 
agtcgatcga 
ttttggggag 
atgtttagga 
ggagttagtt 
gaagattttg 
agaagttttg 
ttagattata 
taaggtgttt 
ttggtgtgat 
tgtgaagttt 
ggagttgtga 
ttagtacgtt 
ttgggtttgt 
attataagat 
ttagttattt 
tgagttaaga 
aatagaaaag 
aaggtaggtg 
tttattttta 
gttatttggg 

tgttattgta 
gttgggtttg 
tatttgaggt 
aaaatataaa 
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seq_prot.txt 
tgtaatttta gttattcggg 
gttgtagtga gtcgagatcg 
ttttaaaaaa aaaaaaaaaa 
agaaataaag tagttatgga 
gagataaggg agaaattaaa 
tggttatttt tgttgtaaag 
gaattaaatt ttggttatgt 
gtacggagga tttagtttga 
gaatgtagat agattaggtt 
aagttgagtt agaatttgta 
gggagagtgt agtttttggg 
gaatggagaa tggaatattg 
tcgggtagtt ttttggagga 
ggcgaaggat tagagtgtat 
ataattatag ggtagacgta 
atgtatttta tatacgtttt 
agtattttgg aaggtcgagg 
gttaatatgg tgaaatttcg 
ggcgtttgta attttaggta 
gcggaggttg cggtgagtcg 
gacgtagttt tttgttgttg 
aggcgataga gatttaattg 
tagacgttta ggtggttaat 
aggtcgggtt tgcggagttt 
tcgggatatt ttttcgattt 
taagtttatt tgttattatc 
ggatttgggt tgggaattcg 
ggcgtatggg aagggcgagg 
ggtgtcgcgc gtaaggagcg 
ttgcgcgggg gttgggaggg 
aagcggattc gggtttaggt 
gttttcgttt tcggttagag 
cgttttttgt ggtaggtggg 
gggcgaagcg tgcgtgtttt 
tcggcgattt acgttttagt 
atattgatac gcgtatacgt 
taggattggg ggacgggcgg 
gatttggagt agtacgacga 
cgagtagtat tttatcgcgc 
agttaggggg tgtcggcggg 
tcgttgtcgc gttttgggtc 
ggttttggtc ggggatttgt 
tgttgggtgt cggggttttt 
ttgtatggtt tgtggtagat 
gtagtttgga ttagtatttc 
attaggggag gatgtggaga 
aaatttgaga ggtagagaga 
gagtttttat tttttttttg 
tttttttttt ttagggtgga 
ttttggatta taagtattta 
aaatgtttaa agaggtggaa 
cggaattttt taggtatttt 
tattaggttt agagaattag 
agtatgtttg gtagaaatgg 
gtgatgtttt agataaaatt 
aaaattttag tttagagttt 
ttttatgtta gttagttttg 
attaattaat tgaaaaggag 
gggagggtga ggtagaagga 
tcgtgttatt gtattttagt 
gaagttaagt acggtggttt 
gattatttgt aattaggaat 
ttaaatatat aaaaattagt 
agattgaatt attttaatcg 
ttttaatttg ggtgataggg 
gtgatttatg tttgtaattt 
taagagttcg agatttgtta 
aaaattattt ggttatggt<j 
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aggttgaggt 
tgttattgtt 
aaaaaaagaa 
gataagaagt 
gataagtttg 
aggaagttgg 
ttatttgaaa 
gggtttaggt 
agtgattagg 
aaggagttga 
agttaaaggg 
gattttattt 
agaggtaagt 
agagattagt 
ggttaaaggt 
tagtgaaggg 
cgggtggatt 
tttttattaa 
tttaggaggt 
aaatcgcgtt 
tttttttaat 
ttgagtaggt 
attagagttc 
ggcgagtagc 
ggagtttttt 
gttgtcgcgc 
cggtggagcg 
atgggtaggt 
ttttaattgt 
gagtttggat 
tgatttaggt 
gttatttttg 
gtttgtgagt 
cgtttcggcg 
tttaacggtt 
acgtacgcga 
gtacggttat 
gtagtggtta 
gaatttcgtt 
cgggagggga 
ggtgggtagt 
tcgcggaatc 
ttggtttttg 
ttgagtcggg 
gattttcgcg 
gggaagagtt 
gattttgttt 
ttttggtttg 
tttggagaaa 
ggagtatttt 
aaagagattt 
tattatcgga 
gtttaggtat 
ggcggaaatt 
ttataaaatt 
gggtgtaagg 
agttattttt 
ggttagttac 
ttatttgaat 
ttgagtgata 
atatttttaa 
tcgaggttag 
cgggtatggt 
ggaggtaaag 
tgaggttttg 
tagtattttg 
ggttaatata 

gtgtgtgttt 



2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 



seq__prot.txt 

gtaattttag ttattgggga ggttgaggta ggagtattat ttgaatttag aagatagagg 6540 

ttgtagtgag tcgagattgg gttattgtat tttagtttgg atgagagagt aagattttgt 6600 

tttaaaaaaa aaaaaaaaaa aaaagaaaga ataggaggtt gagaagtttt aagttatatg 6660 

ttaaaaaaaa agaaaaaaat attagtttta ggttaggtgt agtggtttat atttttaatt 6720 

ttagtatttt ggaaagtcga ggtgggtgga ttatgaggtt aggagtttaa gattagtttg 6780 

gttaaaatgg tgaaatttcg tttcgattaa aaatataaaa aattagttag ttgtggtggt 6840 

aggtatttgt aattttagtt atttgggagg ttgaagtaga gaattgtttg aatttaggag 6900 

gtagagattg taatgagtta agatcgtatt attgtatttt agtttggaaa atagagcgag 6960 

attttgtttt aaaaaaaaaa ttattagttt ttatggatag tggtagagtg gagggtgggt 7020 

ttttatggtg tagaagggaa attttatggt tttgttgtgt attcgattgg gatggttgtt 7080 

gaaatttttt tttagtaggt agttttggaa atagaaaaag aaattttttt ttttttagaa 7140 

ttttggaagg gttgtgtagt gtttttaatt taagtttgtt ttttgagtga agatagggag 7200 

gtttattatt agaagggaag gggttggaaa tgaggttatt gtattttagt ttagggtttt 7260 

tgggttattt aggaagggaa gaaggagtaa gtttttttat tgttaggtag gagtttagag 7320 

ttattataag aataagttag tattattttt gtgttttttt tgttttgtaa ataaaatgat 7380 

tttttttttt gttttggtat tagagtttgt ttggtatttt ttttgttttt agtatttttt 7440 

ttatttgggt attttttttc gttggtgtat tgaataaata tatttattgt tttatttata 7500 

gtttttagtt tttatttttt agggtttata ttatttgttt ttattaattc gataaggttg 7560 

trtattgttt ttagtaaggt ttgtattggg gtttttattt tagtgtttrtt: ttttatttag 7620 

gagatttttg gatatttggg gaagaaaatg agtttaaatt tttatttttt tttttttatt 7680 

tttrttttgt aaggttttgg tttitagtttt tagttttata tttttgttgg ttgtagaata 7740 

gtagcgggtt ttgggtaagg agtattttgt taaaacgttt tattttgttt ttttatttgt 7800 

tttttttatt tgtttttatt agatggttta agtgtttaag gggattttag ggcggagtta 7860 

gggagaattt tggttttttt gggttaggta taagattatt ttataggaaa ttttgtggga 7920 

atttttttgg gataaagtat tggttagcgt tgagtttagt tgtgtttgtg atattcgtat 7980 

tttaattagg gtttatttga cgttaatagg aagtaaggtt gatgtagtgg ggttaaggga 8040 

gtttgggaga agaaagtcgg tttagagttt tggttgtttt gttttatatt ttattttttc 8100 

ggtaagaatt tagtttttag atgaggtggg gagtgagtgg tcgagttaaa aatttttggg 8160 

tcgggtacga tggtttacgt ttgtaatttt agtattttgg gaggtgaagg taggcggatt 8220 

atttgaggtt aggagtttaa gattaatttg gttaatgtgg tgaaatttta tttttattaa 8280 

aaatataaaa attagtcggg tgttgttgtg gtacgcgttt gtagttttag ttattcggga 8340 

gtttgaggta ggagaatcgt ttgaatttag gaggtagaat ttgtagtgag ttaagattta 8400 

gttattgtat tatagtttgg gcgatagagt gaggtttcgt tttaaaaaaa aaaaaaattt 8460 

ttgggttaaa tttttagata gtataggtag gtgtagaaat ttattaggaa gttgtttgtg 8520 

tatttttggt agattggagt ttggtttaaa gttgtttttt atgtagtttg ggttaaggtt 8580 

aaatattatg ttatagtgat tttttttatt atgtgtgaga tatggagaat tggttttaag 8640 

tattattttg tttattggtg gttggattat tgatgtgtat tattttttat tttttttatt 8700 

ttgnagtggg ttatggtttc gtgtcggggt agaggagaaa aatgggttgt tttttttagg 8760 

ataaattttt attttaattt aattagggtg ttgtgattag aatgtgtaat tgaggtgtga 8820 

ttttattgat tttttttttt tttgagatcg agtttcgttt ttgttgttta ggttggagtg 8880 

cgatggtacg attttagttt attgtaattt ttatttttcg agtttgagta atttttttgt 8940 

tttagttttt taagtagttg ggattatagg tatgtgttat tacgtttggt taattttgta 9000 

tttttagtag agacggggtt tttttatgtt ggttaggttg gttttaaatt tttgatttta 9060 

ggtgatttat tcgtttcggt tttttaaagt gttagaatta taggcgtgag ttaacgtgtt 9120 

tagtttgttt ttgttttttg tgttttgaag tagggtttta tttagttttt taggttggag 9180 

tgtagtgata cgataatagt ttattgtagt tgtaattttt cgggtttaaa cgattttttt 9240 

attttagttt tttgaatagt tgggattata ggtatattat tatatttggt taattttttt 9300 

tttttttttt ttagtagaga tgaggttttg ttatgttgtt taagttggtt ttaaattttt 9360 

gaggattaag tgattttttt attttagttt tttaaaatgt tgggattgta gatgtgagtt 9420 

attatattta gtttgatttt attttaaatg agagtttttt tttagagttt tttagttgtt 9480 

tttggttttt ggttatgtgt ttttagttgt ttttgttttt gtggtatttt taaggttata 9540 

tttagtgttg aggttttagg taggtagtag agagaagtta aatgattttg tttttttttt 9600 

atttatttag agtatgtaaa attaggagta gtggtgggtt tagggtgggt attagttatg 9660 

tatatgtata ttagggatag ggggttaaag gtagttagtt tttaaagatt gttttagagg 9720 

ttatttttta gagaagtttt gggtttttta agggttttgt gtttatgttg gtttattttg 9780 

taggacgagt ttgtggagtg ggagatattt gatttttttt aagttgagat tgagtagaag 9840 

attaaggagt ataatgttta gattaatagt aattttttta tgagtttggt gagttgattg 9900 

tttaggaagg gggcgtgggg aggagtaggt atttagttat gtgtttgata tttagagggt 9960 

tataattgag gttattttgg gtgggcgtaa gtagtaattt gtgtatattt agtttagttt 10020 

taagtagatt gatattttat ttggaattta ttattaaggt ttggtttttt tattttttta 10080 

gaataaggac ggtttttata taggttttat taaggtttag ttgaagttgg tgcgttttgt 10140 

ttttgtgttt tttagtaaga agttattttt tttgtaggat gttcggcggg gtttaggacg 10200 

gggtataagt gttaggcgtc gtattttttt ttatttgttt aaggatgttg ttaagtattt 10260 

gtatgtgttg ttacgtataa gggtacgtga agttattgag gttttgttgc gaaagttttt 10320 

ggtggtggat gattttcgta agtttgtatt ttttgagcgc gttgagcgtt acggttaagg 10380 

tgggtttttt attttatttt gttttatgtg agggtatata cgtatgtatt tgagtatgta 10440 

ggggttgagt agttggtttt gtttttgatt attatttttt ttttatagtg tatttgcgga 10500 

agttattgga_tgatgag.tag.tttt 1Q560 
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ttttgagttt tgttttgaag gaaaatgatt trtggggaggt gaacgtgagt atatagtttt 10620 

tagttttttg gttgttatta gataggattg atgggttgta gttatagtaa ggtttggagg 10680 

aggaattgtg ttggaagata agttttgtaa aatagtttta ggagtgtata ggtattgtaa 10740 

ttaaagtaaa ggtttttaga ttatttatgt taaagtttag ggttgtttta agaagttagg 10800 

aagaattgtt ttggtgtttt gatttttttt ggtgtggaaa attttttgga gatgtaggag 10860 

tttatttaat gatatgagga ggtttttttt agatttttta tttggaagtt ttttggtttt 10920 

aaggtattag gtttgtggag tgaaattaga tttagaatat gtttgatttg tttataggta 10980 

attggggaat atttgatttg g 11001 

<210> 248 
<211> 4448 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 248 

tttatttttt ttatgtgttt tttttgttta gttttgttta tttggtgttt ttgatatttt 60 

ttttttttat tttattgttt tcgaggttta ttttattttt tggtttttaa ttatttattt 120 

atgtaggatg tgaattttta aattagtatt taagatgtat atttggatat ttattttaat 180 

gttttagttt taaatttaat atattaaaag ttgtattttt tatgtttatt attagtttta 240 

aagaatgtat aattgaataa aattttgatt attattttat attttttttt tttttgagtt 300 

atttatagag gtgattaaga tagaaatgtg tgttattttt tatttatttt attgttatta 360 

attatatgta tttattaaat tatgttagtt ttattttttt aataattgta gtttaatatt 420 

tgaataattg aataaaaatt tataatttaa tttttttaat atttatttat tttaatttta 480 

tttttataat ttatatataa tttataatta tttttatata attggtatag tgataatttt 540 

agataaaatt tattgaattt ttatttttat ttttgttaat gatatatata tataatgtaa 600 

tgttgagagg gttgggaagg aagaatggga gaaaggtaga agttgatagt taaaaaaaaa 660 

aagtttttag atggtttttt tagtgttatt tttgtaattt tattaaataa gggtttaaaa 720 

tttatgttat aatatttgtt tgatgtttta tttaaattgt tttattggat attttttatt 780 

tgttaagttt tttgaaagaa ataaaattgt tttgttatag ttagatttta tgattttata 840 

tatttataac gggtagattt tgtaggggtt tatatttttt aaggtggttt gggaaaatga 900 

tatagaaaag ttttatatta gttgaaaaga aaaatgtata atttattttg gtaattttag 960 

ttttaatttt taataggata aaggaaatat gtatattata aattaatgtt ttgtgttaat 1020 

aaataattaa gtaagtagag ttgtaagtat tggttaaaat gaattttgga tatttttagt 1080 

tattaaattt ttcgaggtaa ggtagatata tatttggatt tgaatatttg tattattagg 1140 

gaatttttgt ttgtttgttt tatattgttt tgttattttt aaaagtaggt gttaaattag 1200 

gttatttgtt gtttgggtaa tgttattttt tgttataatt ataaattgaa gaaaattgat 1260 

tgtttttttt ttttagttaa tatgttgtgt ttttagtttt aaatattttt gagaagttgt 1320 

ttagatttat gagtaatgtt tttgttttta ataggttaag atattaggta ggtttttgta 1380 

tttttggagt ttttagtttt ttgtaaagtg aggaagttag attaagtaat cgttaggttt 1440 

ttttttagat cgattaattt gatggtatta gatgtaattg tttttgaatt agggtatgaa 1500 

atgaatttag ttttggtgta ttaatgtgat gattttgttt tattaaagtt tgagtacgcg 1560 

ataggtttag tattatttta tatagagata aagggtaatt ttttgttttt aaaggaatga 1620 

tataatttgt ttttgaagtg atttatatta tttttatttt tgaataattt aatgtttaga 1680 

aaataattta agaattttcg ttgattttag ggatgtaaga tacggttttt tgatagtatt 1740 

tgggattgtg gaaaaaagta attgaggaaa gggtattttt ataacgtaat attgaattta 1800 

gtgtttaagg tttattatag gaatttttaa cgatttttat aatttttttt tttttttttt 1860 

tttttttttt ttattttgaa aataaattga gaagttagta ttgggataat tatttttttt 1920 

gatttaaata aaaagttttg ggtaaatata ggtataaatt gttaaatgga aaaagttttt 1980 

ttttattttt agttagaggg aggttgggga ttttagtttt ttagaagtcg gttcgtggac 2040 

gtttagagaa ttttttcgga gattaggtta gggttattga gtttgtttag tagggcgtcg 2100 

tttcggacgt cgtttcgttt tttattttgt tagcgtcgcg ttcgggtcgc gaaggtgcgt 2160 

gcggcgttcg gtgattggcg gcggttcgga gttgttcggt tgttattggt tgttcggttt 2220 

tttttgtttt cgggttcggg tcgtaggttc gttgcggcgg attgggcggc ggaagttcga 2280 

cggcgtcggg cgagtggttg ttgagcggcg tcgcgggagt ttcgtaggtt tttcgtgttc 2340 

gtagcggagt cggaggttag ttgaattcgg tcgtgggatt tcggatagga ggaggagggg 2400 

atttatagga cgcgttaata tggatttgga aaataaagtg aagaaggtag gggggcgttc 2460 

gtggcgggcg gcggttgttt tatttgcgcg ggtcgcgcgg cgcgcggcgg tcggaggtgt 2520 

cgaggtgggt ggggttcgcg ggtttttcgg tgtgagttcg gttttgggtt tgcggtgttt 2580 

cgggtttagg gatttggttt tttgggtaga ggaggtgttc ggcggtttcg ttagtttttt 2640 

atattcggga gcgatagaiat tggaagcgcg agcgagggcg ggcgcgggat tttttttttt 2700 

agttttacgg aattttagtg gttgtaggtg ttggaaattt tatttaagat gttttagttt 2760 

tgtttgtgtt tttttaaaag gaaaggacga gtttagggcg agtgcgggcg agatcgtgat 2820 

attttttggt ttaggaagtt gaattttatt aagtttttgt ggtttggggt tttgttgtgt 2880 

tttgatagtt ttgatttttt ttttttcggt tgggttgttt ggggcgtttt aaaatgagtg 2940 
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ttgatttaat gtattgtttt cgtattcgtg ttggtgataa tttttaatgt gatttttttt 3000 

tcggttattt attatttgtt tatcgatcga ggtagtgttt tttttttagg gaattatttg 3060 

ttaaagagcg tatttttttt gtttggtagg tagttttaaa gggcggttgt tattagcgtt 3120 

ttagcgtagg gagatatttt tagtttggtt gttgtttcgt aggtattgga atcgacggtt 3180 

ttatttgagt tattttcggg ttatgcggag tttggtgtcg tttgggagaa aagggtgggg 3240 

attagattat gatttagaag gaatgttgat gttatattcg agtgaattgt tttgtgcgaa 3300 

agtttttttc gttttagggg atttgtttat gttttatttt gtggtaaatt taagttagtt 3360 

gttttttatt aaatagagaa gttttgtaga gagaggaagg gaaaaatata gaaaatagtt 3420 

tttgtttttt ttatatttat ggtttttaat attttatttt ttaaatataa agtttttgat 3480 

ttgagtatta attttatttt atgtttagaa ttggattttt gaaaatggat ataaaatatt 3540 

gtgttaggat tgtagagttt tcggttagga agttattgat taataaaaat tcgaaatgag 3600 

aaaatagttt tttttatttt ttgcgattaa aatttttagg tatttaaaaa ggtatttaag 3660 

aatagaaaaa tataaattgt taatggtagc gtgattgtat tttatttttt ttttatttgg 3720 

tttttttttt tttttttttt aaataattgt aatatttttg gttaattttg tttgttatgg 3780 

gggaaaagcg gggaaagtat tttgttcgtt tagttttttg ttgatgaata ttatttttat 3840 

ttagttttta agtaggtaag agttatttaa tttagtattt gtttaaaatt tggggggttt 3900 

tgtttttgtt tttaagtatt tatttgaaga taattttgta aaagtaaata tattttgggt 3960 

tgtcgaggat gaataggtga agtataggat ttttagggta atgatattat tttgtatgat 4020 

attataaaat ggttgatata tgtaattata tattaattaa aatttataga atgtagaata 4080 

ttaagagtaa attttaatgt aaattatgta tgttggggat aatgatgtgt taatgtgttt 4140 

tattatttgt aataaatgta ttgtattgat acgggttgtt ggtagttcgg gaggttgttg 4200 

gagataggaa ggtatggggg aggtatatgg gaatttcgaa ttttttgttt aattttgttg 4260 

tgaagttaaa aaatgtttta aaaataaagt ttgtttaata gggggagaaa aattaaagat 4320 

atttttgaaa ttttttgaga taaggagtaa ttttgaggga agaagtttaa atttaaaaat 4380 

tgtatatgtt tgtaaaggaa aaatttagac gttgtttata aataattttt ttagagtttg 4440 

aaaatgtg 4448 

<210> 249 
<211> 4448 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 249 

tatattttta aattttaagg aaattgttta taaataacgt ttggattttt tttttgtaag 60 

tatatataat ttttaagttt gaattttttt ttttagaatt attttttgtt ttaaaaagtt 120 

ttaaaaatgt ttttgatttt ttttttttta ttaaatagat tttattttta aagtattttt 180 

tagttttata gtaaaattga gtagaaaatt cgaagttttt atgtattttt tttatgtttt 240 

tttgttttta gtagtttttc ggattattaa taattcgtat tagtgtagta tatttgttat 300 

aagtgatgaa atatattgat atattattat ttttaatata tatagtttat attagggttt 360 

atttttggtg ttttatattt tatggatttt gattaatgta taattatatg tattagttat 420 

tttatagtat tatatagaat agtgttatta ttttaaaaat tttgtgtttt atttgtttat 480 

tttcggtagt ttaaaatatg tttatttttg taaagttgtt tttaaataaa tatttaaaag 540 

taaggataag attttttaag ttttagataa gtattaaatt aagtagtttt tatttatttg 600 

aaaattaaat gagagtagta tttattaata gaaagttaag cgggtaaaat gtttttttcg 660 

tttttttttt atgataagta agattaatta aaaatattat agttatttaa aagaaaaaaa 720 

gaaagaaatt aaatgggaag agaatagaat gtaattacgt tattattaat aatttatgtt 780 

tttttgtttt tggatatttt tttgagtgtt tgagaatttt agtcgtaagg aatgaaaaaa 840 

attgtttttt tatttcgagt ttttattggt taatagtttt ttaatcgaaa attttgtagt 900 

tttaatataa tattttgtat ttatttttaa aaatttagtt ttgggtataa aatagaatta 960 

atgtttaagt taaaaatttt gtgtttagaa gataaaatat taggaattat gagtatgaga 1020 

aaaataaaaa ttgttttttg tatttttttt tttttttttt tgtaaaattt ttttatttga 1080 

taagggatag ttagtttaaa tttattataa agtaaaatat gagtaaattt tttggaacga 1140 

ggagaatttt cgtatagggt aatttattcg aatgtgatat tagtattttt tttgagttat 1200 

agtttggttt ttattttttt tttttaggcg atattaggtt tcgtataatt cgggagtggt 1260 

ttaggtaggg tcgtcggttt tagtgtttgc gaggtagtag ttaagttgaa aatgtttttt 1320 

tgcgttgaga cgttggtgat aatcgttttt tgggattatt tgttaaataa ggagaatgcg 1380 

ttttttgata gataattttt taagagggag gtattgtttc ggtcggtaga taagtaatga 1440 

atgatcgaaa aaaaaattat attagaagtt attattagta cgggtgcgaa ggtagtgtat 1500 

taaattaata tttattttag agcgttttag atagtttagt cggaagggag gagattagag 1560 

ttgttaaagt atagtagagt tttaaattat aaaggtttaa tgaaatttaa ttttttgagt 1620 

taggaagtgt tacggtttcg ttcgtattcg t|tttaagttc gttttttttt tttgaggagg 1680 

tataggtaga gttgaaatat tttaagtaaa gtttttaata tttataatta ttgggatttc 1740 

gtgagattgg agagaagagt ttcgcgttcg ttttcgttcg cgtttttaat tttgtcgttt 1800 

tcgagtgtgg ggagttggcg gggtcgtcga gtattttttt tatttagggg gttaggtttt 1860 
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^?9?"^99 iJ"^ gta 9 a "tagagtcg ggtttatatc ggggggttcg cgagttttat 1920 

35 at H cggt "tttcggtc gtcgcgcgtc gcgcgattcg cgtaggtgaa gtagtcgtcq 1980 

ttcgttacga gcgttttttt atttttttta ttttgttttt taggtttitg ttaacgcgtt 2040 

ttatgggttt tttttttttt ttattcggga ttttacggtc gggtttagtt ggttttcggt 2100 

ttcgttgcga atacgggaaa tttgcggaat tttcgcggcg tcgtttaata gttattcgtt 2160 

cggcgtcgtc gaattttcgt cgtttagttc gtcgtagcgg gtttgcggtt cggattcggg 2220 

»^2?iSSSS & C S§Si agt ta atggtagt cgggtagttt cgggtcgtcg ttlattatcg 2280 

agcgtcgtac gtattttcgc ggttcgagcg cggcgttggt aaggtgagag gcggggcqqc 2340 

g "£ gaggcg Scgttttgtr gggtaagttt agtgattttg atttggtttt cgaagggatt 2400 

™E8£?25 Qtttttaaag agttgggatt tttagttttt ttttigttga 2460 

?? a ^gagaa gaattttttt tatttggtaa tttgtatttg tgtttgttta ggatttttta 2520 

tttgggttaa ggggaatagt tattttaata ttaatttttt agtttatttt ttaggtaaag 2580 

a 5 agaaaaaa aagagagaga gagagattgt agggatcgtt agggattttt gtgatggatt 2640 

I2??S£? a f" taatat tflcflttgtgg agatgttttt tttttaattg ttttttttta 2700 

„ aa "" aaa jattgttaag aaatcgtatt ttatattttt aaagttagcg aagattttta 2760 

in£?£25S *? a ?3 a H gg Sttgtttaaa agtaaagatg atgtgaatta ttttagaaat 2820 

agg 5H gtt atttttttgg aagtagaggg ttgttttttg tttttgtgtg agatggtgtt 2880 

SfSSfS? 5" tgatgaa gtag agttat tatattggtg tattaaagtt 2940 

SSSSnSSE* ta *9 ttttaa tttaggagta attgtattta atattattag attggtcggt 3000 

£££3!S£ gtttaac 9at tatttagttt agttttttta ttttgtaggg aattgaggit 3060 

SStt&S 2&8SS 25SJS8E t tag " tg "9 999atagagi tattgtSat 3120 

f aa o^n aa n^ aa HHH aa aga H5 ttgg 99ttggaagt atagtatgtt gattaaggaa 3180 

aaagagtagt taattttttt taatttgtgg ttatggtaaa aaataatatt gtttagataq 3240 

lUil^ll a ^^ g ? tat ttattt « aa aaataataaa ataatataaa Itaaattaat 3300 

a 2 aaa 5H" taataatgta gatatttagg tttagatgta tgtttgtttt gtttcgagaa 3360 

n^Hfgtagt * aaaaatatt taaaatttat tttagttaat atttgtaatt ttgtttattt 3420 

I«2aH?? o aa ^ a 2 ag ta " gat " a taatgtgtat gtttttttta ttttattaag 3480 

a *Ii aaaa I* gaaattatta aaatgagtta tgtatttttt tttttagttg atgtaaaatt 3540 

£S2E£a SE£2£ a atta "" ga aa ggt 9 tggg tttttgtaaa git€g«c^ 3600 

* g * aa9 ^^ a taggattatg aggtttagtt atgataaggt aattttgttt tttttagaga 3660 

aSSXa?? ^n aaa i?i ttagtaaagt aatttaaata aggtattaaa taaatgttgt 3720 

aalaat??^ at H ga H ag g^gtagaga tgatattaag aaaattattt 3780 

f? aaa ""5 tttttttaat tg«aatttt tgtttttttt ttattttttt tttttaattt 3840 

"" a ?"" a * a " g I§i g tgt ^ at H t tggtaaggat ggaaatagaa atttagtgag 3900 

aattattatt gtgttaatta tataaaaata attatagatt atgtataaat 3960 

SrattlS ™2i a ??£ a !" aag i gt * aa 9 a g a g« aaattgfaag tttttgttta 4020 

rlrnllnrf?, SE2»£ ta ta ?ttattaa ggaggtagaa ttggtatggt ttgatgagta 4080 

J?H^ a fH g atgatagtgg aatgaataga agatggtata tatttttatt ttgattattt 4140 

*H£ gga £ a 9 "* agaaaga fagaaaatat aggatgatgg ttagggtttt gtttagttgt 4200 

ii a «^£H 9 ggttggtgg tggatataaa ggatataatt tttaatatat tgagtttgag 4260 

a " gaga £ at tgaaatgagt atttaaatat atattttgaa tgttgattta ggagtttita 4320 

ttttatataa ataaataatt aaaggttaag aagtgagatg agtttcggag atagtaaaat 4380 

KgattaT 9 aatattaaag atattaa 9tg agtaaggttg agtaagagal aStXXS 4440 



4448 

<210> 250 
<211> 4408 
<212> DNA 

<213> Artificial Sequence 

<220> ; 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 250 

ttatattgtg agtatataaa aagtattata cggttaacgg aggacgagga attataataa fin 
?E?iI«§S taagt " taa gaaataaaat aatttgttla alaatlattt ttgttgattS 120 
^ g J aaga H gaaagtgtag gaaaaatata gttcgaataa ttttagattt ttttatattt 180 

SSSSS-S&SSS ?£S22 at aataaaattt ttaatlgaaa gtttaaaaS 240 

aaa 5? g H ta g9 a atatgtg ttttaaatga attaaattgt gaaattagtt agtaaattaa 300 

'JHSi a Si aa gtaattattt aaggaaatta aaatattgtt tagtttagtt ttgtattttl 360 

«2«£& a £ i^""" 3 taattaatta atataagtgt tttaggalta tttglagatl 420 

aa ^ a =i.H a at5 taa 9 g aa taaagtattt aaataattta agtgtaattt tgttgagtta 480' 

aa« aa tata tttaat ' tttt tlgggaaagt ttggt?a«g 540 

aaa 5 g ii g i a tftttatgtt atattaataa aaatttttaa tttattttgt ttatgtgttt 600 

SKSSI** 9 atatta «gg tatttgaatt ttagatggat ttttgttaia atgatatttt 660 

?£f nf a ^ aaa agtattttta gttttgattg atagattiaa ataaatgtaa ggaaaftttt 720 

ttaaattaga ttaatttttt ataaaaatat tttagaatgt atgaattttg atatttat^t 780 
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ttataatggt aaaagttttt ttcgtttagt ttagtaagat aatatttata taaaagagta 840 

aaaaaaaatt atattatttt atgatagttt gatttttaaa ttgtttaaga aagtaaagtg 900 

gttaaattgg aaaagaggaa tatatttcgg aggtttagaa tcgaaaattt tttttttaat 960 

ttttagttgg aaaataattt tttgtattta tttaaagtgt attttttgaa gtgttagatt 1020 

ggagttgatt ggtgattaat ttaaaggagt tataatttaa agaaatggtg agagtttggt 1080 

atttaggttt ggtttttagg taattcgttt gggtttgaga ggttattaat tgttagttaa 1140 

gatggaattt tttttttttt tttttttttt taatggataa taatgggaag ggggttaatt 1200 

ttttagtagt tgaaattttg tatttagttt tttattttga gaatgttaat ttttggttcg 1260 

aggatttgtt tttgtagtgt tggtatcgag atttaaggga agatatttcg ttttaaatgt 1320 

tagttacggt ttggtttttt tttcgatttt agtattttgt agattgttag tgtttgtggc 1380 

gggggacgaa aggaataggg ttttgtaagg tttgtttgtc gattgcgtta ttttgggcga 1440 

aatttagttt taaaagttat aaattattta cggtgaagat ttttcgaagt ggaataaatt 1500 

tttagattcg tattatttta tatttttgcg ggatagatgg tttttattta tcggttatcg 1560 

ggagagagtt gttgttttcg cgttttattg tttttcgggg cgatttttag cgagtcgagt 1620 

tttcggttgt acggtaagcg ttcgaaagtc gggtttgaga ggattgtagg gtttttgagg 1680 

gtgttaagtt tcgaaggagt ttacgggtgt attggggttt tcgaaattta gtcgttattg 1740 

gtagtttttt tttgtttttt tttagttttt tcgttcggtt tcgtattttt tttttttttt 1800 

ttttttttta tttttttttt ttttttttgt ttttatttcg tgtggggagt gacgtgacgt 1860 

tagtagagat tttattaaat tttattgtat agtggcgcgc gggcggtcgg tcgagttcgg 1920 

ttgcgcggtt ggcgatttag gagcgagtat agcgttcggg cgagcgtcgg ggggagcgag 1980 

taggggcgac gagaaacgag gtaggggagg gaagtagatg ttagcgggtc gaagagtcgg 2040 

gagtcggagt cgggagagcg aaaggagagg ggatttggcg gggtatttag gagttaatcg 2100 

aggagtagga gtacggattt ttattgtgga aaggaggatt agaagggagg atgggatgga 2160 

agagaagaaa aagtaatttg cgttaattcg gtagttttaa taaattaaag ggggagcgtt 2220 

agggtagcgg ggagatagaa acgtattttt ggggagtaaa ttaggacggg ttgggaggaa 2280 

gcgataggga aagtggttta agagacggaa taaaggataa tgtttatggg gttgtttggg 2340 

acgaggcgtg tggagtgtgg gtgtgagcgt gcgtgtgtga ttttttttta ggtttgtaga 2400 

gttgaggaaa gaggttatag taaagaggga ttgcggaggg aggaaagtga gagatcggta 2460 

gagggcggga gtggaggtgg gcgcggtggg gatgggagag gatgagtgaa gagaaattta 2520 

gaagaatgga gtgagttagt gggagagggt gggagggtta tagtcgggag cgaacgagtt 2580 

aggtttgtta gttggggaag gtcgggacgt tgggtttagt ttagttggga tatcgcgttc 2640 

gaggttaagg cgggtggatt aggtatgttg agagtgtcgg cgtataggtg ggtacggtta 2700 

cgtattgatt tagtgtttac gaagggtttg tattggataa ggtttagacg tttatagagt 2760 

ttagaatttt ttttgttgta tttatattta ataagtttat tttgggttac ggatatttta 2820 

ttttttaaaa tgacgaggtt aaggtttttg gcgaggatgg tattaaattg tacgggatag 2880 

aagtgggggt gggggagaga gtttttttta agtttatatt tgtttttgta aagtaaagag 2940 

tatgtgaaat tatagggtat atttttattc gaaaagtgtg ttttattttt gaattttgat 3000 

tttttgattt tttgatttga gtaaagatgt gtattttggt agtgagtaga atattttggt 3060 

tttgttttgt ttttgagtgg aaggattata aatataattc gtttggagga ttaggtgtga 3120 

aggtttttgt taggtatatg ggataatgtt tttttaattt taagggtatt ttgttaatgt 3180 

atgtttttgg aaagtgtcgg aatatagtta ttgtttttgg attcggattt ttttattaat 3240 

attaattttt gtttgagagt aaaatttagg ttcgttatta aaaagatatt tttttggttt 3300 

ttaattgaga ataaagtttt ttttaaaagt tgtattgttt tttttaaatt aatatattaa 3360 

tattcgtaat tttagaaata tatagtgatt cgggagaatg tgtataaaat agatacgttt 3420 

aaaaaagttt ggcgtttaaa attaatttta gttattatat aggtgttggg ttttttttat 3480 

ttttgggggt tgtttggaat atgttatgtg tttttttgaa ttatttcgtg ttttgaattt 3540 

atttgagtta gtagtaaaaa taggtaaata aatttgttta atttgttttg agtgttaaat 3600 

ttttttattt tgaaatagtt aatagtcgat agatggattt attttatgga aagggttagt 3660 

ttttttagtt acgaagaaaa ttgattagag atttatattt taagttattt ttaattttta 3720 

cgtaatattc gtgaaaattt aaattttttt tttttattta gtggaaattt aaagtagtgt 3780 

tatttaaggg gagagaaatg agggggaaaa tgtttacgtg ttgtttaatt gtattttttt 3840 

tttgattttg agaattttta tttttggttt ttgaaatttc gtcgaggtaa gaaaattaaa 3900 

tttttttaat aagttttata attgaatttt agttatagga tatcggaaag tgtagttcga 3960 

gaaagatatt tttatttttg tttatcgacg atttttgtag tttttttatt tttttgagta 4020 

atgggttaat aatttttttt tttttttttt ttattttgta gagattaaga ggcgttcgta 4080 

gtagaacggt tttgttttta gttggtggcg aggataggta attttatgga aaagttggaa 4140 

gagaatgaga aaattaaaga tagaaagatt tagagattcg cggagagata tagggagagg 4200 

gaagggagtt gcgttgaaaa gacgtaaaga tacgcgcgtg taattttttt tttttttagg 4260 

ttttagaggt ttgtaaatta gggttgagag gaaggggttc gggaagttta cgtttttttc 4320 

gjttttttttt tgtttggagt ttcgttcgtt agaggttggt taattttagt ttcggtcgtc 4380 

gtagatattg cgttgagttt ttgggttt , 4408 

<2i0> 251 
<211> 4408 

<212>, DNA » , 

<213> Artificial sequence : . 

<220> \.Y; - ; . „.". . I., ,'-'-\ 
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<223> chemically treated genomic dna (Homo sapiens) 

<400> 251 

ggatttaaaa 
cgggcgagat 
tttagttttg 
tttgcgtttt 
tttttttgtt 
ttattagttg 
agaaaaaaag 
tcgatgggta 
agtttagttg 
attagaaata 
ttttttttta 



gaaagtttga 
ttaattagtt 
cgattgttag 
tttgtttgtt 
tataatatgt 
gggttagttt 
ttattatata 
ttttagaggg 
tgagttttgt 
gttgtgtttc 
tattgtttta 
tagttttttt 
atttttgttt 
gtgagaatat 
agggaaattt 
ggaattttaa 
aatgtaggta 
aatttttcgt 
gtatgtttgg 
gtttcggttt 
ttttttttat 
tatcgcgttt 
tttttttgtt 
gtttatattt 
tcgttttttg 
aagtacgttt 
ggttggcgta 
tatagtggga 
ttttttttcg 
ttttttgttt 
gttcgttttt 
gtagtggagt 
agagggaaga 
aagttggaga 
attcgtggat 
ttttcgggcg 
gtgggacgcg 
tagggatgtg 
ggtgatttgt 
ttgtaaagtt 
agtcgagagg 
cggtgttaat 
gttgggtata 
aaaaaaaaga 
gcgaattatt 
ttttttaaat 
gatgtagaga 
gaaatatgtt 
attattataa 
taaacggaaa 
atttttatga 
attaaaatta 

tttaaatatt 



gtttagcgta 
tttagataga 
gtttgtaaat 
tttagcgtaa 
tttggttttt 
aaggtaaggt 
ggaggattat 
gaggtgaaga 
tgggatttgt 
gaaattttta 
tttttttttt 
gtttttacgg 
tttttcgtgg 
ttattttaaa 
tttattgtta 
tttagatagt 
taagcgttaa 
tttttaaaat 
aattttattt 
ttttaagtag 
ggtatttttt 
tatgtttggt 
atttagaggt 
aagttaagaa 
gttttgtaat 
ttttttttat 
tttcgttatt 
tagtagagga 
gaatattggg 
tttattcgtt 
tttttagttg 
tagtttattt 
atttttattt 
gtgatttttt 
atattttata 
ggttattttt 
ttgttttttc 
gattgttttt 
gttcgtgttt 
ttttttcggt 
cgtttttcgt 
ggatcgttag 
ttggtggaat 
gagagggatg 
ggagtagaaa 
tttttcggaa 
tttgtcgtgt 
gagatagtag 
agataatgcg 
ggtttttggg 
ttgttttttt 
gaagttagat 
attgtaggaa 
aagttttagt 
aaagaaaaag 
tgggagttag 
tgattattag 
attatttttt 
tttttttttt 
ggtggtgtga 
aaatttttat 
aaaattaatt 
aagatgtttt 
aataatatta 

. _ ■ * 



gtgtttgcgg 
aggggggcga 
ttttgaaatt 
tttttttttt 
ttattttttt 
cgttttgtta 
tagtttatta 
tgtttttttc 
tgaagaaatt 
gagttagaga 
tttaaataat 
atgttacgtg 
ttgaagaggt 
gtgaagggat 
atttaaatga 
ttttaaaagt 
gtttttttaa 
tgcgagtatt 
ttaattaggg 
gaattaatat 
aaaaatatat 
agaagttttt 
aggatagagt 
attagaaaat 
tttatatatt 
ttttattttt 
ttaaaaaatg 
aattttagat 
ttagtgcgtg 
ttgatttcgg 
ataagtttag 
tattttttta 
tcgtttttta 
tttttaattt 
cgtttcgttt 
tttgtcgttt 
gttgttttgg 
tttttttttt 



ttgtttttcg 
ttcggttttc 
cgtttttgtt 
tcgcgtagtc 
ttttgttgac 
agagggaggg 
gaaattgtta 
tttggtattt 
agtcggaggt 
ttttttttcg 
agtttggaaa 
gttaagtttc 
cgtttttcgt 
cgtggttggt 
taaattttcg 
tattggaaga 
attttatttt 
gtttggatgt 
ttaattttaa 
agttggagat 
agtttaatta 
jttttttttta 
tattataaat 
tgatttaaag 
tattatataa 
aigaaaataaa 



cggtcgggat 
gaggaacgtg 
tgaaagggga 
tttttttgtg 
ttaatttttt 
cgagcgtttt 
tttagaggaa 
ggattgtatt 
tgattttttt 
ggaaatataa 
attgttttga 
gaggttagaa 
taattttttt 
ttagtattta 
atttaaaata 
agggaaagtt 
acgtatttat 
ggtatattga 
attaaagaga 
tggtgggaaa 
attaatagga 
atatttggtt 
taaaatattt 
tagggtttag 
ttttgttttg 
atttcgtgta 
agatattcgt 
tttatgagcg 
gtcgtgttta 
gcgcggtgtt 
ttcgttcgtt 
gatttttttt 
tcggtttttt 
tgtaggtttg 
taaataattt 
ttttttagtt 
cgtttttttt 
tattttattt 



tggggttaat 
agtttttcga 
gggagttgta 
ttttttcgcg 
tatgagattg 
ttaattttta 

tggggaggtt 

tttcggtgtt 
gtttcggcga 
ttaaatagta 
gtttttattg 
atggtttaaa 
tataaaatga 
aaataaattg 
cggagtaatt 
tagtatttat 
tttatgtata 
tttaggaaga 
tgttttttta 
attcgaattt 
tgtttttgag 
ttttaggcga 
tgtttattat 
aagtaaggta 
taggagtaaa 
atttaatatt 
gatttagggt 
tttgagtttt 
tttgtgcgtc 
ttagttaagt 
ttcggttgtg 
tatttatttt 



gttggttttt 
gatttttcgg 
cgtttttttc 

gggttcggtc 

gttacgttat 
agaggagaga 
gtggcggtta 
ttaggagttt 
tcggttcgtt 
gtagtcggta 
tttgttttat 
gtttaaggta 
tatagatatt 
atttaaaacg 
ggttaaggat 
ttagtttttt 
aattggtagt 
taagttttta 
tttggtattt 
taagaaaaaa 
ttttattttt 
tttttttgtg 
gtaaatatta 
aaattttttt 
aatattattt 
gtatataagt 
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attttttttt 
aaagaaggtt 
tatgaatatt 
cgttttgatt 
ttgatttatt 
ttttttttgg 
aagtgtttcg 
ttcgttggta 
ggcgttcgtt 
ggtcgttcgc 
tttttatacg 
gagtgcgaga 
gatttcggag 
tgtagttttt 
ggaaatcgtt 
agtggaggtt 
ttcggagaat 
acgtagtcgg 
aataatttat 
aggtattttt 
tagtattttt 
ttttattgtt 
tagtgatttt 
ttattttttt 
taggagatat 
attttcgatt 
ttaagtaatt 
tgagtattgt 
gaatttatat 
gtatttgttt 
tggtagaaat 
aaaataaktt 



tagtttttgg 60 

gttttttttt 120 

cgcgcgtatt 180 

gatttttgaa 240 

tttattttcg 300 

taaaatgaaa 360 

gtaaaaatcg 420 

ttgtaattag 480 

gattttaaaa 540 

cgtgggtatt 600 

ggtaaagaga 660 

atgtagattt 720 

gtttatttgt 780 

agtaagtttg 840 

taagaaaata 900 

atagtgatta 960 

ttttttcgag 1020 

gtaatataat 1080 

atagcgggtt 1140 

aggagtaatg 1200 

attgaaaaaa 1260 

attatattta 1320 

taaaatatat 1380 

tatttttcga 1440 

tgtggatttg 1500 

attttcgtta 1560 

gaatttgttg 1620 

gtttagtgta 1680 

gatattttta 1740 

tgggtttagc 1800 

gtttttttat 1860 

tttttatttt 1920 

tttcgtagtt 1980 

atatacgtac 2040 

gttttttgtt 2100 

tgttttttaa 2160 

agggttgtcg 2220 

tttttttttt 2280 

ttaggttttt 2340 

tttgtttttt 2400 

cgggcgttgt 2460 

gcgttattgt 2520 

gagtaggagt 2580 

tcgagcgaga 2640 

gttttagtgt 2700 

ttaggttcgg 2760 

tcgggaagta 2820 

atttatttcg 2880 

ttttatcgta 2940 

taaatagatt 3000 

agggtgttga 3060 

ttttaaattt 3120 

aagataaagg 3180 

atttattggg 3240 

ttaggtttaa 3300 

ggattigtaat 3360 

attttaaatg 3420 

ttaaattttc 3480 

tagaaattaa 3540 

tttattaaat 3600 

attttaaaat 3660 

tagtttatta 3720 

ttatttaaaa 3780 

gaagattttt 3840 



seq_prot.txt 

gttgatgtaa tatgagtata taatatttta ataattaaat tttttttaaa aaattaaata 3900 

gttattttat ttgtggaatg ttttatttta atttagtaaa attatattta aattatttag 3960 

gtgttttgtt ttttaagtta agcgtgtttg tttttaaatg tttttaaagt atttatatta 4020 

attggttgta aagaacgtat atatatggta aaatatagaa ttgaattgag tagtatttta 4080 

atttttttaa ataattattt attataaatt aatttattgg ttaattttat aatttagttt 4140 

atttaaaata tatgtttttg tgttgtttat ttttaaattt tttattaaag attttgttat 4200 

ggggtaataa agtgtatgaa aaggggggaa atgtgaaagg atttgggatt attcgaattg 4260 

tatttttttt gtat ttttag ttttgcggta gttattagaa attatttttt agtaaattgt 4320 

tttatttttt agggtttgtt tgtttgtttt gttatggttt ttcgtttttc gttagtcgtg 4380 

tagtgttttt tgtgtgttta taatataa 4408 

<210> 252 
<211> 4435 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 252 

gttttttttt tggtcgttac ggtttacgtt attaatttat agttattttt tcgtttgtta 60 

tggttttttg ttttcggttt atgaattttc gggttgttta gattggggtt gggggtggtt 120 

gaattggggg tttaaagtta tttatatgaa attgtatgta ttttcgttag tttggggaag 180 

ggagttgatt ttttggagat ttagaattta gaagagagat ttttagtaag tagtgggggt 240 

ggaatttgtg tttttagtgt ttttaaattt tttttgagtt ttgcgacgtt ttttaatttt 300 

cgttcgtaat tttttttttt cgttagtttt ggtgggtttt ttatcggcgt acgagcggga 360 

tttttagatt ttgagtagta gtaagatgat ggcgaggagg tgggagaggt tttttaggaa 420 

tcggaagaga tttatggttg gggaattttg gtagggttga gcgggaggga ggtaggttgg 480 

cgggggggtg tttttcgagg ttgttggttt gaacgggtag ttgggttggg gggacggaag 540 

agggatttta ggtgcgtttc gtttcgggga ggggattttg ggagggggag taaaggttgg 600 

agttggcggc ggagttggag tcgggaagag ggaggagagc gagaggggga ggagttcggg 660 

aggagaggtt tcgttttcga gggcggggtt tggattttcg gcgtttttta tcgtttattt 720 

tttgttttgt cgtggcggcg ttaaacgtat ttttaaagtt gttttatagg aatttgggag 780 

ggtcgtaggg ttcggaaagt gttaaattta gtatcggttc gtttaggtag tttgtgtttt 840 

tggaagagac gtagtttagt atattagttt tagtttttgt agggatgtag agattgtttt 900 

ttgagtttga aaaattttgt agggattcgg aggtttagtt gttagttttt gtagttttta 960 

ggtattttaa ttttaatttt ttagaagata agaaagatat tttatttatt ttttttttta 1020 

gatttaggat ttaaggtttt agttcgttcg ttaaatttag aagtttgggt ttttagtttt 1080 

ttttttttta gatttaggag tttagatttt tagttttttt ttttttagat ttagaagttt 1140 

agatttttag tttttttttt tttagattta ggagtttagg tttagttttt tttcgtttag 1200 

atttaggagt ttaggttttt agtttttttt tttagattta ggagtttagg tttttagttt 1260 

ttttttttag atttaggatt ttaggttttt agattttttt tttttagatt taggagttta 1320 

ggtttttagt tttttttttt ttagatttag gagtttagat ttttagtatt ttttttttta 1380 

gatttaggag tttagatttt tagttttttt ttttttagat ttaggggttt agggttttag 1440 

tttttttttt tttagattta ggggtttagg gttttagttt tttttttttt agatttaggg 1500 

gtttagggtt ttagtttttt tttttttaga tttaggagtt tagggtttta gttttttttt 1560 

tttttaggac gttgtttttt ggaatttagg gttttatttt tattatttta tggattaaat 1620 

attttaattt taagaattta gatttatagt tttttttttt acgatttata gatttaggtt 1680 

ttgatttttt ttttttttag gaatgtgtat tttattttgt ttttttagat tttgaggatg 1740 

aaggaaatag gagttttatt taggaggttt aaggttaaaa ttttgattta aattatttta 1800 

ggagtttttg gttttggttt tttttttgtt ttagagtttt tgttttgttt atatatatat 1860 

attttttttt atttttagag gtttcggtgt tttgttttac gttttatttt agagttttac 1920 

gggtggtttt ataaaagtgt cgggtttagt tttttagtag gaggggaatg ttgggtattt 1980 

gggtgtggga ttttcgggga atagtttgtg gtttggattt ttgtatttat gaggggatag 2040 

acgtggtttt tttttcggat gatggggtat ttatagatga tggaggttag ggttttttaa 2100 

taaaagaagg ggtgtaggcg tgttgatttt tttagagggt tggaaggacg gggtgtttaa 2160 

gggtgatatt tacgagtttt gggtttttga gggtggtttg tacgggggag agtcgggatg 2220 

attgagtttt taaaagagat ttcgatttgg aggcggtttt taaatttttg ggttttagta 2280 

gagaagggga tttttgggtt tgagggagga ggggttgggg ggtgggattt ttgggattag 2340 

ggtcgagatt tggtttttag gtttggtttt ttggggtaat aaaagttata gtttgggttt 2400 

tagtgtggcg aatattgaag ttagggaaag gttttttgtt tttagagttt taaggtaggg 2460 

cgggggtaga gggtagtaat ttttagtttt ggagtttagt tttgaagttg gtgtttttat 2520 

atcgggtttt gagtttttgt ttgtttgttt ttagtttgat tttttttttt ttttttattt 2580 

ttagtttttt atttttttgt tttaggagga aggtagaggt tggtagttag gggtgggggg 2640 

cggttttttt tttaagtttg gtaggagaag gggtttttag ggaggttagg agggggggtt 2700 

Qtgggttttt cggtagtggt agacggggat tgaatgttaa tcgtatttcg agtgagtgtg 2760 

tgtgtgcgag aatatagcga gtgtgtgagt ttttttcgtt ttagtttttt taagtcgcgg 2820 



tcgtcgtcgt 

gggggtgttg 

gttcggtgag 
tgtaagcgtt 
gtttgtttgg 
atggtgtgta 
cggttgtttg 
ttattagatt 
gggtgttgag 
gtttttgttt 
gaaggggtaa 
attagtttta 

ggggttgagg 

ggaaaggggg 

ggggattttt 

ggtttaattt 
tttgggggag 
atagatagga 
ttgtttttcg 
gtttcggttt 
ggggataggg 
agttggtaag 
tatttagttt 
tttttttttt 
tttttttttt 
tttatttttt 
tttttttttt 



tattttcgtt 

tttgggtagg 

gatttgtgtg 

gcggcggcgg 

gtgtcgtggg 

tttgggtttt 

tgtttttttg 

ttttattttg 

ggggaatttt 

gtgtttgaga 

gtagtttttt 

gaggtgattt 

gtataaagcg 

gaggagtcgg 

acggggtcgg 

ttagagtaat 

agatagggta 

ggttcgggtt 

atatcggttt 

tggttttttc 

gatgtaggga 

aagtttcgtg 

ttttgttttt 

tgagttttta 

ttatttttat 

ttttgtattt 

tttcgttttt 



cgtagttttt 

tcggttcggt 

ttcgggtgtt 

agggagggag 

gttttcgttt 

gtgttttttg 

tttgtttaaa 

tgtgtgtgtg 

agggaagtga 

gtgggggagt 

aagtaggtaa 

agatagatag 

ggggtgcgag 

ggttgaagcg 

ggcggtaaga 

atgtgtagtt 

ggataggatt 

gtcgttgttg 

tttgagtttt 

gttatttttt 

tggtggtggg 

gagggagggg 

tagggatttt 

tattttagat 

atatttgatg 

ttttttttgt 

tttttttatg 



seq_prot . 
cgtagttttt 
ttttaggggt 

tggggttggt 

gggtttgttt 

ttttttttcg 

tttgcgttcg 

tagtttttat 

tttgtgtgtg 

gatcgtgcgt 

taaggggggg 

tttttcgttt 

atagatatag 

tgagttaggg 

gtagaggggg 

ggatatttcg 

acgttttcgt 

aaggaagagg 

tcgrtattat 

tttcggaatt 

taatagaata 

aatgtggatt 

gtttgaacgg 

tttttatttt 

tttttacgtt 

ttttatattt 

attagttttt 

ttagagttat 



txt 
ttcggttatc 
ttttcgagcg 
tggtggaggg 
gtttgtatcg 
gtttttttaa 
gttaggcgtt 
tagtagtggt 
tgtttttttt 
gtgtgcgtga 
tttagaggtg 
tttatacgta 
acgttggaag 
agaggcggga 
gtatttcggg 
atagtttttg 
ttagtttagg 
aaggagagac 
tattgtcgtc 
ttggggtcgt 
gggttatgaa 
tttaaattta 
tgggtagggg 
tttttttttt 
tttatttttt 
ttgtgttttt 
ttatcgcgag 
gagtttgtta 



ggtgtttgtt 
tttgttattt 

ggggtgtgtt 

attttgagtt 
atttagatgg 

tgggcgtttt 

agtttggttt 
taagttttgg 
gtgtatgtgt 
gttaagcgag 
tatattagtt 

ggggggtggg 

ttggatatat 
tgggcggagg 
taatgttcgg 
tggtcgtaat 
ggagttaggg 
gtttcggggt 
tggacgtcgg 
aaggtaaggc 
ggatagagga 
ttttgatttt 
ttttttgagt 
tgatatttgg 
atttttttat 
tcggtttttt 
ttaat 



<210> 253 
<211> 4435 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic dna CHomo sapiens) 
<4G0> 253 

attaatagta 
atcggttcgc 
gagatggagg 
tgttagagaa 
gagagggaga 
taagattttt 
tgttttagat 
atttttttat 
gtttagcgat 
ggggcggcgg 
gtttcgtttt 
ggttatttgg 
tattgtagag 
gtttattcgg 
tttagtttcg 
tttttttttt 
gtgtgtgcgt 
ttggttattt 
tatatttacg 
gtttgaggag 
aggttgttat 
gtttagacgt 
tgggtttggg 
agggtcggta 
attttttttt 
gtagacgttc 
gtatcggtgg 
gtttggagga 
ttattcggga 
ttttttttgg 



gatttatagt 

gatgggggga 

tatagggata 

atgggggcgt 

gagagagggt 

gtttatcgtt 

ttgggggttt 

gattttgttt 

tttaagattt 

tagtggtggt 

tttttttttt 

attaggcgag 

gttgtcgggg 

ggtgtttttt 

tttttttttg 

agcgtttgtg 

gtgaggagcg 

ttagattttt 

tatatacgta 

ggatatatat 

tgttgatagg 

ttggtcggac 

ggggtcgggg 
tagatagata 
tattagttag 
gagagatttt 
tcgagggagg 
gttggagcgg 
tgcgattaat 
tttttttggg 



tttgatatgg 
ttgatgtagg 

tggggtatta 

gaaaggtttg 

ggaggagagt 

taaatttttt 

atatttttat 

tgttgggggg 

cgaggagggt 

ggcggtagta 

tttttggttt 

gacgtggtta 

tgtttttttg 

tttgtcgttt 

gtttattcgt 

tttatttgtt 

ggagattgtt 

ttttgatttt 

cgattttatt 

atagatatat 

ggttgtttgg 

gtaggtagga 

ggaaggagcg 

gatttttttt 

ttttagatat 

tgggggtcgg 
ttgcgggagg 
gagggattta 
atttagtttt 
gatttttttt 



gaggggagag 
gaggggggta 
ggtatgtggg 
aggtgtaggg 
ttttgaggga 
ttttttacga 
tattattttt 
atggcggggg 
ttagagagtc 
gcggtagttc 
tgttttgttt 
tatatgttgt 
tcgtttcgat 
tagtttcggt 
attttcgttt 
tgtttgggtt 
tgtttgggga 
tttattttta 
tttttgggat 
atatatagag 
atagatagga 
ggtatagaat 
ggagttttac 
ttttttcgtc 
tcggatatat 
gtcgatttat 
ttgcgggcga 
tatattcgtt 
cgtttgttat 
tttgttaggt 



cgagaggaga 
tagggagaag 
gatggggaga 
atttaggaaa 
taggggagtt 
ggttttttgt 
gtattttttg 
ggttaggatc 
gatgtcgggg 
ggattttttg 
tgtttttttt 
tttggaagtt 
ttcgtgggga 
tttttttttt 
tgtgttttta 
atttttgaag 
attgtttgtt 
aatataggta 
ttttttttaa 

tggggggttt 
ggatataggt 
ttaggtatat 
gatatttagg 
gtcgtagcgt 
agatttttat 
ttaggtaata 
gggtggcggc 
gtgttttcgt 
tgtcgggaga 
ttggggaggg 



gagggagagg 

gtgagataga 

gggagttaag 

aaggggttta 

aggtgggggt 

taattttttt 

ttttcgtttt 

ggaattcggc 

ggtaggtttc 

tttgtttttg 

ttaaggttgc 

gggtttcgga 

tttttttttc 

tttttatgtg 

gtttttttat 

ttagtggttg 

tttttttcgt 

gaaatatata 

tatttttaga 

aatgggggtt 

agtcggggac 

attatttatt 

taggtagttt 

ttatagatat 

cgggtagatg 
tttttagtag 

ggcggtcgcg 
atatatatat 
tttatagttt 
ggtcgttttt 



2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4435 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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seq_prot.txt 

tatttttagt tattagtttt tgtttttttt ttgggataag ggagtgaggg gttgaaaata 1860 

aaaagggagg aagaagttag gttggggata ggtaggtagg gatttagaat tcggtatgga 1920 

gatattagtt ttagagttag attttagggt tgggggttgt tgttttttgt tttcgttttg 1980 

ttttgaggtt ttggaaatag gaggtttttt tttgatttta gtgttcgtta tattaagatt 2040 

taagttgtgg tttttattgt tttagaaggt taggtttgaa aattaggttt cgattttggt 2100 

tttaggagtt ttatttttta gttttttttt ttttagattt aggagttttt ttttttattg 2160 

ggatttagga atttggggat cgtttttaag tcggagtttt ttttaaggat ttagttattt 2220 

cgattttttt tcgtataagt tatttttagg gatttaagat tcgtggatgt tatttttggg 2280 

tatttcgttt ttttagtttt ttgaagaaat taatacgttt gtattttttt ttttattgag 2340 

ggattttggt ttrtattatt tgtgggtatt ttattattcg gaagggaagt tacgtttgtt 2400 

tttttataga tgtaggagtt tagattatag gttgtttttc gggggtttta tatttagatg 2460 

tttagtattt tttttttgtt agagggttgg gttcggtatt tttataaagt tattcgtggg 2520 

gttttggggt agagcgtggg gtaggatatc ggggtttttg agggtggaag agggtgtgtg 2580 

tgtgtgggta gggtagaaat tttggagtag gggggaagtt agggttaggg gtttttgagg 2640 

tagtttgggt tagagttttg gttttgagtt ttttgggtga ggtttttgtt ttttttattt 2700 

ttaaggtttg ggagaatagg gtgaggtgta tatttttgaa aaagaagaga attagggttt 2760 

aaatttgtgg gtcgtaaagg gagaaattgt agatttagat ttttaaggtt aggatgtttg 2820 

atttataaga tggtgggggt gggattttaa gttttaagga atagcgtttt aggagggaaa 2880 

agggttgggg ttttgaattt ttgggtttga gggaggaggg gttggggttt tggatttttg 2940 

ggtttgaggg aggaggggtt ggggttttgg atttttgggt ttgagggagg aggggttggg 3000 

gttttggatt tttgggtttg agggaggagg ggttggggat ttggattttt gggtttgagg 3060 

gaggaggtgt tgggggtttg gatttttggg tttgagggag gaagggttgg gggtttggat 3120 

ttttggattt gggggaggag ggtttggggg tttggagttt tgggtttgag ggaggaggtt 3180 

gggggtttgg atttttgggt ttgagggagg aggttggggg tttggatttt tgggtttgag 3240 

cgaggaggag ttgggtttgg atttttggat ttgagggagg aggggttggg ggtttggatt 3300 

tttgggtttg agggaggaga ggttgggggt ttggattttt gggtttgagg gaggaggggt 3360 

tggggattta gatttttggg tttgacgaac gggttggggt tttgagtttt gggtttaggg 3420 

gaggaagtag gtggggtatt ttttttgttt tttgggggat tggagttaga gtgtttgaaa 3480 

gttgtaaaag ttagtagttg agttttcggg tttttgtagg attttttaag tttagagggt 3540 

agtttttata tttttgtaag ggttgaggtt ggtgtattgg attgcgtttt ttttagagat 3600 

atagattgtt tgggcgaatc ggtgttggat ttggtatttt tcgaattttg cgattttttt 3660 

aagtttttat ggggtagttt tgggggtgcg tttagcgtcg ttacggtaaa gtaggaggtg 3720 

ggcgataggg ggcgtcgggg atttaggttt cgttttcggg ggcggagttt tttttttcgg 3780 

attttttttt ttttcgtttt tttttttttt ttcggtttta gtttcgtcgt tagttttagt 3840 

ttttgttttt ttttttaaag tttttttttc ggagcggagc gtatttaggg tttttttttc 3900 

gtttttttag tttagttatt cgtttagatt agtagtttcg gggggtattt tttcgttagt 3960 

ttgttttttt ttcgtttagt tttgttaggg ttttttagtt atgaattttt ttcgattttt 4020 

gggagatttt ttttattttt tcgttattat tttgttattg tttaaaattt ggaagtttcg 4080 

ttcgtgcgtc ggtgagaggt ttattagggt tggcgaagga gagagattgc gggcgggggt 4140 

tgggggacgt cgtaggattt aggaagggtt tgggggtatt aggggtatag gttttatttt 4200 

tattgtttgt tggggatttt ttttttaggt tttgggtttt taggggatta gttttttttt 4260 

ttagattggc gggggtgtat atagttttat gtgaatggtt ttaggttttt aatttagtta 4320 

tttttagttt taatttgggt agttcgaggg tttatggatc gggaataggg aattatggta 4380 

ggcggggagg tggttgtgag ttggtgacgt ggatcgtggc ggttaggaag ggaat 4435 

<210> 254 
<211> 6001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 254 

gggtatagtg tgaggaagaa atatggaaaa gatagatttt ttttagattg aaaaggagat 60 

tgtttagggg cgggaggaag ataaatagag gttagtgggt tttgaggttg attgtatgtg 120 

tgatttgtgt ttttgaaata ttattttgga aattgtagga ttttcgggag gaatggttgt 180 

aggggatgtt ttagtagatg agataatagt tatcgttatt ttattttaaa tttttgtgtt 240 

tttagtggga tatagaagta gtaggttgtt tattaattta agtagttata ttagtttggg 300 

tagtggaaat aatttagtta tattttttgg gaatagagga ttaaatggat gtgttgtttt 360 

tttttttaat ttattatatg ggattgtgta tagttttggt gttaggaatt aattttagga 420 

aggatgaagg ttgatttttt tagtttttag tagataagtt gttaggggta gttattaata 480 

tataataatt gagttattta cgtaaaataa tggatatgtg atgttttttg tatgtttgag 540 

tttttgggtt gagatttatt ttgttttgaa tgtcgcggtt tttttaatga agttgttgag 600 

gaacgtaggg gttttgttta ttttgttttt ttttggaaat tttgtttttt aatttttaga 660 

tttagagtag tgtttttgtt tttagtttta tatcgtttat atttttatta ggaaaagtaa 720 

ataaaagtta aagttttgtt tagttataag ttttttttaa acggggtagt tggattgtait 780 
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atatttttgg cgtattaatt ttatttagat agggaaaata tttttatatt aaaagaaatt 840 

aggttaaatt atggtagaga atgtaaaatt tatagtttaa aaatgatgaa attttagatt 900 

ttaaaggaat tttttttttg tagggtaggg ggagggtatt ttgttttaga attttatgcg 960 

ggtttttatt ggagtttttt tgagaaggat atttttgtgg aaaagtttga gtagttttgg 1020 

ttttgcgttt ggtttggtta ggggttattt ttttttggtt tgggttgtag cgttttatgg 1080 

gtatcgcgtt tttgttaata tttttttacg tttttaaatt tatagtttgt tagcgcgggc 1140 

gtaatttgag agttgtcgta ggtttttagt ttattrcgttt tttaaaggag gaaatggaga 1200 

attagcgatt ttaaaggatt tgtttggcgg gtatttacgt tttttaggtt atcgtttttt 1260 

tttgcgtttt tggttatttt cgttttttaa ttttgtagga atttaggatt tacgtgtaaa 1320 

tataaagaat cgtatttatt tattttcggt ttttttttat tttgcgtttt taattttggg 1380 

ggggtttttt ttttttattg gggtatttga cgttcgtggg gggtgtttag aaaagtttgg 1440 

tagattagtg attggtggtt ttggaataaa tatattatag tttaaattgg gggttggtgg 1500 

tggagaggtg ggtggatggg ggttataaat ttgttcggta attgtttttt tagttaagag 1560 

agaggagttg aggttttttg gggttggagg attggaatcg gttagattgc gggtttaagg 1620 

ggcgaaggta ggttggtagg ggtagttttt tttcgtcgtt tataattttc gggcgggcgc 1680 

gtagtcgagt cggttcggtt ggttggcgta atttcgcgcg ttttttgtat tgattaaaaa 1740 

tgggggttga aatagtaaac gcgaggagga gtaattgttt cgattcggtt tagaagcgcg 1800 

attaatgggg atgtgagttt tttcgcgcga attaattagc gtagggtttg cgatagtacg 1860 

ggttaatggg gcgtcgattc ggcgtaggaa taaggcgggg gttcggggtc ggttgtagat 1920 

ttttatcgta gcggttagga acgttagtcg tttacgcgtt cggttttttt tggttgattt 1980 

atcgttttgg tcgtcgtatt atggacgttt ttaggtaggt ggttaatttt gggtttggtt 2040 

tcgttaagtt gtcgtattta gtaagttttc gcgagcgggc gtcgggagtg aggtttaggc 2100 

gggagtacgt acgcgggtgg gtttgtattt ttgcgtgtgg tagtcggatt ttcgtttttt 2160 

gttttgagtt ttttaggcgt tttgtattag cgtgtatagc gggattagta gtttcggtaa 2220 

gcgggtttcg ggaagaatgt agttggtgag gaagttcggc gaggcgtgtt cgtgtagttg 2280 

tttttggttt tgattgttgg tgcgaggtag tgtacgattt agttggtcgg ggtttgttgt 2340 

ttcgtcggtg ttacgtattt gtagacgttc gggttgtgtt attttttggg tcggttcggg 2400 

ggttggggcg gggcgaaaaa gaaaaagttt tgatttttgt tttcgtttcg cgtagttgtg 2460 

cggcgagttc gggtagtgtg gagcggatgt atgaatggat atagtgtgaa tgagtgatga 2520 

acgggaatga atcgatgata ggttttgtat atgtagtgga ttacgttagt tgaaagggat 2580 

tgtaaattta aaggttcgcg tgtgcgtgtt tttttacgtt ttatattagt gtttgtttta 2640 

gttagtattt aggaatattg ttttttagaa atgtaaattt gaaaattttt agtttagttt 2700 

tttttttttt gttttttata gaatttattt ttttttaaag tttttttttt tatttttttt 2760 

ttttttttaa tttgtttgtt ttttaaaggc gttcgagtat tttaattttg gtagattttg 2820 

ggttgtttaa tttaattttt tgtttatttc gttttggagt aaatcgagtg attgaatttt 2880 

ttttcgggtt ataattagat tttgttggtg gaatttattt tttgtttttt taggttttag 2940 

ttcggttttt tttttttttc gttaaattta tcgtttttag ttttattcgt agtgaagaag 3000 

gtaaagtttt cgatgtgttt tgagtttatt gttaggcggt tgtcgtcgtc gttagatttt 3060 

tattttttta attaggatag agttgataat atgttggagt agtatgaggt atagttaagt 3120 

attttataat tattaattgt tgagtagagt agaaattttt ttgggataga gttttttttg 3180 

tgttgtggta agaatagaga atttaatttt aaaggggaaa ggatttttta tttttttagg 3240 

ggtagcgttt atagtagttg agaggatagg gtttattttt ttttagtggt tatagtttat 3300 

tttaggcgag attttttgtt ttagttgtgg agatgttttt tgtagttttt ttttgtattt 3360 

tttatgtttt ggagtgtttt taacgtttgt tttttatgta tttcgttatt aatttattat 3420 

tataattgta atggtaattg ttattagtaa tataattatt tgttattaat ttttttggga 3480 

ggatttttgt ttttggattg tatgtaattt ggggggtagg agggtgaggg ggtaggtaga 3540 

tgttgttttt tatgtatttt tttgatttag ttgaattgta aatattagag aagtttttaa 3600 

attttttttg attgtaattt atagtgaaaa atatgtttat aaaatgataa agtatatata 3660 

tttaattgta ataaaaataa atgttttatg aaatgatgtg tagtttaatt atttgtgata 3720 

tatattattt tgtattttat tttgttttag tttgttttgt tttattaaaa aaattagtcg 3780 

tgagttgggt gtggtggcgt acgtttgtaa tttaagtaat ttaggaggtt aaggtgggag 3840 

gattttttga gtttaggagt ttgaggttgt agtgaattgt gattgtttta ttgtatttta 3900 

ggttgggtga tagagtaaga ttttgttttt taataaaaaa aaaaaaaaaa aaaaaaaaaa 3960 

aattggttat gatttataaa agtaatgtat tttttaattg gaaaaaaaaa aatttattgg 4020 

attagggagg tgtgattttt tagtaggagt ttttaaaatt gtttgtaatt attttggtta 4080 

taaattttta tagaattatt tttaatgaat gttgtttata agaaagatat agttttagga 4140 

gtaagtttaa agattattta tagatgaatg ttttttttta ttaaaagaaa agatttattt 4200 

tattttattt gattagttgg tgttaataag tatttaatat tttagaaaat aataatatat 4260 

ttttagattt agtcgttaag gtagtttttt ttttaattta attttttttt tttttttttt 4320 

ttgagagaag ttaagttttg ttacgaggtt ggagtgtagt cgtttggttt cggtttattg 4380 

aaattttcgt ttttcgggtt ttagcgattt ttttgttttc gttttttgag taattgggat 4440 

tataggtatt tattattaag tttagttaat ttttgtattt ttagtagaga cggggtttta 4500 

ttatgttggt taagatggtt ttaatttttt gattttatga tttgttcgtt ttggtttttt 4560 

aaagtgttgg gattataggc gtgagttatt acgttcggtt ttgtttaatt ttttattgtt 4620 

.aaaatattgt tttttgagat aagttgtaat ttagtttagg ttatagttgt gttttaatga 4680 

tgrtttttttt agtaagtatt ttaaaaaaaa aggtgttttt tatttgtaga ttttttttgg 4740 

ttggttgttt ttgtatttat tttgtgttta tttttaaagt tttttttttt ttttttattt 4800 

tttattaagg gaatt-tttai: atatataaag ataaaagtag tgtaatgaat ttttaaggat 4860 
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ttatttttga gatttaataa ttattaatat tgtgttagtt ttgttgtagt attttgggac 4920 

gaattttatt tatagtaggt tttttttttt ttttgtttat tttacgtttt ttagtttagt 4980 

attttagatt aaaagagatc ggaaaattga gttatttgta taatttgggt ataatttttt 5040 

tatcgttgtt tttttaattt agaattttat gtatttttta ttttagtttt aggagagtgg 5100 

tgaatgagat ttgtgaaggg gatatttttt agggaatcgg taaattcggg tttttggaaa 5160 

aagtgtatta atattagtta tattgtttaa ttttatgttt attttgtgtg taagttgtat 5220 

aatcgttggt tttgtggata tgggaaggaa ggagattggt tgtgggtggg gatggaagtt 5280 

tggggatttt attgtagatt tttttttgtt ttttcgttag gtttgtatgt ttagagggaa 5340 

agtagtgtag aatatattta aataatttta ttttttttta ttttttttta ggtgttgtta 5400 

gagatataaa aggaattatt agattataaa ggagttggta ttagtgtttt tggtaagatt 5460 

tatttttgaa ttttgtgaat gtttatgttt taaaggaagt tttttttttt tttttttttt 5520 

ttagaggtag ggttttgttt tgttatttag gttggagtat agtggcgtga ttatagttta 5580 

ttgtagtttt taatttttgg gtttaagtga tttttttgtt ttagtatttt gagtagttag 5640 

gattataggt atatattatt atatttggtt aatttttttt aaatgtttaa aatatttttt 5700 

aaagatttta tgagatggga ttttattttg ttgtttaggt tgatttgaaa ttcgtggttt 5760 

gaagttattt ttttatattg gttttttaaa gttttgggat tataggtttg agttattgag 5820 

ttttggttta aagggaatta atttaaatgt attaaatata gaattatagg gtttataggg 5880 

tttttttata ggtttttata gatatatttt taaattttaa tagtttgggt tgtatgtttg 5940 

atagaaatta ttttgaaaat agtataaatg ttggtgagag gttttgatat tttagtttat 6000 

t 6001 

<210> 255 
<211> 6001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 255 

ggtaagttga gatgttagaa ttttttatta gtatttatgt tatttttaaa atggttttta 60 

ttagatatgt aatttaagtt attaaagttt aaaaatgtat ttataaaggt ttataggaag 120 

attttatagg ttttgtaatt ttatatttga tgtatttaaa ttagtttttt ttgggttagg 180 

gtttagtggt ttaagtttgt aattttaaaa ttttgggagg ttaatgtggg aggatggttt 240 

taggttacga gttttagatt agtttgggta ataaagtgag attttatttt atgggatttt 300 

taaaaaatgt tttaaatatt taaaaaaaat tagttaggtg tggtgatgtg tatttatagt 360 

tttagttatt taggatgttg aggtaggagg gttatttgag tttaggagtt ggaggttgta 420 

gtgaattatg attacgttat tgtattttag tttgggtgat agagtaggat tttgttttta 480 

aaaaaaaaaa aaaaaaaaaa agtttttttt gaaatatgga tatttataga atttaaaagt 540 

aaattttatt aagaatatta atgttaattt ttttgtagtt taataatttt ttttgtattt 600 

ttaataatat ttagaaaaaa ataaaggaaa atagggttat ttaaatatgt tttgtattgt 660 

tttttttttg aatatataaa tttgacgagg ggataaggaa gggtttatag tgaggttttt 720 

aggtttttat ttttatttat agttagtttt tttttttttt atgtttataa agttagcgat 780 

tatgtaattt atatataagg taagtatgaa attaggtaat gtgattgatg ttgatatatt 840 

ttttttaaag gttcgaattt atcgattttt tagaaaatgt tttttttata aattttattt 900 

attatttttt taaagttaaa gtaaagaatg tataaagttt tgggttaaag ggataacgat 960 

aaaaaaatta tgtttagatt atataggtaa tttaattttt cgattttttt tggtttaagg 1020 

tgttagattg ggaagcgtgg ggtgggtagg aagaaggaag agtttgttgt gaataagatt 1080 

cgttttaggg tgttgtaata agattggtat aatgttgata attgttgagt tttagagatg 1140 

ggtttttgga ggtttattat attattttta tttttgtgta tgtgaggatt tttttaataa 1200 

aaaataaaga gaaaaagaaa agttttaaaa atgaatataa aataagtgta aaagtaatta 1260 

gttagagaag gtttatagat aggaggtatt ttttttttta agatgtttgt taaggagggt 1320 

attattaaaa tatagttata atttaaattg gattgtaatt tgttttaaaa aatagtattt 1380 

taataataaa gaattagata aggtcgggcg tggtggttta cgtttgtaat tttagtattt 1440 

tgggaggtta aggcgggtag attatgaggt taagagattg agattatttt ggttaatatg 1500 

gtgaaatttc gtttttatta aaaatataaa aattagttgg gtttggtggt gggtgtttgt 1560 

aattttagtt atttaggagg cggaggtaga agaatcgttg gaattcggga ggcggaggtt 1620 

ttagtgagtc gagattaagc gattgtattt tagtttcgtg atagagtttg atttttttta 1680 

aaaaaaaaaa aaaaaaagaa ttagattaaa aaaaagattg ttttgacgat tgggtttaag 1740 

aatgtgttat tgttttttga aatgttggat gtttgttagt attagttgat taaataaaat 1800 

agagtgaatt ttttttttta gtaaagagaa atatttattt gtgaataatt tttgaatttg 1860 

tttttgagat tgtatttttt ttatagatag tatttattga aaatagtttt ataaaagttt 1920 

ataattaaaa taattataaa tagttttaaa gatttttatt gaaaaattat atttttttaa 1980 

tttagtaggt tttttttttt ttagttgaag aatgtattgt ttttgtggat tataattaat 2040 

tttttttttt tttttttttt tttttttgtt aagagatagg gttttgtttt attatttagt 2100 

ttggaatgta gtgggataat tatagtttat tatagtttta aatttttggg tttaagagat 2160 

-tt-ttttat-tt t^gt-t-tttt^ tatttagttt -2220- 
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acgattaatt tttttaatga aatagaatag 
atgttatagg taattaggtt atatattatt 
atgtgtatat tttgttattt tgtaaatatg 
tttgaaagtt tttttaatat ttataattta 
atttgtttgt ttttttattt ttttgttttt 
tttttagaaa aattaataat aaataattgt 
agtaataaat taataacgaa atatataggg 
Ogggtgtagg aggaggttat aggaaatatt 
aatgaattgt aattattgag aaaaaataag 
ttttagaaaa gtaagaagtt tttttttttt 
atagaggagg ttttgtttta gggagatttt 
tatttggtta tgttttatgt tgttttaata 
aaaaatttaa cggcggcggt agtcgtttga 
ttttttttat tgcgggtggg gttggggacg 
gttggggttt gggaaggtag agagtgagtt 
gagaatttaa ttattcggtt tattttagaa 
ttaaagttta ttagagttag aatattcgaa 
ggaggggggt ggggagaaag gttttaggaa 
aaagttggat tgaaggtttt taaatttata 
ttaaagtaaa tattggtgta aggcgtaggg 
aatttttttt agttaacgtg atttattgta 
tttattattt atttatatta tgtttattta 
gtatagttgc gcgaggcgaa ggtagagatt 
tttcggatcg gttitaggaga tggtatagtt 
gatagtaggt ttcggttagt tgagtcgtgt 
gtagttgtac gggtacgttt cgtcgagttt 
tttgtcggag ttgttgattt cgttgtgtac 
tagggagcgg gaattcgatt gttatacgta 
cgtttgaatt ttattttcgg cgttcgttcg 
ggattaggtt taaagttgat tatttgtttg 
tgagttagtt aaggaggatc gaacgcgtga 
agtttgtagt cggtttcgaa ttttcgtttt 
tcgtgttgtc gtaggttttg cgttaattgg 
tcgcgttttt gagtcgagtc gaagtagttg 
atttttgatt aatgtaagga gcgcgcgaga 
cgcgttcgtt cgaggattgt gggcggcggg 
tttttgagtt cgtaatttgg tcggttttag 
ttttttggtt gaaagggtag ttgtcgagta 
attattagtt tttagtttgg gttgtgatgt 
attaaatttt tttaagtatt ttttacgggc 
ttttaaggtt ggaagcgtag gatgaagaag 
atttgtacgt gggttttgag tttttgtagg 
ggaggagcga tagtttggga agcgtggatg 
ttttttattt ttttttttgg gaaacgagtg 
cgttcgcgtt gataagttgt gagtttagaa 
tttatagggc gttgtagttt aaattagaag 
gttagagttg tttaaatttt tttatagaag 
tcgtatgggg ttttgaaata gaatattttt 
aagtttggaa ttttattatt tttaaattgt 
tagttttttt taatgtaaaa atattttttt 
tatatagttt aattgtttcg tttagagaag 
tttgtttttt ttaatgagaa tgtgggcggt 
atttgggaat tggaggataa agtttttaga 
ttttagtaat tttattaaga aaatcgcgat 
atttaggtat gtaagaagta ttatatattt 
atattagtag ttgtttttag tagtttattt 
tttttggagt tagtttttaa tattagggtt 
agggatagta tatttatttg gttttttgtt 
gtttaagttg atgtggttgt ttgggttgat 
gggtatagga atttggggtg gggtggcggt 
tgtagttatt tttttcgggg gttttgtagt 
atatatatag ttagttttag gatttattga 
aatttttttt ttagtttaga gaaaatttgt 
t 

<210> 256 

<211> 4001 , f x , . . • 

<212>: .DNA: ; ; i 
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attagagtag gatagaatat aaaatagtgt 2280 

ttataaggta tttatttttg ttataattga 2340 

ttttttattg taggttatag ttaaaagaag 2400 

attaaattaa ggaaatatat aaaaagtaat 2460 

taggttatat gtagtttaag ggtaaaaatt 2520 

attattgatg ataattgtta ttataattat 2580 

aataaacgtt gggaatattt taaaatatgg 2640 

tttatagttg gagtagggaa tttcgtttaa 2700 

ttttgttttt ttagttattg tgagcgttgt 2760 

taaaattgga ttttttgttt ttattataat 2820 

tattttgttt aataattgat aattgtaaaa 2880 

tattattagt tttattttgg ttagaaaaat 2940 

taatgggttt aaggtatatc ggagattttg 3000 

gtgggtttgg cgggggaggg gaggagtcgg 3060 

ttattagtaa gatttgattg tggttcggga 3120 

cgaaatgaat agggaattgg gttaagtagt 3180 

cgtttttagg ggataagtag gttaaaaaaa 3240 

agggtaggtt ttatggggga taaggagggg 3300 

tttttgaaaa gtagtgtttt taagtgttgg 3360 

gggtacgtat acgcggattt ttaagtttgt 3420 

tatgtaaaat ttgttatcgg tttattttcg 3480 

tgtattcgtt ttatattgtt cgggttcgtc 3540 

agagtttttt ttttttcgtt tcgttttagt 3600 

cgggcgtttg taggtgcgtg gtatcggcgg 3660 

attgtttcgt attagtagtt agggttaggg 3720 

ttttattaat tgtatttttt tcgaagttcg 3780 

gttgatgtaa agcgtttagg ggatttaagg 3840 

gggatgtaaa tttattcgcg tgcgtgtttt 3900 

cggggattta ttgagtgcgg tagtttggcg 3960 

ggggcgttta tggtgcggcg gttagggcgg 4020 

acggttggcg tttttggtcg ttgcggtgag 4080 

gtttttgcgt cgagtcggcg ttttattggt 4140 

ttcgcgcgaa ggagtttata tttttattgg 4200 

ttttttttcg cgtttattgt tttaattttt 4260 

ttgcgttagt tagtcgagtc ggttcggttg 4320 

aagaggttgt ttttgttaat ttgttttcgt 4380 

ttttttaatt ttagaggatt ttagtttttt 4440 

gatttgtagt ttttatttat ttattttttt 4500 

atttgtttta gaattattag ttattaattt 4560 

gttagatatt ttagtggaga gagaggattt 4620 

ggtcgggggt gggtggatac ggttttttgt 4680 

attaaagaac ggaggtggtt aaggacgtag 4740 

ttcgttaggt aagtttttta aagtcgttga 4800 

agttggaaat ttgcgatagt ttttaggttg 4860 

acgtgaggag atattaataa gaacgcgata 4920 

aaggtggttt ttagttaggt taggcgtagg 4980 

tgtttttttt aaaagaattt tagtggagat 5040 

tttttatttt gtaagaaaga agtttttttg 5100 

gaattttgta ttttttatta taatttaatt 5160 

tgtttgagta aaattgatgc gttaggaata 5220 

gtttgtaatt ggataggatt ttggtttttg 5280 

atggggttga gagtagaggt attgttttgg 5340 

aaaaggtaaa atgaataagg tttttgcgtt 5400 

atttaagata gaataaattt tagtttagaa 5460 

attattttac gtaaataatt tagttattat 5520 

attggagatt aagggagtta gtttttattt 5580 

atatatagtt ttatgtagtg ggttgggaga 5640 

tttaaaagat atggttgagt tgtttttatt 5700 

gagtaattta ttatttttgt attttattag 5760 

ggttattgtt ttatttgtta agatattttt 5820 

ttttaagatg gtattttagg gatataagtt 5880 

tttttgtttg tttttttttc gtttttgggt 5940 

tttttttatg tttttttttt atattgtgtt 6000 

. , 6001 
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<213> Artificial sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 256 

ttttattttt tttatatttg tgttatatat taaaaaatta tatattagag aatttaaaag 60 

atttgttaaa gttgtttaga aagtgatttg atgttaagat ggtgtttttt tatttgtatt 120 

atttttatag tttagaagta tattattaat ttatgttttt ttttataatt tggagttata 180 

tatggattat gttgttttag ttattttgtt taatttgatt tttaatattg ttattgattt 240 

tatttttaag tgttttattt taaattatag agtgttaaag atgttggatt atttttataa 300 

tgttattttt aagtttggtg aattaagaaa ttgttagatt attttgtgta atattataat 360 

ggtattgttg taatattata tgttaaattt agaatgtgta aagattagaa gttgatagta 420 

aattatttta ggatttataa tttgtaaaaa tgaagtttgg tgttatgaat tatgttgttt 480 

ttaaaagaaa tgtaagttta tttttgtaaa tttttttttt gagaaagtag attattttga 540 

tgtgttttta agtaagacgt atattatttt ttaagttagt tatttttgtt atttagtttt 600 

tattttgagt ttttatgtaa atttgggttt tatattaaaa agaaatgtat attgtaattg 660 

aaaatttgtt ttggaatata attaattgtt tttttaatat tttttttata tattattttt 720 

tttagtagga aatggtaagg aatatgattt atttaaaatt ttaatgattt ttattgtaaa 780 

aaaaataata aatttaagaa aattaaagtt gaaattaata tgaaaaatta cgaatttttt 840 

ataaatgatg attatttttg ttttttttaa ataatttaag aatatagtaa tgtattatat 900 

gtaatttggt tgattaatat tatgaataat tgaattggta ttaagttaag taattattta 960 

aaagatattt ttagaaattt taaaagattt tttttttaaa gtaaaatatt tgatttttga 1020 

ttgggaaaaa gattgaagag tttaagaaat aaaatattta tagagatgaa agggagtttt 1080 

tgtattttta agggatttat atagtgttag tttatagttt tttatacgtg tttatagttt 1140 

attttttaac gttaataagg agggataaaa aaggtttttt attttttagt gtttattttt 1200 

taattagtta ttaattaatg ttaatgttaa gataattaag gttagttgta aaaaagtttt 1260 

tagaaattta ttggaattaa ttaattagat ttgggtaagt gatttattat tgtaaattta 1320 

tgaataatag ttagagaaat attattaatt ttgatttttg tagtatttta ttaaagaaga 1380 

aaaatgaaaa taacgatatt taagaaattt tatgttatta tttaagtttg tatgagaata 1440 

tttatttaaa agataattag aagttatagt tatttagttg aataagtgaa ttattttatt 1500 

atgggttagt ttgaagatat agttatatat tatttaaaaa tggataataa aataatatta 1560 

taatgtttat ttgttttttg tttggttttt ggaagttatt taaaagaacg tttttttaat 1620 

gatatatagt aatttttgag gaaaattttt agattttgat aggtatttaa gtattttagt 1680 

aggagatatt gggtttttgg tttttgttag ataatatttt attttatgat tatagttaaa 1740 

ttttatagat ttttttttaa tttaattata agtgttgaaa agagaatgag taaaagtttg 1800 

ttttagtttt tattaaggta tatgtagaga ttttggaatt ttttttatat ttttaaatat 1860 

tttatttttt aattaaagtt ttttaaaatt attttattta tattattttt ttttgggttt 1920 

tttgtaggat gtgtttaggg cggttgattg tggattcgaa gagggatttt agtttcgttt 1980 

tatgagtttt tagtaattgt agttaatgaa gtttttattt gtttttatat tagtagagtg 2040 

tttttatttg tatttaagtt ttgggattaa taggaaattt tatttaatga ttgaagggta 2100 

tttggtgtaa ttattatgac gttaattttt attatgacgt taatttgatt ttattttgtt 2160 

ttaattttta ggttattttt gtttggagtt atttttaggt gatggtgacg ataaggttag 2220 

tttatgaaaa gagagtacgt tgttattttg tacgtaattt tattgagtta tatatataat 2280 

attttgaaag ttgatatttt tttataaata agagatattt aattagttaa gtatttgata 2340 

tttatttaaa taagagttat agattttcga aaatagatat agagtatttt taggtttata 2400 

ttattattta attaggaaat agaaaaataa aatatatatt tatttagatt tttttaaaat 2460 

taatttgtat ttgaatttaa taaagatgta tgagaaaggt atgaggtaat tttaagaagg 2520 

attgaatggt ttttaatata cgtgtttggt ttttggtttt tttttttttt ttattttatt 2580 

tttgtaaagt ttagtttttg tttgaataga aaatatgtat gttatggttt gttaggtgtg 2640 

ggatatttag ttttaaggtt tttaatttag tgttatttta aaatttgtat agttttttag 2700 

attattgtgg tttttttttt gtatatagag attatgattt tttaaaagtt tttattttta 2760 

tttatttcgt ttagtgttat tttcgtaagg aattggttta gatatagatt attaaggatt 2820 

tttttaataa agagtttaga agatttatgt gatgtaaaaa gtaatttttg tttaatataa 2880 

agaaatttgt taaagaacgt aaagttgttt tgtgatttat tttttgtgat aaataaattt 2940 

gtttaggttt ttatattttt ttagattttc gttgtttttt tttagtatgt attttaatta 3000 

tttatttttt tttagtgatg tttttttttg ggggacgagt gggggagacg gaatttcgtt 3060 

ttgtttttta agttggagtg tagtggtata attttattgt agtttatatt tttcgggttt 3120 

aagtattttt tttgttttag ttttcgagta gttgggatta taggtgtgcg ttataatgtt 3180 

tggttaattt ttgtattttt ggtagagata ggagttttat tatgttggtt aggttggttt 3240 

aatttttgat tttaagtgat ttatttattt cggtttttaa aagtgttggg attataggta 3300 

tgagttatcg tgtcgggttt ttttagtgat gattgatgtt attattattt aagaattttt 3360 

atatttaaat attaattaat tggtagtata tttttaatgt agtagattta ttttgaaaat 3420 

ttttttattt aggtatttta tttttttttg ttttattttt ttttttttat ttgtgttttt 3480 

tttatatttt tttatttttt ttttatttga atgattatat agaaataatg tttttaggat 3540 

gtttttatta aattattttt gtttggttta tgattttata gtggatttta tttttatttt 3600 

.atatttttat- ataagaatta-Xgaatattag ^gaatti^tta-tgaaaatgtt-^attg^gt-tt — 3660- 
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ttaaagttta 
agttagtgtt 
taatgttttt 
atatttaagt 
tagaggaatt 
ttttgtattt 



tatattgttt 
tttaatattt 
agttttttat 
ttttattttt 
tttgtttttg 
tttataataa 



tttaattttt 
gatttatttt 
ttgtgttttt 
gttgaaattt 
tgtttagaga 
atatatatta 



seq_prot.txt 
ttagtgatgt ggttatataa 
ttttattttt ttaattattt 
gtagatattt tatgtatata 
tttttaaaat aataaataga 
atttttgttg gtatggtata 
tttttataat t 



ataatataaa 
tatttagaag 
taaataaatt 
atataaaaat 
aattatattt 



<210> 257 
<211> 4001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 257 



aattataaaa 
ttaatagaga 
tattttaagg 
agagtatttg 
aggaaatggg 
tatattatta 
ataagaattt 
ttatgaaatt 
tatgtaatta 
aaagaataga 
tatattgaaa 
tggtattaat 
tttggaagtc 
ggtgaaattt 
gtagttttag 
tgtagtgaga 
tattcgtttt 
aaggaaataa 
aataaattat 
ttttttgtat 
ttaagttagt 
agggttataa 
ttagggtgat 
tatatatatt 
aaaaattaaa 
tatttttttt 
aggtgtgtgt 
tatatttatt 



gtaatatatg 
ttttttaagt 
agaattttaa 
tagaaatata 
gaagataagt 
aaaaaattaa 
tttaatattt 
ataagttaaa 
tttagataaa 
ataaagaaga 
atatattgtt 
tattattagg 
gaggtgggtg 
ttgtttttat 
ttattcggag 
ttgtgttatt 
ttaaaaagga 
cggaaatttg 
aaaataattt 
tatatagatt 
tttttacgga 
tttttgtgtg 
attgggttgg 
ttttgtttag 
aattaggtac 
atgtattttt 
tttatttttt 



tttgttgtag 
ataaaggtag 
tagggatgga 
agtaagaaat 
taggtgttaa 
aaaatagtat 
ataattttta 
taaaaataat 



aaagaaataa 
gtgaaatgtt 
agttggttga 
aaggttcggt 
gattatttga 
taaaaatata 
gttgaggtag 
gtattttagt 
gtattattaa 
aaaaggtgta 
tgcgtttttt 
ttttaaattt 



taaatgtttt 
aaaattacgt 
tatttgaaaa 
gacgttatgg 
ggaatttttt 
taggtgagga 
tttcggattt 
atgaatgaaa 
agattttaaa 
tttttagtat 
taagatgtta 
tgttaggatt 
gagtggtttt 
atttttaagt 
ttagttgaat 
atagtagtat 



tatttgttta 
ttttggttat 
taaaagattt 
agaattatgg 
gtagatattt 
ttttagaaaa 



ttcggaaatt 
ttgtttatag 
ataaagtgat 
tggttttaaa 
taaaaattga 
gttgatttta 
ttttattaat 
atagttaatc 
taattttggg 
gtttttgtat 
ttatgattga 
tttaatagaa 
taaaaatttt 
taggagttag 

gatgtatagt 
gattatagtt 

ggaatttttt 
aagttggtkg 

agtttaattg 
tttagtatta 
tttttgtttt 
gttgatatta 
tgttttttaa 
ggaatttttt 



agtggtattg 
taaaaaagaa 
gaattttgga 
gtagaaattg 
gtatattgag 
attaaattta 
tgttttttaa 
tatagttttt 
gaaagtgtta 
agcgtatttt 
taaaaatgat 
cgttatggta 
gggtttagat 
tgtagttatt 
gttttgaata 
ggattttaat 
atattttaat 
attagaagga 
gttagaaatt 
ttttaagaat 
atagagggta 
t;atattttta 
tttgattatt 
gaatgtcgtt 
tattttttta 
attagtttta 
gtattaattg 
tttttattag 
tatgggtttt 
gttttttaat 
aaagtttttg 



aaagtataga 
gggttttttt 
gatttaaata 
tggaaatatt 
aggtattgat 
gtaaatttta 
tatagaaatg 
ttaataagag 
aaggatgtaa 
tagatggaaa 
tatttaaatg 
acggtggttt 
ggttaggggt 
aaaattagtt 
gagaagtgtt 
ttgggagata 
gaaaaggtga 
agaatttaaa 
ggtaagtttt 
tttgttgaag 
agcggagtag 
gttatagtag 
gttaagtgtt 
agttttataa 
aattatttag 
gatgtaaatt 
ttaaatagtg 
gtttaggtaa 
gtttttagga 
ttttttatgg 
ttaagggttg 
attatattaa 
gtaggtggaa 
gagaatttat 
tattttgtaa 
tgaggagtaa 
aagaattgag 
agtttgtaaa 
taatgttttt 
tattgtatgt 
agtagatatt 
agttggttta 
ttttgaatag 
gtttttattt 
gttattattt 
atgagttttt 
gtggttgatt 
cgttgaagaa 
ttaggaatgt 
ttttttttta 
aaaatatttt 



gaagtataat 
aatttttgta 
taatttgttt 
gtttttgagt 
ttttgtatta 
aaaatataat 
tgaaataaaa 
tattttagaa 
agaaggtata 
gatttttaaa 
tagaaatttt 
atgtttgtga 
taaattagtt 
aggtattatg 
tgaattcggg 
gagcgagatt 
atggttggga 
taaatttgtt 
tttatgttaa 
aggtttttgg 
ataaagatag 
tttgaagagt 
ttatatttgg 
aagtgaaatg 
ttttttttag 
aattttagaa 
gtataagttt 
atattaggta 
tgttatgtgt 
gttgattttg 
aaataagata 
gtatttttta 
atattttgtt 
aagacgaagt 
aaagtttaga 
aatatttgag 
ataggttttt 
atttggttgt 
tgttgagatg 
tattggaaag 
atgatattgt 
tgataaagtg 
atgtttttat 
ttttttttta 
atgaatttgt 
ggaaattttt 
gggaaataga 
taggttatgg 
aaaagttttt 
gttkaaagtt 
ttaaataatt 
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ttatattata 
ttttgtttgt 
gtatatatat 
agaataattg 
tttatgtagt 
aaatattttt 
ataaaattta 
atattgtttt 
aataaaagag 
ataagtttgt 
tgaataatgg 
ttttagtatt 
tggttaatat 
gcgtatattt 
aggtgtagat 
tcgttttttt 
tgtatattgg 
tattatagaa 
ataagaattg 
tagtttgtat 
gaatttttga 
tgtgtaggtt 
taagttatga 
agaaaaaaaa 
aattgtttta 
aagtttaaat 
ggaaatgttt 
tttagttaat 
atggtttaat 
tcgttattat 
agattaaatt 
attattggat 
ggtataaaag 
taaaattttt 
gaaaggtaat 
aatatgagga 
atttattttt 
gattataggg 
tttgagtgtt 
acgttttttt 
tttattattt 
gtttatttgt 
gtagatttga 
ataaaatgtt 
aatgataagt 
ttgtagttaa 
tattagaaaa 
gtacgtgtga 
ttttattttt 
agatattttg 
atttgattta 



3720 
3780 
3840 
3900 
3960 
4001 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 



seq_prot.txt 

atattaattt aattatttat aatattggtt aattaaattg tatgtaatat attgttatat 3120 

ttttgaatta tttaaaaaag ataagagtgg ttattattta taaaaggttc gtggtttttt 3180 

atgttaattt taattttggt ttttttaagt ttattatttt ttttgtagta agagttattg 3240 

aaattttaag tgagttatat tttttattat tttttgttga aaagggtagt gtgtgagaaa 3300 

aatgttagaa aagtaattaa ttatgtttta gagtagattt ttagttgtag tgtatatttt 3360 

tttttaatgt gagatttaaa tttatatgaa aatttaaagt aagagttggg taatagaaat 3420 

ggttaattta gaaggtaatg tacgttttgt ttaaaagtat attaaagtaa tttatttttt 3480 

taaagagaaa atttatagga gtgaatttat attttttttg agaatagtat ggtttatagt 3540 

attaaatttt atttttatag gttgtgaatt ttagagtagt ttgttattaa tttttgattt 3600 

ttgtatattt tggatttggt atataatgtt atagtagtgt tattgtaatg ttgtataaag 3660 

tagtttagta attttttggt ttattaggtt tagagataat attgtagaaa tgatttagta 3720 

tttttaatat tttgtggttt aaggtggggt atttaggggt agaattaata ataatgttag 3780 

aaattaaatt agataagata attgaaatag tatgatttat gtgtgatttt aagttataaa 3840 

ggaggatatg gattaatggt atatttttag gttatagggg tagtataagt ggaaggatat 3900 

tattttagta ttagattatt ttttgagtaa ttttggtaaa ttttttaaat tttttaatgt 3960 

gtagtttttt aatatatgat ataggtgtaa agaaaataaa g 4001 

<210> 258 
<211> 4418 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 258 

agattttacg gaaggggata gggagtcggg ttttttatag gtatttgttg agaaaggtag 60 

gaaggttttc ggttttataa agtggttttg ggtatttagg aagtgttcgg ggtggaagcg 120 

gaagggtttt ttttagacgg ttttattttt tagtatcgat gataggttgg tgatgagtgt 180 

cgttttttgg gtaggagatg tagggtgaga gtggggattg gattttagga tgttgggatt 240 

tttgttatta aatatacggg ggatatatat tgtttggtat atagttggat tttgttaatt 300 

agttttgcgt tcgagaagtt ttatagtatt tttttcgatt ttatagtagg gcgtagttat 360 

attttttaga ggtatttata ttgttttttt tttttgtagg cgttgggttt tttaatattt 420 

tggtaggttt tgatttgttt tttttattag attggggttt tggatggata ggttagtttt 480 

gtttatattt tggatttttt atttaagcgg ggatagttag tgtggtggta ttgaggatta 540 

ggtggttagg gtttttagag tgggtttatt tggtagtagt tatgttgggg ttattattag 600 

gggttggtgt tgagttgggg tgaggagggc gttaggttta ttttagggat gcggaagttt 660 

tgtatttcga tgttacggga tgttatatgg gttatattta gggggatgat gtttttaaag 720 

cgttgtattt cgtgaattac ggtagtggtg tagggtatgt gagtttggtt atttattttt 780 

ggtcgtcgta tttgttttat tacgtcgtcg attttttgtt ggatacggat tggatagata 840 

tgcgttttta taatgggtta gtatttaggg gatatttttt ttttttttgt gttggaggaa 900 

gttaggttta taggagtttg gttacgtttg tgttggaagt ttcgggtgtt ttagttaagt 960 

ttaggggttt ttagttgtat tttttttttt ttagtttttg ttttgggttt tagttgggtt 1020 

tacgttgtat atttaggtgt aggattatga gtaggaggtt ttaggttagc gtggtcgagg 1080 

tggttattat ttcggtaagg aataggttat ttattattat gcgtaggttt ttattattga 1140 

agttgttttt agggtttttt ttggtttgag tagggtcgag aggatattta ggggatagaa 1200 

cggggtagtt tttaaatgat ttttaatttt gtatttgtta gtttagatgc ggttcgtcgg 1260 

gtgatgtatt ggtttaattt tttgtttagt ttttttttat tttttttggg acgtttaatt 1320 

tattattttt gttttttatc gtggtagtta tttttatttt tttttttttt gttaggaagg 1380 

ttttagttag gtttcggggt ggttgggttg ggttttaggt tattttgtgt ttagttagta 1440 

gtttatttag ttgggttagg aaagtttttt ggaagcgtag gattttgtta gttagcgttg 1500 

ggatgtgcgg gaggacgggg atagtattta gtatttatat tagatagaac ggggttttaa 1560 

ttttttttgt gttttgcgtt tatttggatt agttttaggt tttagttatt tttaggaaga 1620 

tttagggttt gtttgttttt attattgatt ttattaagtt tttttttaag tgttagtttt 1680 

tatttttttt ttttttgttt agaggagaaa tttaaaatcg aaatttttaa cgtggacggg 1740 

ggtatagagt ttttggtttt ttttggtgtt ttttgattcg ggtatatttt ttttacgatt 1800 

atgtttgaga tgtttttttt ttttttaggt tttttttata gtggggtttt ttggaatgtt 1860 

tttttttaaa tttatttatg taaattttgt ttttcggagg ttttagttta gtttcggtat 1920 

tttttaggag ttcgttttgt agagattttt cggtttttcg tttcgtattt cgcgtaggaa 1980 

gttcgatttt ttttttagtt ttttttgagt taggtttagt agtttgagga agcgagggtc 2040 

gtcgtattcg aagcggcgtt cgtaggtgag ggaggcgatt acgttgttta cggttttgtt 2100 

taagaggtcg ttggggcgaa aggggcgttt tgggggtggg agatgcgggt aaggggttgt 2160 

ttttttcgtt tttcgttttt ttagttttcg ttttgtgttt ttttgtttat tatttatcgg 2220 

tttggtcggc gaaggcggta taaaggtagg cggttttttc ggttatttat tgttttagcg 2280 

attttttgtt taggtttaag ttgcgtaagg tggatacgga gaagcgtttt tgttcgcgtt 2340 

acgcgggttt atagcgcgat aggattattt ttgggggcgg gacgggtacg tgggcgttgt 2400 

tatgaaggtt. ttggttttat_.^ .24.60 
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seq_prot.txt 

ttcgtagttt ttagttcggt ttagttgggt atagggttta ttttttgttt attztatattg 2520 

tttttttgtt tggggcgggg tttggtttta tttcgttttt gtttattttg attatttttt 2580 

tatttaagga agatttcgtt cgtttcgttt atattgagtt cgtagtatag gcgcggtttt 2640 

cgttatcgtt atttcgacgt attagtttcg tttatcgggt ttttggcggg tttgggtagt 2700 

agtttcgttt ttttttagtt tatagattcg tatttttttc gtgtaggtgg ttttttggtt 2760 

tattgttttt agtttattcg ttggttttta tttttgtttt acgtttagga ttttacgttt 2820 

tgtcggcgtt gtttgggtta cggttattgt ttattcgggg tttacggaaa cgcggttttt 2880 

gttttttatc gtcgtttgtt ttgggaacgc ggttcgaagt ttaggatttg gtagatgggc 2940 

gtaggcgggc ggtcggtcgt gttttcgtcg cgggttatta tcgtttcgcg tacggtcgtt 3000 

agtttattga gtacgattat cggcgtttag gttagttgta ggttgaatac gttttcgaag 3060 

cggcgtcgta attgtagagg gagggttagg gttttttgtt aagttaggat tattttagat 3120 

tataggtttt agttttattt gaattttgga cgattttcgg ggttattagg agtgagtagg 3180 

tggaaggagg agatttagtt ttttgatttt ggggcggggg tgggggttat attttttgtg 3240 

atggaggaat ttagtttgga tgcgttattt aggtatgatt ttgtaagagt tattaaaatt 3300 

gtcgagaggt tttagttagt attttatttt tagatgatgg tttatgtcgg tgagtagtga 3360 

ggttcgagga tttatagtgt aaaaggtttg aatcgggtta ttgtattttt tttattttcg 3420 

atttcgtgat ttaaacggta tttaggatta atttattttt tatttttaag gttttttttt 3480 

ttggtgttag tagaagggat tttgtatttt ataatatatg ttgtttaatg ggtttgtatg 3540 

tttattgtta agtttagttt tatttttagg tttttgtttt attttttttt ggtttttgga 3600 

aaatttagtt ttttatgtta tgtataaatg ttttttttta ggacgttttt taaatttgtt 3660 

tttttttttt agtttggttt ttgatttagt ttgtggttta atttattatt tatgtttgtt 3720 

ggtggtgggg tatttttagg atttttgtcg ttttttagga tttttttttt tatttggtcg 3780 

aagtagtatg gtgtgttttg gaagtttata tgtagtaagg ttgtttagtt cgggtagtgg 3840 

taggggattt ggcgggtagc gtgtagttta gcgttggtgt cggtgtatta ggtttattag 3900 

gagtaggaag atggttatta ttatggttag gggtattagt gtttttagtt ttatggttgt 3960 

tttattatta attgggtttt tttggatata tttggtattt ttattttatt aggtatagag 4020 

gattaggtag gatattttcg gtatatcgag cgcgtgattt tttttttata aagggagttg 4080 

atgatggttt tcgttttttg ttgtgagtga atttgttgtg ttgattgtgt tgttagtggt 4140 

agagttaggt tagggtaggt atgggttgtt ttagaggttt ttgtcgttgt tttttgtttt 4200 

aggtttttat ttagggtagg gtggtagaaa ggtttggtcg gagaagttat tttttttttt 4260 

tattttaagt tttttaagtt tatataggtt tttgggataa ttagggtttt agtggattcg 4320 

gttatttatt ttttagttag gtttatatat tttaatgtag ttataatttt ttttttagaa 4380 

tatgattttg tttttttttt atttttattt gtttattt 4418 

<210> 259 
<211> 4418 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 259 

gagtgggtag gtgggggtag ggaaagggta aggttatgtt ttggaggagg ggttgtgatt 60 

atattagggt gtatgagttt agttgggagg tggatggtcg ggtttattga gattttggtt 120 

attttagaag tttgtgtggg tttggggagt ttggagtggg gagagggggt gattttttcg 180 

attaggtttt tttattattt tattttgggt aagggtttgg agtaggaagt agcggtaagg 240 

atttttggag tagtttatat ttgttttggt ttgattttgt tattggtagt atagttaata 300 

tagtaggttt atttatagta gagggcgaag gttattatta gtttttttta taagggaagg 360 

gttacgcgtt cggtgtgtcg agagtgtttt gtttggtttt ttgtgtttgg tggggtgggg 420 

gtgttaggtg tgtttagagg agtttagttg gtagtgaggt agttatgggg ttagaagtat 480 

tggtgttttt ggttatgata gtggttattt ttttgttttt ggtggatttg atgtatcggt 540 

attaacgttg ggttgtacgt tattcgttag gttttttgtt attgttcggg ttgggtaatt 600 

ttgttgtatg tggattttta gaatatatta tattgtttcg attaggtgag ggaggaggtt 660 

ttggagggcg gtagaggttt tgaggatgtt ttattattag taaatatggg tggtgggtta 720 

aattataggt tggattagaa gttaggttga gaaggggaag taggtttggg ggacgttttg 780 

gggaaggata tttatatatg gtatgaagga ttggattttt taaaggttaa ggaagagtag 840 

ggtaagggtt tggaggtgga gttggatttg gtagtgggta tgtaagttta ttgggtaata 90p 

tatgttatjgg agtataaagt tttttttgtt gatattagaa ggaaaggttt tgg^aatgga 960 

agatgagtta gttttgagtg tcgtttaaat tacgaaatcg aggatgaagg gggtgtagtg 1020 

attcggttta aattttttgt attgtgggtt ttcgggtttt attgtttatc ggtatggatt 1080 

attatttggg aatgggatgt taattggggt ttttcggtaa ttttggtgat ttttgtaagg 1140 

trtatatttgg gtgacgtatt taaattgagt ttttttatta tagaaggtgt gatttttatt 1200 

ttcgttttag gattaggagg ttgggttttt tttttttatt tgtttatttt tggtagtttc 1260 

gggggtcgtt taaggtttaa ataggattag gatttgtaigt ttjggggtigat tttggtttQk ^L320 

taagaggttt tgattttttt tttgtagttg cggcgtcgtt ttggggacgt gtttagtttg 1380 

tagttggt^t ggactgtcggt ggtcgtgttt aatgggttgg fcggtcgtgcg cg^iggtgatg l44p 
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gtgattcgcg gcgaggatac ggtcgatcgt tcgtttgcgt ttatttatta ggttttgggt 1500 

ttcgggtcgc gtttttaagg taagcggcgg tgggggatag agatcgcgtt ttcgtgggtt 1560 

tcgggtggat agtgatcgta gtttaagtag cgtcgatagg gcgtggggtt ttggacgtga 1620 

aatagagata aaggttagcg agtgggttga ggatagtggg ttaggaaatt atttgtacgg 1680 

gggaggtgcg agtttgtggg ttgggagggg gcggggttat tgtttagatt cgttagaagt 1740 

tcggtgggcg aggttgatgc gtcgaagtgg cggtggcggg gatcgcgttt atgttgcggg 1800 

tttagtgtgg gcgggacggg cgggattttt tttgagtgga aaggtggtta gggtgggtag 1860 

agacgaggtg gggttaaatt tcgttttagg taggggagta atgtgggtga gtaaagagtg 1920 

ggttttgtgt ttagttggat cgggttaggg attgcgggag attttgtgga gcgttagggt 1980 

tggagtgggt ggcggagggt ggggttaagg tttttatggt aacgtttacg tgttcgtttc 2040 

gtttttaggg gtgattttgt cgcgttatgg gttcgcgtgg cgcgagtaga ggcgtttttt 2100 

cgtgtttatt ttgcgtaatt tgggtttggg taagaagtcg ttggagtagt gggtgatcga 2160 

ggaggtcgtt tgtttttgtg tcgttttcgt cgattaagtc ggtgggtgat gggtagaagg 2220 

gtataaagcg ggaattggga aggcggggga cggagaaggt aattttttat tcgtattttt 2280 

tatttttagg acgttttttt cgttttaacg gttttttgga taaagtcgtg agtaacgtga 2340 

tcgttttttt tatttgcggg cgtcgtttcg agtacgacga ttttcgtttt tttaggttgt 2400 

tggatttagt ttaggaggga ttgaaggagg agtcgggttt tttgcgcgag gtgcggagcg 2460 

agagatcgag gagtttttgt agggcgagtt tttgagaggt gtcggggttg gattggggtt 2520 

ttcgaagggt aggatttgta tagatgggtt tgggaaagga tattttagga gattttattg 2580 

taagaagggt ttggaggagg aggggatatt ttagatatgg tcgtgggaga ggtgtgttcg 2640 

ggttaggggg tattaggaga ggttaaggat tttgtatttt cgtttacgtt ggagatttcg 2700 

attttaggtt ttttttttgg gtaaggagag agagggtgga ggttggtatt tggggaggga 2760 

tttggtgagg ttagtggtaa ggataggtag gttttgggtt tttttggaga tggttggggt 2820 

ttgagattgg tttaggtgaa cgtagagtat aggagggatt gagatttcgt tttgtttggt 2880 

gtaggtgttg aatgttgttt tcgttttttc gtatatttta gcgttggttg gtaaggtttt 2940 

acgtttttaa aaggtttttt tgatttagtt ggatgagttg ttaattgagt ataggatgat 3000 

ttgggattta gtttagttat ttcgagattt gattgaggtt tttttggtaa agaaggagaa 3060 

ggtgagagtg gttgttacgg tggggggtaa gggtggtggg ttgaacgttt taggaggaat 3120 

gaggggaggt tgggtaaaag gttggattag tgtattattc ggcgagtcgt atttgggttg 3180 

ataggtgtag aattggaggt tatttggggg ttatttcgtt ttattttttg agtatttttt 3240 

cggttttgtt taggttaagg ggagttttga gagtagtttt aatgatgaga atttgcgtat 3300 

agtggtgggt aatttgtttt ttgtcgggat ggtgattatt tcgattacgt tggtttgggg 3360 

ttttttgttt atgattttat atttggatgt gtagcgtgag tttagttggg gtttaaggta 3420 

gggattgagg gaggaagggt atagttgggg gtttttgggt ttagttggga tattcggggt 3480 

ttttagtata ggcgtggtta ggtttttgta agtttaattt tttttaatat aggaggaagg 3540 

agagtgtttt ttgggtgttg atttattgtg gggacgtatg tttgtttagt tcgtgtttaa 3600 

taggagatcg acgacgtgat agggtaggtg cggcgattag agatgggtga ttaggtttat 3660 

atgttttgta ttattgtcgt gatttacgag gtgtagcgtt ttggggatat tatttttttg 3720 

agtgtgattt atatgatatt tcgtgatatc gaagtatagg gttttcgtat ttttaaggta 3780 

ggtttggcgt tttttttatt ttagtttagt attagttttt ggtgatagtt ttagtatggt 3840 

tattgttagg tgggtttatt ttaggaattt tggttattta gtttttaatg ttattatatt 3900 

gattgttttc gtttggatgg ggggtttaga gtataggtag ggttggtttg tttatttaga 3960 

gttttagttt agtggggaag ataaattagg atttgttaga atgttggagg atttagcgtt 4020 

tgtagggaga gggggtagtg tgggtgtttt tgagaggtgt gattgcgttt tgttgtgggg 4080 

tcggagaggg tattgtggag tttttcgggc gtaggattag ttgatagagt ttagttgtgt 4140 

gttaggtagt gtgtgttttt cgtgtgtttg gtggtagggg ttttagtatt ttagagttta 4200 

gtttttattt ttattttgta ttttttgttt agggaacgat atttattatt aatttgttat 4260 

cggtgttgaa ggatgaggtc gtttggaaga agttttttcg tttttatttc gaatattttt 4320 

tggatgttta gggttatttt gtgaagtcgg aggttttttt gtttttttta gtaggtgttt 4380 

gtggggagtt cggttttttg ttttttttcg tggagttt 4418 

<210> 260 
<211> 4398 
<212> DNA 

<213> Artificial Sequence 

.*»-■' 

<220> 

<223> chemically treated genomic DNA Cuomo sapiens) 
<400> 260 

ttgttttaaa aaatatttta aaaaaataaa taaaagaata tttattatta ttaatagaat 60 

aaattgtgat atatttataa aatgaaattt tatagaagaa taagaggata aaaggaatta 120 

attattgata cgtataatac ggattaattt taaaaatatt ttttgagtag ttagatacgg 180 

cggtttatat ttgtaatttt aatattttta gaggtaaaag taggacgatt atttgagttt 240 

aggagtgagt ttatagtgaa ttgtgattgt gttattgtat tttagtttgg gttagagttg 300 

gttttattta tatgaagttt aaaaaaaggt aaaattattt atttatgatg ataaaagtta 360 

gaataagaag -gtggtaggga ttgat^ 420 — 
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attttgtata 
ttgtaatatt 
ggttattttt 
aagttattat 
tttttaaata 
taattttttt 
ttatatttaa 
ttttaaattt 
aaaaaatttg 
ttttattatg 
gttttttaaa 
taaatatttg 
taatggagta 
ttatcgatat 
ttttggtgaa 
gatatatttt 
taaataattt 
gtttggggtt 
attggtagaa 
taaatatttt 
gttaagtaat 
gtttttttaa 
tatggggttt 
agggagaagg 
tagataagtg 
ttaggttagt 
gttggtgtgg 
gaagtttttt 
ttatgttcga 
gtttttgtta 
tttttcgtat 
gtttcgagat 
tttaaaattt 
gaaaggtaaa 
tatgaagttt 
cgtaagcgtt 
agtatgtttt 
tgggcgtggt 
agagagtttg 
tcgaatttta 
agaagttatt 
tttttggtaa 
tttagtattt 
ttgtgtagag 
tatacggttt 
attttttttt 
atttgtaaaa 
gtattcgttt 
tgtaaaagga 
tattatttaa 
aaattgatgg 
ataattttat 
tggagaagtt 
aagatttttt 
gaggaggatt 
tttacgttta 
tttatgtaag 
gaatatgtaa 
attttttttt: 
aggttggagt 
gattttttta 
ttaatttttt 
atttttgggt 
tgagttatcg 
ttacgataaa 
gttttaggcg 
aag^gaaatt 



ttgagggagg 
taatatttat 
ttatataatt 
taattcggtt 
tatttttttt 
tttttataaa 
aaatttagtt 
ttttttttcg 
ttttttttat 
ttgttttagg 
gtgttgtgat 
ttttttgggt 
ttttttatat 
tttttatatt 
tagttaatat 
atttaatgtg 
ggttaaggtt 
ttaaaatttt 
tttgcgtgaa 
ataatcggga 
ttgttaaatt 
acgtttttat 
tggtagatga 
tcggattttt 
atttcggggg 
tttcgttagg 
agaggtttcg 
ttggttttaa 
gttggttagg 
atttttgcgt 
tcgttgttaa 
ttagcgtaag 
gtatatttaa 
attagtttta 
tttttttttt 
ttacggatta 
aggaggggcg 
ttgggtaggg 
tgttgcggag 
attgatcgta 
atttgttgtt 
tttgggagtt 
agggttgttg 
gaaatatata 
tttagaattg 
tggtttttgt 
tttggatttt 
tagtttaatt 
gtatattagt 
ttataggttt 
atgatattaa 
aaggtaaatg 
aagtaatttg 
attgttaagt 
aaggttgagt 
agagttttaa 
ttagtttaaa 
tttgggttta 
tttttttttt 
atagtggtgt 
ttttagtttt 
gtatttttag 
ttaagtgatt 
tgttaggttt 
aagttcgatt 
ttagtaaata 
aaatgatt 



tgtggttata 
atattggaga 
tgtaaaagaa 
ttagtgagat 
ttttattttt 
ttattattta 
gttattaagt 
tttttatcgt 
ttgagattag 
ttgattttaa 
ttataggcgt 
aagttgtagt 
tttatttggg 
tttttagagg 
ttatttagaa 
tattaacgtc 
atataattaa 
ttatgtttat 
atggaatatt 
aagtgtaatt 
ttttaattta 
agtttcgcgt 
taaacgataa 
gattcgggtt 
ttattttgat 
tgagtaggta 
ggaggtgtaa 
cgtcgcgatt 
cgttattcgt 
gaagttatta 
cgcgatcgtt 
gattcggtgt 
gtcgtattgg 
agagagtacg 
ttatttagat 
agttcggcgg 
ggttttaggc 
gcggggttat 
ttatcgtttg 
gttttattta 
gtttagtttg 
gaagtaaata 
ggagatttgt 
tgtagagtga 
gtttttattt 
tttttgcgat 
gagtttgaaa 
taaatatgtt 
tttttatttt 
atataagtat 
tttatatata 
ttattattat 
tttaggttag 
attatgttat 
attttagggt 
gtatttagta 
gttgttattt 
tgtttatatt 
ttttttaaat 
gatttcggtt 
tcgggtagtt 
tagagatggg 
tgttaaattt 
attttatgtt 
attttaaaag 
ttgagtgaaa 



seq_prot . 
tagtatatgg 
atatatttta 
gtgtaatatt 
tatattggtt 
tttttatgtt 
tgggttgaag 
atgatatatt 
aattttttta 
aattatttgt 
atttttgggt 
gagttacggt 
gttttataga 
ttttttatta 
attaggaggg 
ttaattattt 
gttttggatg 
agcggaaaac 
ttttaatata 
ttttagataa 
tcgttgtatg 
gattatagtt 
tttttaattt 

ggaggtcgaa 

atattgggga 
ttttgtagtg 
tataaaaaga 
ggagaataga 
gtttgtcgtc 
atttgcgggt 
aatttgtagt 
ttagagaagg 
tgggattagg 
ggaattaaaa 
taaagtcgtt 
ttgggtagtg 
attaagttcg 

ggggtttggg 

atcgaagtcg 
gtcgtcgggt 
ttttttttcg 
tttaggagta 
agaaaatttg 
tttaataata 
tttaaagttt 
ttatttgagt 
aaatgaggta 
gatttggaat 
tgttaagttt 
ttagggatgg 
attttagatg 
gtattgtttt 
ttttatttta 
ggataagtgg 
aaataaatag 
gagggtgtat 
tcgttaatat 
ttataaagag 
tttttaagga 
tttttttgaa 
tattgtagtt 
ggaattatag 
gttttattat 
aagattttta 
ttgatattga 
aaaatgattt 
gaaaaagttt 



txt 

gtttgtttaa 
tagaagtaga 
gttgattatt 
ttttttttta 
tttatatttt 
atatttaagt 
ataaatttaa 
ttaattttcg 
ttttgttttt 
ttaagcgatt 
attagatttg 
aaaaggaaat 
ttgtagttta 
aatatattat 
gtttaaattg 
aaattgagtt 
gttatgattt 
tgttggagag 
tttaaataac 
taattttttt 
ttttaatttt 
cgagtttaag 
agttagatgt 
tttttataaa 
agttagatta 
agttttagtg 
gacgaattcg 
gaggttaggg 
tataggtttt 
atatgacgtt 
atttcgtttt 
gttgttcgaa 
gttagggttt 

gtggcggagt 
ttacgataat 
gtagtgtcgg 
ggggagaggg 
gttgggaatt 
tgaaggcgtt 
atttagagta 
tagatttgtc 
gaagtatttg 
taaaagtagt 
aggtattagg 
tttaaggtta 

gggggagggt 

tattattttt 
gtgttgtgtg 
atttaggttt 
tatttatgaa 
aaatttattt 



atttattaaa 

tgatagaaat 

ttttttttat 

ttcggattat 

aattataatt 

ttattaaatt 

tatttataaa 

aaaaaaaaaa 

agagataggg 

ttcggtttcg 

aaattttttt 

gaagtaatag 

attgcgtatt 

tttagatagt 

ttagttattt 

tataataaat 

gaaaaggtac 

ttaaatttta 

gagtatttgt 

ttttggacgt 

ttgtattatg 

tgaggttgtt 

tttcgaattg 

ttattttttt 



gaatatagga 
tagagttagg 
atatgaagtt 
attagtattt 
gttgtaaagg 
atattgtgat 
taagtggttt 
atagggtttt 
ttgatttttt 
atgcgcgtta 
gttttttagg 
aaatgttggg 
atagatttag 
attgatatta 
tttatagagt 



aatttttacg 
attatcgtac 
atcgcgggta 
tcgttatttt 
cgaagtttat 
tagagttcgg 
ttcggttgtg 
gatcgtttaa 
ttaggattac 
ttgtaggaaa 
ttaagaaatg 
agttaggtgg 
cggtgattgg 
ttattattcg 
ttttgtttta 
ttttttattt 
ggttaagttt 
ttgaaggttg 
tttttgggtt 
atgtaaatgt 
ttgattattt 
gattagaaat 
agttttggta 
taaggtattg 
tttttagttt 
attatgttgt 
aatttttata 



aaatgacgtt 
tatttggttt 
attttgaaaa 
tggttgtagg 
ttttagcgaa 
ataatggaaa 
gggattagtg 
attttgtttt 
aggttgaggt 
ttacgtttgg 
ttagttttaa 
attataggta 
gaagattttt 
gtttttttga 
ttagattgaa 



480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4398 
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<210> 261 
<211> 4398 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 261 

aattatttaa tttttttttt taatttaaat tttatgaaag gttttttttt ttatttaata 60 
tttgttagcg tttgggattt aagaaggtta atgttaatgg attatttttt tttgaggtag 120 
tcgaattttt tgtcgtagaa aaattttttt gaatttattt agtattaaaa tatggggtgg 180 
gtttggtacg gtggtttatg tttgtaattt tagtattttg ggagttttga gtttggtaga 240 
ttatttgagt ttaggagttt gagattagtt tgggaaatat ggtgaaattt tatttttatt 300 
aaaaatataa aaaattagtt aggcgtggtg gcgcgtattt gtggttttag ttattcggga 360 
gattgaggta ggagaattat tttagtttgg gaagttaagg ttgtaatgag tcgagattat 420 
attattgtat tttagtttgg gatagagtga gattttgttt taaaaagaat ttaaaaaaaa 480 
aaaaaaaaaa aaggaaatta ttagttttaa attatttatt tttggagaaa tatggatata 540 
aatttaaatt gtatgttttt tttattatat tatagtgttt ttttgtaaaa gtggtaattt 600 
tgggttggtt tatatgaatt cgttgaaatt tttataatat attggcggtg ttgaatattt 660 
ggggtttttg aacgtaagtt tgtaattagg atgttggtgt atatttttat tttggaatgt 720 
ttagttttga tttttttttt tttagaatgg ttttatattt atttatttgt aatatagtgt 780 
ttaatagtag gaagttttga gttaaatgtt tagttttgtt atttattttt gatttgagta 840 
agttatttag tttttttagg cgttattttt ttgtatttta aaatggagat gataatagta 900 
tttattttat aggattgttg tgaagattaa atgagtttag ggtaatgtta tatataggtt 960 
agtattattt attaatttgt aatatgattt tataagtgta tttagaatat gtttgtgtaa 1020 
atttatggtt agatgatgaa gttaagaaaa gtttgagttt atttttggag agtaagaaat 1080 
tagtatattt tttttatata gtattttgta tatagtatag gtttaataga tatatttaaa 1140 
ttgaattgaa gcgaatatta ttagaattga aagtggtagt tttagatttt ttaggtttaa 1200 
ggtttagatt ttataagtgt ttttgattat ttttttttta ttttatttat cgtaaagagt 1260 
agaggttagg aaaaggatga ataattaatg attttaaaat ttaggtgaga ataaaaatta 1320 
gttttgagaa atcgtgtgat atttatattt tggtatttga gttttagatt attttgtatg 1380 
tgtatttttt ttgtataaag tttaggaggt tatttttata ttattggggt aaatttttta 1440 
atagttttgg gtattaaata gtttttagta ggtgttttta aattttttta tttgttttaa 1500 
tttttaggtt gttaggggaa gtttggtcga taagtttata tttttgggta gattgagtaa 1560 
tagtaggtgg tggtttttgg ataagaggtg ttttaggtcg gaaggaaata aatgagatta 1620 
cgattagtta aagttcggta aagtaagaga cgtttttaat tcgacgatta gacggtggtt 1680 
tcgtagtata gattttttcg gatggtggaa tttttaatcg gtttcgatgt agtttcgttt 1740 
ttatttagat tacgtttatt aattatcgtt tttttttttt taggtttcgt ttggagttcg 1800 
ttttttttaa agtatgtttt atttgatttc gatattgtcg ggtttagttc gtcgggttta 1860 
gttcgtggag cgtttgcgta ttttttgagt tgtcgtggta ttgtttaaat ttgaatgggg 1920 
gaaagaggga attttatatt ttttatagat ttcgttatag cgattttgcg tgttttttta 1980 
gagttggttt tgtttttcgt ggttttgaga attttggttt ttggtttttt aatgcggttt 2040 
aagtgtgtag gttttaagtt aggcgatttt cgggtagttt tagttttagt atcgggtttt 2100 
tgcgttgagt ttcgggatta tagtcgggga ggcggggttt ttttttgggg cggtcgcgtt 2160 
ggtagcggat gcgggaagtc ggattttggg cgttatgtat tataagttta gtggttttac 2220 
gtagaagttg gtaggagtat gggtttcgga ggtttatagt tcgtaggtac gagtggcgtt 2280 
tggttagttc ggatatgaga agtggcggtt ttagtttcgg cgataggtag tcgcggcgtt 2340 
ggagttaggg gaggtttttg ttcgcggtcg agttcgtttt tgtttttttt gtatttttcg 2400 
gggttttttt atattagcgt acggtggtta ttgaggtttt tttttgtgta tttgtttatt 2460 
tggcgggggt tggtttggcg tggaagtttg gtttggttta ttgtaggggt taaggtgatt 2520 
ttcggggtta tttgtttaaa agggatggtt tgtgggggtt tttagtgtga ttcgagttag 2580 
aggttcggtt ttttttttta attcgaagat atttggtttt cggttttttt gtcgtttgtt 2640 
atttgttaga gttttatagg tagttttatt tgagttcgag attagagagc gcggggttat 2700 
aaaggcgttt agagaaatta tggtgtaggg ggttgggggg ttatgatttg ggttgggaaa 2760 
tttaataggt tatttaatac gtttagggag ggaagttgta tgtagcgagg ttatattttt 2820 
tcgattgtaa aatatttgat agatgttcgt tgtttgggtt gtttgaaaag tattttattt 2880 
tacgtagatt ttattaattg gagtttgatt ttttagtatg tgttgggggt aaatatgagg 2940 
gattttagag ttttagacgt gtttttttaa attatggcgt ttttcgtttt aattgtgtga 3000 
ttttggttaa gttatttaat ttattgtgag tttagtttta tttagggcgg cgttagtata 3060 
tattaaatga agtgtgttaa gtgattagta gtttgggtag gtggttggtt ttggataagt 3120 
gttaattgtt tattagaagt tgtttgagat agtatatttt tttttaattt tttagagaag 3180 
tgtgggaagt gtcggtgaga tacgtagtta gattgtagta gtgaggggtt tagatgaggt 3240 
gtaaaggata ttttattatt gttgttttat tttttttttt tgtgggatat tgtagtttgt 3300 
ttaagaggta gatgtttaag aaagatttta ggtttggtat cgtggtttac gtttgtaaat 3360 
tatagtattt tggaaggtcg aggtcgaggg tcgtttgagt ttaggagttt gagattagtt 3420 
tggggtaata tagtgagatt ttgtttttaa aaataaaaat agatgatttt aiattttagat 3480 
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aagaagggta agtttttttt ttttttttcg ggggttagtg ggggagttgc ggtgggaacg 3540 

ggaaggaaga gtttagggtt tataaatgtt gagtttgtga tatgttatgt ttggtagtag 3600 

ttagattttt gaatatggag tttaataaat ttgggtgttt ttagtttata ggtggtaatt 3660 

tgtagggaag aaaggttgga ttatagttag agtatagaaa tatgagggaa gggtggagaa 3720 

ggagaatgtg tttaaagaat agttcggatg agaagggaaa ttagtgtggt tttattgaaa 3780 

tcgaattggt aatgatttgt aaggaaggaa tagttagtag tgttgtattt tttttataga 3840 

ttatgtgaga aaatggttat ttttattatt tgtttttatg aaatgtattt tttagtgtat 3900 

ggatattaaa tgttatagtt tgatgagttt ggataaattt atgtattgta taattatatt 3960 . 

ttttttaata tgtaagatgt tatattattt taggaagttt ttttgtattt tttgttaatt 4020 

tttgttattt ttttattttg atttttatta ttataaatag atagttttgt ttttttttga 4080 

attttatgta aatggaatta gttttgattt aggttggagt gtaatggtat agttatagtt 4140 

tattgtagat ttatttttga gtttaagtaa tcgttttgtt tttgtttttg aaaatgttgg 4200 

aattatagat gtgagtcgtc gtgtttggtt gtttaaaaaa tatttttgag gttagttcgt 4260 

gttgtacgta ttagtagtta atttttttta tttttttatt tttttgtggg attttatttt 4320 

ataaatatat tatagtttat tttgttggtg gtagtagatg tttttttatt tattttttta 4380 

aaatattttt tgagatag 4398 

<210> 262 
<211> 4471 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 262 

aatttataga ggtatattta tagaggttaa attttattgt atgtaaatta tattgttgta 60 

gtttaatttt tttaaagtgt tattgtatat aaaataatgt gttagatatt gaattatgaa 120 

atttaacgtt taagtatata atttttaagg tttttagatg tattattaaa aatttattat 180 

taaatatttt gagtttagtt tttagtattt tatattttat ttatatttat attttataaa 240 

acgatttaaa gtattattta aaattattat atgaattttg aattattatg tgtttaatat 300 

atatttatag aaaatttttt ttttattttt aaaagtgaat atattaatat ttattaaatg 360 

ttaattatat ggttatgagt tgaatttatt attaaatttt attttttttg aattataaat 420 

atattaagaa atgtaaattg tgattagatt tatttatatt ttttagaata ttacgaaatg 480 

tgttttaaat atgttaattg attttagaaa tttttatgtt ttgagtaaga tttaagtgtt 540 

aataagtatt tgatggtttt taaaataaga atgttttaga aatttattta aaaaaatgtt 600 

tgaacgtaaa atgtaatttt acggtgtata ttttaatgtt tgttgatgaa tagattttta 660 

tagttttttt tttttagaat ttatattaaa atatattttt tttagaaaaa ataggagtta 720 

aagatgttaa ttatgttttt tttatagtta cgttatattt gttttataat atttttagaa 780 

aagttttttt ttttatatgt ttatttttta aattataaat atgttttttt ttattagata 840 

tcgagttacg aaatttagag taaaagtaaa attttttaat ggaatgttta tgtaaatatt 900 

agaaatataa taaaataagg taaagtttgt tttttaaaat gttgtttatt atattaaaat 960 

ataattttgt tttttaaata gcgagtaaaa ttaaattaat tttatttttt taaatagaag 1020 

tttatttaaa gtttttttat aaaaatgata tttatttgat ttaatattgt ttttttaata 1080 

agaataatac gtagggtgtt agagggtagt tgttggattt ttttttagta aataaaggag 1140 

attagttaga gtttagtttg gtttgtttag gaaaggagga acgtagtttg ttgatatggt 1200 

taaggaatgt taattaatat attttaaaat ttttattatt ttgtttgaga cgtgtatatt 1260 

tttttaggtt tttaaagttt aattagaaag tgtatttaat tttattgtta ttttatttat 1320 

agtggggaag tttatcgagg aaaattatag ggaaataaaa tgttttttag ttatttatat 1380 

tagtataatt aatttagagt ttacggtata aattaaataa gttaatttgt tagtgttttg 1440 

aatgaagata tttaattaaa gtatgtattt tttaattttt tagtagtttt taggatagtt 1500 

ggttttggtg atcgtttttt ggttttttat tgtttttaat acgattggtt ttttatcgta 1560 

ggattttaaa taaaatgaga taattaaatt atatttcgag cgaaggggcg tttattttgt 1620 

agataaatat atcggttttg ttttttttaa aatgcggata cgtgtttttt tcgtattagg 1680 

gggggttttt cggcgcgcgt ttcgtcgtta tttgttgagg aaagcgagcg tattttttgt 1740 

agtttaggtt tcgggcgtta gttttgtttc gtagttttag agttcgtcgt agttcgggtg 1800 

gttttttttc ggtttagcgt tegtcgtttg tttttcgttt tgtaagtttt aagaggtagt 1860 

tatttttcgt agttttcgcg tttgtaattg ttttttggcg ggggagtggg tgtttaaaaa 1920 

gttagtagtt ggagaaattg aaaagattat aagcgattta acgataagtt tttttttttt 1980 

tttaaagatc gagaggaggg tagaggggag tagtgtttga gtttacgtga tcgagttagg 2040 

gagttcgacg gttttaggaa cgttcgacgt cgcgcgtgat ttttaagtgg gagtattttc 2100 

gaatcgattt ttggtttatt tataaggata gtggcgtata gatggcgttt ttcgtagttt 2160 

tagttttaga tttaagaggt ttggagtagg gtttgagaat atgtattttt aattaggttt 2220 

tgggggatgt cgatattgat atagttagtt tggggattat atttcgagga ttacgttttt 2280 

agtttttgat ttatataagt gttatttaga atagatgutt gattttaagg agttagtggt 2340 

gaaattagag aggttttggg tttgttaaat ttttagtagt aaacgtaatt tcgggttttg 2400 

gagtggtaaa gtcgtggatt agaggtggag ggagtgggtt tttagttttt aagaggttat 2460 
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tgggaaggtt tttcgcgtta gattaaagat tttaggtatt ttttcgaatt tatttgaagt 2520 

ggtatcgggg agatttgtgt tttcggttac ggcgtttttt ttcgttggag gtatttgttt 2580 

attttttttt tcgggcgaag gtttttcgcg ttttttggtg gtagttttag tttttttagt 2640 

ttaatggggt gtttttttta tttattttag taggagtttt agggtgcgag attaggatta 2700 

tagttttaat tggttttaag gtagttgttg ttgatgaaaa tgaaaaggaa agtagtatgt 2760 

gatttatagg ttattgtgag aattttttag tgttatattt gtgtttaatt aattattaaa 2820 

ttatttattt aggtggtata agggatatgg tttttgaggg ttgtgattta gatttttaat 2880 

agaggaagac gagggggggt atttggaggg aaagttttat agtattagtt ttagttgttt 2940 

ggtgtttagt taaaatagag agacgtaagt gtgttttggg ttgataaaga tgaggtttat 3000 

aggtaatgaa gataggtttt aaagatggag aagtatttgt tttattagtt aaataatagt 3060 

tgtgaaaagt ttttattgtt ttattttaag tttaattttt attaaagttg agagttttgg 3120 

ttttttttag gttgaaaggt agagtttttt tgtttttgag gtagagaagt tagtttggac 3180 

gggaagagtg tgtcgttggt aaataattgt agtagataat atgttaattt tagttttttt 3240 

gttttttgtt ttggggtttt ttaaagtatt tagtttttat atgttttttt tttttgttat 3300 

tttatggaga aaatgttaaa tgatggttta aaaaaattag ttttatggtt aggtatggtg 3360 

gtttatattt ataattttag tattttggga ggtcgaggta ggtggattac gaggttagga 3420 

gtttgagatt agtttgatta atatggtgaa atttcgtttt tattaaaaat ataaaaattg 3480 

gttggacgtg gtggtatatg ttagtaattt tagttattta ggaggttgag gtaggagaat 3540 

cgtttgaata tgggaggcgg aggttgtagt gagttaagat cgcgttattg tattttagtt 3600 

tgggcgatag agcgagattt tgttttaaaa aaaaaaaaaa aaaaaaaaat tagttttatt 3660 

tggtggtata tatttgttaa taagatattt ttggaaatgg aaatcgttat gaaggaattt 3720 

taattataag ttaataggta gaaaaagata ggtgagggtg aagaaaatgt atttattaat 3780 

attaatatta tatttgacgt tgtgttaagt atttgtatag tattttattt aatttttata 3840 

ttagtttagt gagatattga ttttattttt ttagtgagga aattgaggtt tatagagttt 3900 

ttgttagtta tttgaagtta tttagtgaga aaggaattta gtaataatag tggttttata 3960 

gatagttatc gtttattgga tatttatgtt tcgttttttt ttaataattt ttattttata 4020 

cgtaagttta taaggtaaat tttattattt ttattttata tttgagaaaa tgaatgttta 4080 

ggaaggttaa ataatttttt tatggttata tgatttatta gtggttaaga aggatttgaa 4140 

tttaggagag aggttaagga tttatttatt tttatattat gtttttaatt aattagaatg 4200 

aaaatggttt gattttttgt tttatgtaga atattgtaga taaatttttg ttatttgtgt 4260 

gttaagtttt tgttttgtgt ttattgtgat gtttttttta gaaggtattt tatatatagg 4320 

aatagtatgt ttttttttta gagtagttta aatatttata gaagttgttt attatataag 4380 

gaaattttat tgtaggaaat aataagtaga aaatgaatgg gagtgagtaa gttttttttt 4440 

gatttatttt gtttattaag aaagataatg g ~ ~ 4471 

<210> 263 
<211> 4471 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 263 

ttattgtttt ttttgatgga tagggtgagt tagaaggaaa tttatttatt tttatttatt 60 

ttttgtttat tattttttgt agtgaggttt ttttgtataa taaatagttt ttgtgggtgt 120 

ttgagttgtt ttgaaaagag aatatgttgt ttttgtgtgt agaatgtttt ttgaaggaag 180 

tattatagtg aatatagagt agaagtttgg tatataggtg gtagaagttt gtttgtagtg 240 

ttttgtatag agtagagagt taagttattt ttattttgat tgattggagg tatggtatgg 300 

aggtaaatgg gtttttggtt ttttttttgg atttaagttt tttttagtta ttgataggtt 360 

atgtgattat agggaggttg tttaattttt ttgaatattt atttttttaa gtataaaatg 420 

ggggtaatag aatttgtttt ataggtttgc gtataaaata agaattattg agagaaagcg 480 

gggtataaat gtttaataag cggtagttgt ttatgaagtt attgttgtta ttgggttttt 540 

tttttattag gtggttttag gtagttgata gaagttttgt gagttttaat ttttttattg 600 

gaaaagtgga gttaatattt tattgagttg gtgtgaggat taaatgagat gttgtgtagg 660 

tgtttagtat agcgttaggt atgatgttaa tattgataga tgtatttttt ttatttttat 720 

ttattttttt ttgtttgttg gtttatggtt gaaatttttt tatgacggtt tttattttta 780 

gagatatttt gttaataagt atatattatt aaatgaagtt gatttttttt tttttttttt 840 

tttttgagat agagtttcgt tttgtcgttt aggttggaat gtagtggcgc gattttggtt 900 

tattgtaatt ttcgtttttt atgtttaagc gatttttttg ttttagtttt ttgagtagtt 960 

gggattattg gtatgtgtta ttacgtttag ttaatttttg tatttttagt agagacgagg 1020 

ttttattatg ttggttaggt tggttttaaa tttttgattt cgtgatttat ttgtttcggt 1080 

tttttaaagt gttgagatta taggtgtgag ttattatgtt tggttatgaa gttgattttt 1140 

ttaaattatt atttaatatt ttttttataa ggtggtaagg aggaagagta tatggggatt 1200 

gggtattttg agagatttta ggataggaga tagggaggtt gagattggta tgttgtttgt 1260 

tgtagttatt tgttagcgat atattttttt cgtttaaatt aatttttttg ttttaaggat 1320 

agggagattt-tgttttttaa- tttgagagaa att^gga^ -1580 — 
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atttagggtg gggtagtgga gattttttat agttattgtt tagttgatga agtagatgtt 1440 

tttttatttt tggagtttgt ttttattatt tgtggatttt atttttatta atttagagta 1500 

tatttgcgtt tttttatttt ggttaaatat taaatagttg aggttggtat tgtaaaattt 1560 

tttttttaaa tgtttttttt cgtttttttt tattagagat ttggattata atttttaaaa 1620 

attatgtttt ttatgttatt tgagtagatg gtttgatgat taattaggta tagatgtgat 1680 

attggggggt ttttataatg gtttgtgggt tatatgttat tttttttttt atttttatta 1740 

gtaatagttg ttttaaagtt agttaagatt gtggttttag tttcgtattt tggggttttt 1800 

gttggggtgg gtgaggggaa tattttatta agttggggga attggggttg ttattagggg 1860 

gcgcgagggg ttttcgttcg agaagagggg tgggtaggtg tttttagcgg agaagggcgt 1920 

cgtggtcgga ggtataggtt ttttcggtgt tattttaagt gagttcgagg aagtatttgg 1980 

gatttttgat ttaacgcgaa aggttttttt agtgattttt tgagagttga gaatttattt 2040 

tttttatttt tagtttacgg ttttgttatt ttagggttcg aggttacgtt tgttgttggg 2100 

gatttgataa atttaaagtt tttttggttt tattattggt tttttagaat tagatatttg 2160 

ttttgaatga tatttatgtg agttaggggt tgaggacgtg attttcgaag tgtggttttt 2220 

agattggttg tattagtgtc ggtatttttt aggatttggt tggaaatgta tatttttagg 2280 

ttttatttta gattttttaa atttgagatt ggggttgcgg ggagcgttat ttgtgcgtta 2340 

ttatttttgt gggtggatta ggagtcggtt cgagggtgtt tttatttaga ggttacgcgc 2400 

ggcgtcgggc gtttttgaga tcgtcgggtt ttttggttcg gttacgtggg tttaggtatt 2460 

attttttttt attttttttt cggtttttaa aaggaagaag gggtttatcg ttaagtcgtt 2520 

tgtgattttt ttagtttttt tagttgttgg ttttttggat atttattttt tcgttaggag 2580 

gtagttgtaa gcgcggaggt tgcgagaaat aattgttttt tgaaatttgt agggcgaaga 2640 

gtaggcggcg agcgttgggt cggggaggga ttattcgagt tgcgacgggt tttggggttg 2700 

cggggtaggg ttggcgttcg gagtttgagt tgtaggaggt gcgttcgttt tttttaatag 2760 

gtggcggcgg ggcgcgcgtc gggagatttt ttttaatgcg ggaaaagtac gtgttcgtat 2820 

tttagagaag gtaaggtcgg tgtgtttatt tgtaaggtaa gcgttttttc gttcgaggtg 2880 

tggtttaatt gttttatttt gtttgaaatt ttgcggtgag aaattagtcg tgttgagaat 2940 

aataaaagat taaaaaacga ttattaaaat taattgtttt gaaagttatt ggaaagttgg 3000 

aaaatgtatg ttttgattaa atgtttttat ttaagatatt ggtaagttaa tttatttagt 3060 

ttgtgtcgtg agttttgggt tgattgtgtt aatatgaata attgaaaaat attttatttt 3120 

tttatggttt ttttcgatgg atttttttat tatgggtgaa atgataatgg agttgaatat 3180 

attttttgat tgaattttga gggtttggga agatgtatac gttttaggta agatgatagg 3240 

ggttttaaaa tgtattaatt ggtatttttt agttatgtta gtaagttgcg tttttttttt 3300 

tttgggtaga ttaagttaag ttttaattgg ttttttttat ttgttgaaga ggagtttaat 3360 

aattgttttt taatattttg cgtgttattt ttattggaag gataatatta agttaagtga 3420 

atgttatttt tgtgaaaaaa ttttgagtgg atttttattt aggaagataa ggttgattta 3480 

attttattcg ttgtttaaaa agtaggattg tgttttggtg tggtaggtaa tattttggag 3540 

gatagatttt gttttatttt gttatatttt tagtatttat atgggtattt tattagaaag 3600 

ttttattttt gttttaagtt tcgtaattcg gtgtttagtg aggggaaata tgtttgtaat 3660 

ttaaaaagtg aatatgtgaa aggaaaggtt tttttgagag tgttgtaaaa taaatgtaac 3720 

gtgattatga aaagaatatg attaatattt ttgattttta ttttttttga agaaaatgta 3780 

ttttgatatg agttttagaa gaaggaaatt ataaggattt gtttattaat aggtattaga 3840 

gtatatatcg taggattgta ttttacgttt aagtattttt ttagatgaat ttttgaaata 3900 

tttttatttt aaaagttatt agatgtttgt taatatttaa gttttgttta agatatagaa 3960 

gtttttgaaa ttaattaata tgtttaggat atatttcgta gtgttttgag ggatgtgaat 4020 

aaatttaatt atagtttata ttttttaatg tatttataat ttagaaaagg tagaatttag 4080 

tagtaaattt aatttataat tatataatta atatttaata gatattgata tgtttatttt 4140 

taagaataag aaggaaattt tttataagtg tatgttgaat atataataat ttaaaattta 4200 

tgtgataatt ttaggtgatg ttttgagtcg ttttatagaa tataaatatg gataaaatat 4260 

aaaatattga aggttgaatt taaagtgttt aatgataagt ttttgataat atatttagaa 4320 

attttgagaa ttgtatgttt gaacgttaga ttttataatt tagtgtttag tatattgttt 4380 

tatatgtaat agtattttaa aaaaattagg ttatagtagt ataatttata tatagtaaaa 4440 

tttagttttt gtaaatgtat ttttatgaat t 4471 

<210> 264 
<211> 4479 
<212> DNA 

<213> Artificial Sequence 

« 

<220> , . 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 264 



i • > 



cggtggttta cgtttgtaat tttagtattt tgggaggtcg aggtgggcgg attataaggt 60 

taggagatcg agattatttt ggttaatacg gtgaaatttt atttttatta aaaatataaa 120 

aaattagtcg ggtgtggtgg taggcgtttg tagttttagt tagtcggtag gttgaggtag 180 

gagaatggcg tgaattcggg aggcggagtt tgtagtgagt cgagattgtg ttattgtatt 240 

ttagtttggg tgatagatdg agatttcg|:t ttaaaaaaaa a|aaaaaaaa atatggttgg 300 
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gcgtggtggt ttatgtttgt aattttagta ttttggaagg atgaggtggg aggatttttt 360 

gaatttagaa gtttagtaaa attttgtttt taaaaaaaaa aaagaattgt gtataaagat 420 

trttagagagt gttaaagatt agcgtatgga taaggaagtt ttgtgaagag ttgaagtgtt 480 

aggtgaagag gtggtacggg ggaggagggg gcggaagggg agaaagggtg ttacgtttta 540 

taacggtttt taaatttttt tgtttaggag gaaatgaagt tatttgtttt tagtaattag 600 

tatgatagtt tttagttaag taatttggag ttatgagagt tgttagggga gtaatatgaa 660 

ttatgacggt ttttgggaat tttttgataa ttaatttggg agtttcgggg taagttttta 720 

ggttgtagta tttttgttta tgttttggtt acgtttattt ataattaatg ggtttttaaa 780 

tttaaataaa attgattata gttttttaga ggaagtagta aggttggttt tgaagtttat 840 

agtatcgttg atttagtttg ttttttggaa ggttggtagt ttagtaagta tagaagtttt 900 

tttagaagat agtgggttat ttgtttttta aaagttgaaa ggttaatttg tatttttttt 960 

agtaggtagt tggtattttg agttttcggt tggggtagag taaaggagtt tttttttttt 1020 

ttattttttt ggtatttttt ttgttttttt tttgttattt ttaggtggat ttagatttaa 1080 

ggtttagatt tgtaaggtag gaaaatgttg taggtttagg ttgggaaagg gtttaaagtc 1140 

gttagtggat tgttgggatt tagttttttt ttttttatta agagagcgag ttttattggg 1200 

tttaaaatga ttttaagttt tggtttttga tattagggga aagagatggg ggtgatagaa 1260 

ttatagaatt tttgttatgt ttttttaagt gtgtttagag atgcgtgtgt gtgtgtgtgt 1320 

gtatatataa atgtttgttt atttttaggt aggaagggtg gatgtagtta tttatatatg 1380 

gtttgttttt ttggaggata attttatttg ataaataatt gtttttattt gaatagaata 1440 

aataaggttt tatgatgaag taaaatatta aatatatatg tattaaaaaa tgtataatta 1500 

tttttttgga atgggttata tagagatgtg trtttttaaaa tgttaagagt gtaaaaggat 1560 

aaatagtgaa aaataaattt tttttttatt ttgtttttta gttttttaat ttttttattt 1620 

agaggtgaga atagaatttt tatatttttt agaattttta tagttagaat tgtttatatg 1680 

tttttattgt ttttattttt atttttgttt gtataaataa atgaattgtt tattatggaa 1740 

attttttaaa agattcgtta atattttaat aggaagtatt aatagtttat gttttaggat 1800 

tttgttttta taattttgta atattatatt acgatattta atttaatttt tattaagttt 1860 

tgttaaaaac ggattttaaa ttaagttgta aatttttagt aatttggttt tgtttttttt 1920 

tttttgatag tattattaaa taaatttttt tattgtcgaa agtaataagt tcggttttgt 1980 

tttatttatt ggttgtgttg gtgatatttg gggattgtta ttgaatagac gtatagaggg 2040 

agtttttata ggtaggggtt tttttgtttg tgtttttggg agagtatgtt tcgtatattt 2100 

gtcgcgttga tgaagatttt atagttttat tagttgcggg taagggggtt tgaggtagtt 2160 

ttaggtaagt tggggtttag cggggagaag ttgtagaaga attgattaga ggattttagg 2220 

aggttttaga gttgggcgag gtagagagtt ttttgtgcgt tttttttttt ttttgtaatt 2280 

cggggatttt ttgtattggg gtaggttttc ggttaggtgt atgggaggaa gtacggagaa 2340 

tttataagtt tttcgatttt ttagtttaga cgttgttggg tttttttcgt tggagatcgc 2400 

gtttttttta aatttttgtg agcgttgcgg aagtacgcgg ggttcgggtc gttgagcgtt 2460 

gtaagatagg ggagggagtc gggcgggaga gggaggggcg gcgtcggggc gggttttgat 2520 

atagagtagg cgtcgcgggt cgtagtatag tgcggagatc gtagtttcgg agttcgggtt 2580 

agggtttatt tgttttcgta gcgtcggttc gcgttttttt gtcgtagtta tcggtgagtg 2640 

tcgcggtttt gagattttcg ggtcggatgc gcggcggttt tagttttcga gcgtttgttt 2700 

ttttcgtttt gggttgttcg ggttttttgg gtttttcggc ggttgtacgg agttaaggcg 2760 

tttcgtttcg ggcgtttttc gcgggtgtcg atttaggttg ttcggagttc ggagtttaga 2820 

gaggagagag atagttgggg agtttggtta tcgcgggtat tttttttgcg ttgtagtcgt 2880 

tcgtttggtt tgttttttcg ttttttcgtt ttttgttttg attttttttt tttttgtaga 2940 

gtcgtcgttt agcgtttcga tttcgttatt atgagagttt tgttggcgcg tttgtttttt 3000 

tgcgttttgg tcgtgagcga ttttaaagtg agtgcgtttt tgttttgatt gatgttgttt 3060 

aaggattttt gattagtatt aggggagagg aggggttgtt tagggagttg gggtttttcg 3120 

gattttattt atagtagggt tagatttttt ttaggaaatg ggatagggtg gtagcggagg 3180 

tttgagaatt acgggggttg gtattggttg gtaagggagg aagaggtcgt cgggattgtt 3240 

ttagtttgcg ggtatttggt agatgaagtt tgtttgggtt aatttatttt ttttggttgg 3300 

aaatttatgg ttttttattt gagaattaga tacgaatagg gtgaggcgag agggagaggg 3360 

aagagtgggt tttgggattg gggttagttt atttttattt tggagttttt ggagtatggg 3420 

atttttgatg aagttttttt tcgaattttt tttagggtag taatgaattt tattaagttt 3480 

tatgtgagta tttattttta taatagttgg ttgtatagat aagttgggaa ggttttaggg 3540 

gatatttttt ttttgttttt tgttgtaggg ttgcgttatt ttttattatt tttatttttt 3600 

ttcgtttatt ttatttttgt tttttttagc gaattgtgat tgtttaaatg gaggaatatg 3660 

tgtgtttaat aagtattttt ttaatattta ttggtgtaat tgtttaaaga aattcggagg 3720 

gtagtattgt gaaataggta tggggatttt tattgtaatt gggagagaaa tttggggata 3780 

gggagggatg ggtgggaggt aagagtaggt aggagttagg agttggaggt agggtgggtg 3840 

atatttttat ttttatgtga taagtataaa tatatatata cgtttacgaa atagtggtta 3900 

tataaatgtg aggtggggtt ggaaggagat tttgtttagt tttttggtag gtttgaaacg 3960 

atatttttaa aatgttcgtt ggtagtcggg tatggtggtt tacgtttgta attttagtat 4020 

tttgagaggt taaggtgagt ggattatttg aggttaggag tttaagatta gtttggataa 4080 

tatggtgtaa ttttgttttt attaaaaatg taaaaattag tttggtatgg tagtggatgt 4140 

ttgtagtttt agttatttgg gaggttgagg taggagaatt gtttgaattt gggaggtaga 4200 

gattttagtg agttgagatt atattattgt attttaattg ggcgatagag taagatttta 4260 

ttttaaaaaa aaaaaataaa agttagttgg aatgtttttt ttttttttat atttttttat 4320 

ttttttgttt ttttgtagat aagttaaaaa tttgttatga ggggaatggt tatttttatc 4380 
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tttgtgggta 
gtttatggtg 
ttttgattta 
taattaattt 
tagtggtgtg 
tttagttttt 
atttttagta 
aaatgattta 
tcggttgtta 
ttttttttta 



seq_prot.txt 

gaggaaaggt tagtattgat attatgggtc ggttttgttt gttttggaat tttqttattq 
ttttttagta aacgtattat gtttatagat ttgatgttt 

<210> 265 
<211> 4479 
<212> DNA 
. <213> Artificial Sequence 

<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 265 

gagtattaga 
ggtagggtcg 
tatagtaggt 
aagaatattt 
agttggagtg 
atttttttgt 
tgatttttgt 
ttttgatttt 
gttattatgt 
ttggataggg 
gtgtgtgtgt 
ttaatttttg 
gttgtagtgg 
ttgtattagt 
ttatagttcg 
gtggcgtagt 
tttgtgtagt 
tgttttggaa 
ggtgagggta 
ttgttcgtat 
atttaagtaa 
tttttttgtt 
attttttggg 
gtagtttttt 
gagcgtattt 
ggtttttatg 
agggtaagag 
tgttcgcggt 
agtttggatc 
tcggggagtt 
ggtcgtcgcg 
ggagggcgcg 
gttttcgtat 
gttttttttt 
cgcgtgtttt 
ttaatagcgt 
tatttggtcg 
cgtataggag 
tttttgtaat 
tcgtagttga 
ttagaagtat 
ggtagttttt 
tcggtagtaa 
ttgaaaattt 
aggtgtcgtg 
gtgtttttta 
tatttgtgta 
ggaggttttg 
ggaggataaa 
tttaaaaagt 
atgtgtattt 
attgtttatt 
attttttttg 
tttgggtata 
tttttagtgt 



4440 
4479 



ttatgtttgt 
tttgtttttg 
agatttttat 
gaatgttgga 
ttggagagaa 
tttgtagtag 
taattgttgt 
gagattcggg 
aattggtttt 
ttagttttta 
gttttattta 
agttagtgtt 
gagagtttgg 
tttttttttg 
attttggagt 
gtggcgaggt 
gcggaggaac 
gattaggttt 
ggtattcgcg 
tagggagttc 
tattcggttc 
agtcggcgtt 
tgtgttgcga 
ttttcgttcg 
cgtaacgttt 
ttggattgag 

ggggtttgtt 
attttttatt 
tttttttcgt 
tggagttgtg 
agatagaaaa 
agatattatt 

gggggtttgt 

gtagtttggt 
atatgatgtt 
ttgaagtgtt 
ggtaagaata 
gagggtgtgg 
ataagaggaa 
atatttttgt 
agtgttttat 
aaataaaatt 
tttgaggata 
tttggaggaa 
taggaattag 



tggtacgttt 
ttagtgttgg 
tttataaggg 
ttattttttt 
attttagttt 
taagtagttg 
gaggtagggt 
tttattttga 
acggatattt 
attttatttt 
tatataggga 
ttttttattt 
atttatttta 
gaagtatttg 
taaaggtggg 
agggtaggga 
aggggtggat 
aggaggtttt 
aattttaaaa 
aatggaagat 
ttagatgttc 
aattttcgtg 
ttttgttgtg 
gtgttgatta 
cgtttacgat 
cggggcgtta 
gggaaggtag 
tttagttgtt 
ggggacgttc 
gggtagttta 

ggggatttta 

gcggggatag 
ttcgcggcgt 
gttttttttt 
ataaagattt 
gaatcgagag 
ttagtgtaag 
tcgtttagtt 
tgggttttaa 
aagtttttat 
atttttgttt 
aatataatta 



gttgaaggat 
ttttttttcg 
gataggagga 
tttttgagat 
attgagattt 
ggattatagg 
tatattatgt 
ttttttaaaa 
taaagatgtc 
atatttgtgt 
tgaagatgtt 
attttttttt 
tagtgttgtt 
ttggatatat 
gtaagcgagg 

ggggatgttt 
atttatatgg 
attaaaggtt 
tttatttttt 
tatgggtttt 
gtaggttggg 
gtttttaagt 
gatggaattc 
gaggtttttg 
taggacgtag 
gacggcggtt 
gttaggcggg 
tttttttttt 



ttgatggtgt 
ttaaagttcg 
ataggattgt 
aacgggtttt 
aaagtaaaga 
aagttttgtt 
gatttatttt 
atagtttatt 
tttattatag 
gttttttaga 
agtagatatt 
tatagfaggg 
ggtttggggt 



gggacggggc 
gggcggggga 
ggatcgcggt 
gtggattttg 
ttgttttata 
ttgttttgta 
gggggaagcg 
gtttgtaaat 
gagttttcga 
ttgaagtttt 
tttgtttaag 
taacgcgata 
gtaggggttt 
gtggatggaa 
tattagaggg 
tttttgatag 
gggaataaag 
ttgggaagtt 
taatggaaat 
tttatttttg 
ttattgtttg 
ttaaaaagat 
agttttgttt 
aaaatagatt 
tgtgtatata 
attttgtgat 
tattttgaat 



agtggtagag 
gtaaaagtga 
tgagagaata 
ggagttttgt 
ttgtttttta 
tatttattat 
tgtttaggtt 
tgttgggatt 
gttttagatt 
ggttattgtt 
atttatttta 
gtttttaaat 
tttcgaattt 
atgttttttt 
gggagtggaa 
tttgaagttt 
aatttgatga 
ttatgtttta 
tttttttttt 
tagttaggag 
gtagtttcgg 
tttcgttgtt 
ggaggatttt 
ggtagtatta 
agaagtaggc 
ttgtaaagga 
cgattgtagc 
ttgggtttcg 
gttttgattt 
ggtagacgtt 
atttatcggt 
gttcgggttt 
ttagggttcg 
gcgtttagcg 
cgatttttag 
ttttcgtgtt 
attgtagaga 
ttggggtttt 
attgttttag 
aatgtacgag 
tttttgtgcg 
taaagtcggg 
ggaaaggtaa 
ggtttgataa 
ttttagggta 
tttataatga 
atgtagatag 
agtagaggaa 
tttttttata 



ttttagggta 
ttattttttt 
tgagaaagag 
tttgtcgttt 
ggtttaagta 
tatgttaggt 
ggttttgaat 
ataagcgtga 
tgttagaaga 
tcgtgagcgt 
tttttagttt 
ttttttttta 
ttttgggtag 
atttagatag 
gtggtaaggg 
ttttaatttg 
agtttattgt 
gggattttag 
tcgttttatt 
aaatggattg 
cggttttttt 
attttgtttt 
agttttttga 
gttaaagtaa 
gcgttagtag 
aggagaagtt 
gtaggggaga 
gatttcgggt 
cgtgtagtcg 
cgggagttgg 
ggttgcggta 

cggggttgcg 
tttcggcgtc 
attcggattt 
cggaggggat 
tttttttatg 
ggagagaagg 
ttaattagtt 
atttttttgt 
atatattttt 



Seite 183 



aattatgtat 
attttattta 
atgtgtaaat 
tatatatata 
tftgttattt 
ttagtaggat 



tttgtttagt 
tttattgttt 
ggttagatta 
ggattgggtt 
taaattgttg 
gtaatttatt 
ttttaattgt 
ttgggagatt 
tttttaatat 
tttttaatgt 
gatagaaata 

gattgtattt 
tatacgtatf 

ttattttttt 
tcgttttttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 



seq_prot.txt 

agtgggaagg aggaggttga gttttagtaa tttattagcg gttttgggtt tttttttagt 3360 

ttaggtttgt agtatttttt tgttttgtaa atttggattt tgggtttggg tttatttgag 3420 

agtgatagaa ggaaggtagg gagagtgtta ggaaggtagg aaggaggaag gtttttttgt 3480 

tttgttttag tcgagggttt agggtgttag ttgtttgttg gggaaagtat aagttagttt 3540 

tttagttttt gggaggtagg tgatttattg ttttttggag agatttttgt gtttgttgag 3600 

ttgttagttt tttaggggat agattgagtt agcgatgtta taggttttag agttaatttt 3660 

gttatttttt ttagaagatt gtggttagtt ttgtttggat ttgagaattt attaattgta 3720 

aataaacgtg attagaatat aaatagagat gttgtagttt gaggatttat ttcgaaattt 3780 

ttaggttagt tattaggaaa tttttaggga tcgttatgat ttatgttgtt tttttagtag 3840 

tttttatgat tttagattat ttggttggag gttgttatgt tgattgttga gagtagatga 3900 

ttttattttt ttttggataa agaggtttgg agatcgttat gaagcgtgat attttttttt 3960 

ttttttcgtt tttttttttt tcgtgttatt tttttattta gtattttaat tttttataga 4020 

atttttttat ttatgcgtta atttttagta ttttttgaaa tttttgtgta tagttttttt 4080 

ttttttttaa agataaggtt ttgttgggtt tttgggttta aggggttttt ttattttatt 4140 

tttttaaagt gttgggatta taggtatgag ttattacgtt tagttatatt tttttttttt 4200 

tttttttgag acggagtttc ggtttgttat ttaggttgga gtgtagtggt ataatttcgg 4260 

tttattgtaa gtttcgtttt tcgggtttac gttatttttt tgttttagtt tatcgattag 4320 

ttgggattat aggcgtttgt tattatattc ggttaatttt ttgtattttt agtagagatg 4380 

gggttttatc gtgttagtta ggatggtttc gattttttga ttttgtgatt cgtttatttc 4440 

ggttttttaa agtgttggga ttataggcgt gagttatcg 4479 

<210> 266 
<211> 4492 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 266 

ttgtaggttg cgtttagttt ttgtgtatta gggatagtaa ggaaaattta agttagatta 60 

gttttagggg tggtagcggt ttttattttt agagaagaag aagatatttt ggatgggttt 120 

ataggtggta ggtataagtt agtttatttt gtagttatta tagttgttgg tttttaagtt 180 

gtttttttta ttggagaata aggatagtta cgtggcgcgg gatggtcggc gggagttttg 240 

gttgcggtta cggttgtggt ttcgttgtga acggtagttt ttgcggttgc gatgtttaaa 300 

tttttgtttt ttggttaagg aggggcgggg tgttatgttt gagatgtaga tgcggttagt 360 

tatggttgcg tttatttgtt ttggtatatc gtgttagttt cggttaatga attggggttg 420 

tttggtatta gttgtggtag ggaaggggtg aatgagaggt tttgggggtt cgaattttgt 480 

tttttttagc ggggttatta gagtttattt gttgtatttt gtttaaagat atatagtagc 540 

gaatggtaga gttaggtttt tgatttttag tttaattttt ttttttttgt gttttttttt 600 

tattatatta ttggtgtttc gattttattt ttttaggttt tttaaatttt ttagttatag 660 

tttttttttt tgttggttgt gtttttagtt gtgttaggta aagtttaagg tgtggggttt 720 

aggtagagtt gagttttttt taaaaggtta atagaagtaa agtttggttt gagtaaatag 780 

attttgatcg atagttttat aattatagtt ttttttagtc ggtttggttt tattaatatt 840 

aaggggtttt agtttttttg agggagaagt aagatttgtt ttttttttat attagaggtt 900 

ttgttttaga agagaagttc gaagatgttt ttttagttgt ttcgttgtat tatggtaaag 960 

tgtttaaata tagtcgatag ggagttgttt ttatattttt tttgattggg ttggtgttgg 1020 

aattggtatt tttagtattg tttttttgag gagtagggtg gtgggtatta ggagggttgg 1080 

gttagggtaa gattggagat tttagaggtt gttgaagtta ggatttttta gtatgaggtg 1140 

ggggttaggg gtgggtatat gttgaggttt gtttttagtg aaatgtgaaa ggaatttaag 1200 

attgttttat tgtttgttta gttttttatt attttttgag tatttttgaa gaagtagatt 1260 

cgttttcggt tattgtagtt atgggtaggg agggttaagg ttgtatttac gttgttcggg 1320 

atgttatcga agtcgttaga gatatttcgg gggtaattag ggtttaggat attattttta 1380 

aagcgttagt attgattatt ttaagaggtg gggaaagtga aaaggggtat ggaggtcgtt 1440 

ggttgggtat agaggtagag ttttttgttt taaggtagtc gtttttaggt ttaggtttat 1500 

tgtttaggat ttggagtttt ggggttgttt tgtgtttata gagtttttat taggttttgt 1560 

agggttttgg gtttagtttt tttgtttatt ttgtttttgg gagtaatagt ttttaaattt 1620 

tttttagatg ttttttattt tggtttatag tttttggtat tttgaagagg taggtttttt 1680 

tttgatagtt gatgcgggtg aaggcggtat cgatggggtt ttcgatgttt tagatatttc 1740 

ggatgagttt ggggtattta ggttttattg ttttttcgtt tagtttatag tagtattgtt 1800 

ttagagtgga ggagatggtg tgagagtagg gacgtttttg gggtagattc gtatttttag 1860 

tatttgtttt ggatttattt cggaaggtaa agagggaatc gtttttgagg tcggtgaagg 1920 

cgtcgaaggg tttt^tattg tatagttttt tttttgttgg gggttgaggt ttttttggat 1980 

gaagggtttt aggttttgag tttatttttt taggtttaga ggcgtttatt ttaggcgtag 2040 

gggttttttt tttaggtttt agaataggtg tttgtttagg attttttttg gattgggttt 2100 

ggaggttaga ggttagggag gggtttttta tttgtttatg gatagtggta ttgttttttt 2160 

tttcgttatc gttatagatc gtgtatttat ttttcggtat agtgaatata ttttcgcgag 2220 



ttattgtaga 
tttgggtttt 
ggtagtaaga 
agcggttttt 
attttcgttt 
ttagagtaat 
ggtttttagt 
gtttgaggaa 
ttttttttgt 
tgttgtagta 
tttgttaagt 
gtagaaagaa 
aaaataggat 
gataaaatgt 
tatgattgtt 
gtaggaaaaa 
ataaaagaat 
ggaaatttgg 
aggttatttt 
tatttttaga 
aatatttgat 
ttgaaaaaag 
gtgatttggg 
taatttttat 
taagagttat 
tttacgtatc 
attttttatt 
gttagggaag 
gtaggaggtt 
ttttttcggg 
tcggtcgggg 
cgagttcgtg 
ggagttattt 
ttatttttgt 
tatttttgtt 
ttcgggtttt 
gatgtttttc 
tgtcgttttt 



gagtggatgg 
gaatatattt 
gtagagttcg 
gtatgatttt 
tttttatatt 
ttatgttagt 
ttagtgtttg 
gggagggaga 
tgttttgttt 
aaggttatat 
ttagaattat 
ggtttattgg 
taataaagtt 
ttagtagttg 
tgattagata 
gggagtggga 
aaagttttat 
gtaaatatta 
ttgtttttag 
ttttcgagag 
tttgtgttta 
tgattttgag 
ttaatttttt 
tttataggta 
gatgattgtt 
gatgttacgt 
tttttttttt 
gttggttagg 
ggaggagttt 
gcgtggttgt 
atcgttgttt 
tttaggttac 
ttcgtttgtt 
taatcgtttt 
tggtattttt 
ttttttttta 
gcggtttcgc 
tcgttatgtc 



tagtgagttt 
tggggtttgt 
ttatattggt 
atgaggaagg 
gatttagatg 
agggttagta 
gtaagttggg 
ggtagagata 
gtttaatttt 
ttttggaata 
taggttatcg 
gtaatagttt 
ttgttttatg 
gagtagttat 
tatttttggt 
ggagagattg 
ttttgggttt 
atttattgga 
gtttagtttt 
aaatatgatt 
attttaggtg 
attttagttg 
ttttatgggt 
tgttgtgagg 
ttaattattt 
ggggcgggga 
aagtttagcg 
ggcgtgggcg 
cgcgggattt 
aggaggttgg 
tttcggggcg 
gttttgtttt 
cgtttttttt 
ttcgattttg 
ttttttattt 
aaattcgggt 
gtttatggtt 
gggtttacgg 



seq_prot . 
ttagtagtag 
atttagtcgt 
attttttgtt 
agtgttagtc 
gttattaata 
tgagaagggg 
ttttggtttt 
gggaaggagg 
tagtttattt 
ttgggtttgg 
gaaggggaat 
tatttttttg 
gggtgttggg 
aaattttttt 
tgggtgttta 
taagtatttt 
tttttttggg 
ttggatttaa 
tttatttgta 
tttttatttg 
tttatagaat 
attttttttg 
tttagtggag 
tttcgttaag 
agtagggagg 
ggcggggcgg 
tagggggagg 
tggttgggga 
agaggcgggg 
aggagtttcg 
tggttgttgt 
agtcgtcgtt 
tttcgtttat 
tcgatatttt 
tttttttttt 



txt 

ggggtatttt 

atagtttgtg 
tacgttgaag 
ggtgttatta 
tttttttgta 
ttttaggggt 
tttgaagttt 
gtattggaga 
tttttgtttt 
gcgagttggg 
tagtatcgtg 
agttttagtt 
agattatatg 
tttaaatata 
ttcgttatta 
ggggaatttt 
tgtattaatt 
tttttgagtt 
aagagtgggt 
gaaaatgttt 
ttttgaaatt 
ttttgtaatt 
ttagttgtgg 
aaatagttat 
gggtatttag 
agaacggaga 
tggttgcgat 
cggtttgggg 
cgtcggttcg 
cgggatttag 
cgagtatttt 
ttttttattt 
tttttgtatt 
ttttttaaat 
attttttttt 



tttttcgcgt ggtttcgttt 
ttgacgttgt tttttttcgt 
cggggcgtcg agcggttggc 



agtttattta 
tagtagtttt 
tttttagtgt 
agtttagatt 
ttttggttag 
gttatggtag 
tcgttttgat 
agaggaattg 
ttttagcggt 
agataagatt 
gatttggagg 
tatttttagt 
aattggatta 
gttattgatt 
tttatttagg 
attttttagt 
tttttaggtt 
ttttttgagt 
ttgtttaata 
attggtatga 
tttttattat 
agaggggatt 
aatggaatat 
aaaagggttt 
cggtgttgtt 
gcgttttttt 
tgattgaggg 
gtggggttgc 
gggattattg 
aggcggggcg 
ttagtttagt 
tacggcgttc 
tgggtttgta 
ttttgatcgg 
ttcgattttt 
ttaggtcggg 
ttataagttg 

gg 



<210> 267 
<211> 4492 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 267 

tcgttagtcg 
aggcggagag 
ggaggcgggg 
gggaaagaag 
aagtttgggg 
tagatgtagg 
tagggtagag 
gggagatgtt 
ttttgggttt 
ttcgggtcga 
atttttaagt 
tagtcgtagt 
gtttttcgtt 
cgttggatgt 
ttatggttgt 
ttttatagtt 
ttgattatag 
aggattttag 
gtgggtattt 



ttcggcgttt 
aagtagcgtt 
ttacgcggag 
atggaggggg 
gaggagtgtc 
gagtgggcga 
gaggcggcgg 
cggtagtagt 
cgcggggttt 
cgtttcgttt 
cgtttttagt 
tatttttttt 
tttcgtttcg 
tttitttttta 
tttttagcga 
gattttattg 
aatagaaaaa 
aagttttgtg 
tttaggtaga 



cgtcgtgggt 
aggattatgg 
agattcgggt 
aggaatggag 
ggtaggatcg 
aaggagagag 
ttggaataaa 
tacgtttcgg 
ttttagtttt 
ttgggtttcg 
tacgtttacg 
tgcgttggat 
ttttttcgtt 
ttaaatggtt 
aattttataa 
aagtttatga 
gttagttaaa 
gatatttgaa 
gggattatat 



tcgatatggc 
gcgcggggtc 
tttgggaaaa 
agaagggtat 

ggggagcggt 

cgagtaggcg 
gcgtggtttg 
aggagtagcg 
ttgtagttac 
cggggttttt 
tttttggtta 
ttggaggaag 
ttacgtgata 
aaaataatta 



gaagaagcga 
gcgggggata 
gagggattcg 
taggtaaggg 
tggtaaaggt 
agggatggtt 
ggtacgggtt 
gttttcggtc 
gtttcggagg 
ttagtttttt 
gtttttttta 
ggggtggaga 
tcggtacgtg 
ttataatttt 



tatgtttatg gggtgggaat 
ggaagaaatt ggtttaaatt 
attttaggat tatttttttt 
attgggtata aaattaggtg 
ttttttcgag agtttaaaag 
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tatagtttgt 
ttttcggttt 
agaggggtcg 
tgtcggttag 
gatgtaggtt 
tcggacgtcg 
cggttgagtt 
gacgtttcgt 
agtagtggtt 
gcgtagtttt 
gttttttagt 
atgagaggac 
ggagtaatat 
taaggttttt 
tgatgtttta 
atagtttttt 
aggtgatggg 
ttttatgtta 
tgtgttgaat 



2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4492 



60 
120 
180 
240 
300 
360 
420 
.480 
. 540 
600 
660 
720 
,780 
. 840 
. 900 
■ 960 
1020 
1080 
1140 



aagtttattt 
aagatttaga 
gagattgatg 
ataaaatttt 
aataatggcg 
gttatgtttg 
tttatgtaat 
tgaattaaga 
tttacggtgt 
tttttagttc 
agggggtaga 
tttttttagt 
ggagatttta 
gtatttttga 
ggtatagggg 
tttggtggta 
ggttttaacg 
tgtatagatt 
ttggggtgtt 
gatgtgttta 
aatgttattg 
tttaaaggga 
gaggtgggcg 
agattttagt 
gattttaaga 
gcgggtttgt 
tgttatgaat 
tggggtatcg 

tattttttta 
gagttattgt 
tggtgggagt 
ggatttggga 
ttatattttt 
gtgttttgga 
acgtggatgt 
tttttaaggg 
ttttttatat 
gttgggagat 
tttaatgttt 
agtttagtta 
tgatgtagcg 
gtatggagag 
tagagttagg 
ttaggttaga 
tattttggat 
gaggagttta 
gtataggggg 
gtgtttttgg 
cggattttta 
gtttattagt 
tatttatatt 
tcgtaatcgt 
tcgtcggtta 
agttaataat 
ttagagtgtt 
tttgggtttt 



tttatagatg 
aattaaattt 
tatttaagag 
ttagagtgtt 
aatgagtatt 
gagaaggaat 
tttttaatat 
tttagagaga 
taattttttt 
gtttaggttt 
ggggatgggt 
gttttttttt 
gggagattag 
gatttttttt 
agtgttggtg 
tcgattgata 
tggataagaa 
atacggttga 
ttttgttgtt 
ttatgtcgga 
tttatgaata 
attttgagta 
tttttaagtt 
ttttagtaga 
acggtttttt 
tttaggagcg 
tggacgaaaa 

agggttttat 
aggtgttagg 
ttttagggat 
tttgtgagta 
acggttgttt 
ttttattttt 
ttttgattat 
agttttggtt 
tatttagggg 
tttattgggg 
tttaatttta 
attattttgt 
ggaggagtgt 
ggatagttgg 
agggtaagtt 
tcggttggag 
ttttgttttt 
tttgtttagt 

gggggtttgg 

aagggaattg 
gtaaggtgta 
agatttttta 
cgggattggt 
ttaggtatgg 
aaaggttatc 
tttcgcgtta 
tatgatgatt 
tttttttttt 
ttttgttgtt 



gggagattga 
agtttagtgg 
aagaatttag 
tataattttt 
tagttaggga 
ttgtggttgt 
tttatgaagt 
taaagttgtt 
ttcgatgatt 
agtgttttag 
tggaggttga 
tttgtttttg 
agtttagttt 
tatattggtt 
gttatttggg 
tttttttttt 
gtgttagtgt 
gtgtaagttt 
ggagatttat 
ggatgagtat 
ggtggggggt 
gatatttgtt 
tgaggggata 
ggaggagttg 
ttttgttttt 
tttttgtttt 
ggtagtgagg 
cgatgtcgtt 
ggttgtgggt 
agggtggata 
tagggtagtt 
tagggtaagg 
tttatttttt 
tttcgaaata 
ttttttgttt 
gtggtgggag 
ataggtttta 
atagtttttg 
tttttaggga 
gaaggtagtt 
gaggatattt 
ttgttttttt 

ggggttgtgg 

gttgattttt 
atagttgaga 
gggggtgggg 
gattgagagt 
gtagatgaat 
tttatttttt 
acggtgtgtt 
tatttcgttt 
gtttataacg 
cgtggttgtt 
ataggatgga 
ttggaggtag 
tttggtgtat 



seq_prot. 
gtttggggat 
gttgatattt 
aaatgaaatt 
ttttttattt 
tgtgtttgat 
tttagttatt 
aaggttttgt 
gtttaatgag 
taatgatttt 
gaatgtgatt 
gtaaatagag 
tttttttttt 
gttaggtatt 
ttgttggtat 
ttaatgtagg 
atagagttat 
gacgagtttt 
taaggtgtgt 
tattatttat 
acggtttatg 
ttttttttga 
ttgaaatttg 
gatttaaggt 
tgtagtggga 
cgaggtgaat 
tatattattt 
tttgggtatt 
tttattcgta 
tagggtagaa 
gggaagttgg 
ttaagatttt 
gattttgttt 
agggtagtta 
tttttgacgg 
atagttatag 
attgagtagg 
gtatgtgttt 
ggatttttag 
aatagtattg 
ttttgtcggt 
tcgagttttt 
tttaaaaggg 
ttgtggagtt 
tggggaaagt 
gtatagttag 
tcgagatatt 
taaaggtttg 
tttaatggtt 
ttttgttata 
agggtaagtg 
ttttttggtt 
aggttatagt 
tttgtttttt 
ttggtttgtg 
gagtcgttgt 
aagggttgaa 



txt 
agggagtggt 
atttaaattt 
ttgttttttt 
tttttttttt 
taaataatta 
gggtattttg 
taattttatt 
tttttttttt 
gagtttggta 
tttgttgtag 
tagtagaaaa 
ttttttttag 
gagttagaag 
gggttgtttt 
gagggcgagg 
gtaagggtcg 
gtttttatta 
ttagagttta 
tttttgtagt 
acgatggcga 
tttttgattt 
aggaagaggt 
ttgagatttt 
agtttttcga 
ttagggtagg 
tttttatttt 
ttaagtttat 
ttaattgtta 
agtatttagg 
atttagggtt 
aggttttggg 
ttgtgtttag 
gtattggcgt 
tttcgatggt 
tggtcgggag 
tagtggagta 
atttttgatt 
ttttgttttg 
ggagtattag 
tgtgtttgaa 
tttttggggt 
ttgaaatttt 
atcgattaaa 
ttagttttat 
tagagggagg 
agtgatatgg 
gttttgttat 
tcgttggaag 
gttggtatta 
gacgtagtta 
aagaaataaa 
cgtggtcgta 
agtgaggaga 
tttgttattt 
tatttttgaa 
cgtagtttgt 



ttgtttagaa 
ttagtttggg 
atgttaaaaa 
gttttaaata 
tggattaata 
tttggtttag 
ttattgaaaa 
gttttttaga 
aaggttttat 
tagtcgttgg 
ggtagttttt 
gtattagagc 
ttttgttatg 
ggttgattaa 
gtggtttggg 
ttgtattgag 
ttagagttgt 
ggtgggtggg 
gattcgcggg 
ggagaaaaat 
ttaggtttag 
ttttgcgttt 
ttatttaggg 
cgtttttatc 
tattggggat 
agggtagtat 
tcgagatgtt 

ggggaagatt 

gagggtttga 
ttgtaggatt 
tagtgaattt 
ttagcggttt 
tttgaggatg 
atttcggata 
cgggtttatt 
gttttggatt 
tttattttat 
gtttagtttt 
ttttagtatt 
tattttgtta 
agaatttttg 
ttggtattgg 
gtttgtttgt 
ttggatttta 
ggttgtggtt 
tggagggaaa 
tcgttgttgt 
gggtaagatt 
gatagtttta 
tggttggtcg 
ggtttaggta 
attagaattt 
gtaatttggg 
gtgaatttat 
gttggtttag 
aa 



<210> 268 
<211> 4448 
<212> DIM A 

<213> Artificial Sequence 
<220> 

<?23> chemically treated genomic dna (Homo sapiens) 
<4Q0> 268 . ,. 



1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 . 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4492 



-ttagtatttt-aggagttgat- .gagtt-t-t«atn^ggg^at-t^ ttl^-tttt^^gttt^agcta 
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seq_prot . txt 

ggtcgggggg aatttgtaag gaatatgtgg aattttatat ggtgtttttg atattgcggt 120 

aatgttaggg agaggtttgt ttaattggtt agggttttga aatttcgatt atttttttat 180 

ttttgtattt tgaaggttag tagagaggag gaattttgag ttttttagga ggtgggtagg 240 

tggtggtagt atagattttt agatgtttaa acggagttgt agagagaagg ttttttcggt 300 

agattttgga agggattttt acggtagaat ttttatttta taagtaggga tagaggttta 360 

gatggggata gagacgtgtt taaggtaaaa taatggacgg aggtagggta gggagaggtt 420 

aggttttaga ttcggtttta gtattttttt tatatatata tcgatataga agttggttta 480 

gatttataat ttaatggttg tgtagatttt agttttttat ttttgtcgtt tacgtgttta 540 

ttggtttggg gttagggttt tttgtttaaa ggtatggatg tgcgggagtt tttttgttag 600 

gtacgcgttt attagtttgg ggcgagagat ggggttagga aaaggttagc ggtgatcgta 660 

tatttcgagg ttagggtgat ttaagagatt ggtgttttta aggttgtagt ttttaggtcg 720 

tgggaatacg tatttggaga gagggtttat atcgaaatat tcgaagtagg ggttacgaag 780 

gcgggatttt taaggttttt gagttacgtt agggggagtg cgggagttag gttgtagggg 840 

tagggggagt ggttggggtt gtggtatagg agttaggtaa ggtgttttta gggttttatt 900 

tgtgtttttg aagtagcggt ttnttttcga atttggtcga tattattagg attcggaagt 960 

tataggagta acggttgagg gtcgtgtttt ttatttttta tcgagcggaa gaatatgaat 1020 

ggtgtcgaat tcgtttttcg agtttttttt tttttgrtcg gcggtttaat ttattatatg 1080 

tttcgttttt aggtttcgtt gtagtttttc gcggaggtat tcggcgttga gttttatggc 1140 

ggtagtttag ttggaacggt agtttagtag ggatataatt ttagttcggg cgtcggttac 1200 

gttattttgt tgttttatag gagttatttt cgttggaaaa tttattttcg tttttattaa 1260 

ggcgtacgtt aacgtagtat tttcgttttt aagtagtcgg gttttttagc ggtttaggtt 1320 

tttcgcgttt gttatgtgtt agttaattag agtttgagga aggtgggatt cgggcggagt 1380 

cgatgttgaa tggtacgcgt tcgtcgattg gatagtagtt tggttttcgc ggtcggattt 1440 

ttatattcgt ttttagatag gagaggggta cgtatcggcg ttacggtttt ttgtaggttg 1500 

ttttcggata gttttcgaga gtttgtttcg aagtaagtat tttgtagttt tagcgattta 1560 

gttttttttt ggattttagg ttacggtaat taattttttg ttgtggtttt cgaattttaa 1620 

ggttcgatgg gtttcgcggg ggtcgcggcg aggttagggc gttttttcgg cgtttatttg 1680 

ttttattgtt tgaattttcg gtatcggggt cgcgtttacg tggggtttat tgttaatatt 1740 

gttcgtcggg tttagtagta taatgggggt cgtaaaaaag gcggggtttg gcggattagc 1800 

gggcgagggt tttggtgaga gggggaggtt ttttgtgtcg ggaggaaggc gttttagagg 1860 

aggcggattt cgcggggtat aggtttgttg agaaggatcg gttaggattg tgatagaggc 1920 

ggggcgtttg tggtggggac ggggtttgtc gtaggggagg agcgtgacgg ggaggcggtg 1980 

ttcggggtat tttcgcggcg gaatttaggg aaggagttag ttggggtcgg ggtgatgatt 2040 

taggttgggt tttagtatag ggtttttggt gggtacgttg gggtcgggtg gaatgtagga 2100 

gagagaggag gtgggatagg tgggtatttg ggttggaggn agggtttgag gtgggtaggt 2160 

gtagagggtt gtatttttcg gttgaagtcg ggaatgagga tttcgttttc gggtgggatt 2220 

ggaggggatt cgcggttgag gcgttgggtt gcgataggga tattatcgtt ttttttttta 2280 

gggagatggt gttcgtcgtg tacggttttt cgtttttcgt ggtcgttttt cgggtaagga 2340 

aggagatcgg gtagcggcgg tcgggtgagg gtttgggttt cgtttttcgt ttcgagtcgt 2400 

tttgatgttt ttgtacgtcg ttcgtagttt gagtgggttt ggtagtattc gtacgtttcg 2460 

cgtcgtttgg cgtacgtggg gtttcgtttg cgaggagaga tagttttcgt tttttatttg 2520 

cgcgtgttgg cgtatatgtt gcgcgtatcg ttttgggttc gtttttcgtt tacgttgcgt 2580 

tgggtgcgtt tagattttcg ttaggatttt tgtttttcgt cgtcgtcgta cgtgtttttg 2640 

gttttcgggt ttttatcgtt ttaggtttcg gttttaaggc gtcgcgtagg tttttttgat 2700 

gacgcggagt ttgagttaga ttagggggat ttaggggttt gttgtttttt gtgcgtttag 2760 

attatttagg tgaggttttt tcgaggaatg gatggttrta gagtttagac gatttcggat 2820 

ttagtttttt tttgagggaa atttattgat acgttttggt tattttattt ttatttttaa 2880 

gatgttgatg ttataggatt tacggttatt tttgagtaaa gtaggttttg tttagggttt 2940 

ttattatttt ttttttaaag aaaaggtttt tttttaggtt attagtattt cgttgtattt 3000 

tagtgtggtc gatagagtat gatttttttt tttttttttt ttttttttga gatggagttt 3060 

agttttgtcg tttaggttag agtgtagtgg cgcgattttt gtttattgta agtttcgttt 3120 

tttgggttta cgttattttt ttgttttagt ttttcgagta gttgggatta taggcgttcg 3180 

ttattacgtt tggttaattt tttgtatttt tagtagagac ggggttttat cgtgttagtt 3240 

aggatggttt cgatgtttta gtttcgtgat tcgttcgttt tagtttttta aagtgttggg 3300 

attataggta tgagttatcg cgttcggttt tttttttttt tttagataga gttttgtttt 3360 

gttgcggagg ttggagtgta gtgattttag tttattggaa gtttcgtttt ttgggtttaa 3420 

gcgatttttt tattttaant ttttaagtgt gttattarat ttgtcgaatg tttgtgtttt 3480 

tagtagggat ggggttcgtt atgttgggta ggttggtttt aaatttttga ttttaagtga 3540 

tttgttcgtt tcggtttttt agagtgttgg gattataggc gtgagttatt atgtttggat 3600 

atgattttgt ttttaaaata atgtaatgaa ttgtagttta tttttgtgtg tatttttttg 3660 

tatttttttt tttttaatat ttattattgt ttggtatgtt acgtgttttt ttgattttat 3720 

tagttttttt tattttaatt gttatattgt attattagag ggtagtattt attattgttg 3780 

tagaataaag ttttgtattg agtxgatggt ttagaaattt tttttttatt tatttattta 3840 

tttttttttt tgagatagag ttttattttg tcgtttaggt tggagtgtag tggcgcgatt 3900 

ttagtttatt gtaatttttg ttttttggtt ttaagtaatt tttttttttt agtttttaga 3960 

gtagttggga ttatacgcgt ttgttatcgc gtttggttaa tttttgtatt tttagtagag 4020 

atggggtttt aattatgttg gttaggttgg ttttaaattt ttgattttag gtgattttgt 4080 

ttatttaagt tttttaaaat gttggtatta taggtatgag ttatcgtgtt cggtttaaat 4140 
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atttaataaa ataatggacg atgggtgttt tttattgagt tttcggtaat tgtgagtgag 4200 

tagaggattt gttttgggga tatttagtga tttgttgggt gttgttgagt tgtgaggaag 4260 

tttaggtttg gttgtagtgg tgaggttgtg atttaattaa ttattgttga tgtttttagg 4320 

atttgtatta gtttagtttt aggggtaagg attttaattg tttattttag tttttttatt 4380 

tgtaagatgt aaataatagt tatttttgtt ttatgggatg gagttgtgta atgttcgtaa 4440 

tagtgttt 4448 

<210> 269 
<211> 4448 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 269 

aggtattgtt gcgggtatta tatagtttta ttttatgagg taggggtgat tgttatttgt 60 

attttataaa tgaagaaatt gaggtggata gttaaaattt ttgtttttag gattaagttg 120 

gtgtaggttt tgggagtatt agtagtgatt gattgagtta tagttttatt attgtagtta 180 

gatttgaatt tttttatagt ttagtaatat ttagtaggtt attgaatgtt tttagggtaa 240 

gttttttatt tatttataat tatcgggagt ttagtagaag gtatttatcg tttattattt 300 

tattaaatat ttaggtcggg tacggtggtt tatgtttgta atattagtat tttgggaggt 360 

ttaggtgggt aggattattt gaggttagga gtttgagatt agtttggtta atatggttga 420 

aattttattt ttattaaaaa tataaaaatt agttaggcgc ggtggtaggc gcgtgtaatt 480 

ttagttattt tggaggttga ggaaggagaa ttgtttgaaa ttaggaggta gaggttgtag 540 

tgagttgaga tcgcgttatt atattttagt ttgagcgata gagtaagatt ttgttttaaa 600 

aaaaaaaata aataaataaa taaaaagaaa atttttgggt tatcggttta gtgtaggatt 660 

ttgttttata gtagtgataa gtgttgtttt ttgatagtgt agtgtggtag ttggggtggg 720 

agaggttaat gagattaaga aaatacgtag tatgttagat aatagtaagt gttaagggaa 780 

agggggtgta gaaaagtgta tataagaatg agttgtagtt tattatattg ttttagagat 840 

agggttatgt ttaggtatgg tggtttacgt ttataatttt agtattttgg gaggtcgagg 900 

cgggtagatt atttgaggtt aggagtttga gattagtttg tttaatatgg cgaattttat 960 

ttttattaaa aatataaata ttcggtaggt gtggtggtat atttgggagg ttgagatagg 1020 

agaatcgttt gaatttagga ggcggagttt ttagtgagtt gagattattg tattttagtt 1080 

ttcgtaatag agtaagattt tgtttaaaaa aaaaaaaagg gtcgggcgcg gtggtttatg 1140 

tttgtaattt tagtattttg ggaggttgag gcgggcggat tacgaggtta ggatatcgag 1200 

attattttgg ttaatacggt gaaatttcgt ttttattaaa aatataaaaa attaattagg 1260 

cgtggtggcg ggcgtttgta gttttagtta ttcgggaggt tgaggtagga gaatggcgtg 1320 

aatttaggag gcggagtttg tagtgagtag agatcgcgtt attgtatttt agtttgggcg 1380 

atagagttag attttatttt aaaaaaaaaa aaaaaaaaaa agagggttat gttttgtcga 1440 

ttatattgga gtgtagcgga gtattgatgg tttagggaag aatttttttt ttgaagggga 1500 

aatggtgaag gttttgagta gggtttgttt tgtttaggaa tggtcgtaag ttttatagta 1560 

ttagtatttt agagatgggg atagggtggt taaagcgtgt tagtgggttt tttttaagaa 1620 

agagttggat tcgaagtcgt ttggatttta gagttattta tttttcgggg ggattttatt 1680 

tggatggttt gggcgtatag ggagtagtag gtttttggat ttttttggtt tggtttaggt 1740 

ttcgcgttat taaagggatt tgcgcggcgt tttggggtcg gggtttgggg cggtggaggt 1800 

tcgaaggtta gaggtacgtg cggcggcggc gggaggtaga ggttttggcg gaggtttggg 1860 

cgtatttagc gtagcgtgag cgggaggcga gtttagggcg gtgcgcgtag tatgtgcgtt 1920 

agtacgcgta ggtggaaagc gaaggttgtt ttttttcgta ggcgaggttt tacgtgcgtt 1980 

aggcggcgcg aggcgtgcgg gtgttgttaa gtttatttaa attgcgggcg gcgtataggg 2040 

gtattagggc ggttcggagc gaggggcgga atttaagttt ttattcggtc gtcgttgttc 2100 

ggtttttttt tttattcgaa gggcggttac ggaggacggg aagtcgtgta cgacgagtat 2160 

tatttttttg aggaggagaa cggtgatgtt tttgtcgtag tttagcgttt tagtcgcggg 2220 

ttttttttaa ttttattcga gagcggggtt tttattttcg gttttagtcg agaagtgtag 2280 

ttttttgtat ttatttattt taggttttgt ttttagttta ggtatttatt tgttttattt 2340 

tttttttttt ttgtatttta ttcgatttta gcgtgtttat taagagtttt atgttggaat 2400 

ttagtttgga ttattatttc ggttttagtt agtttttttt ttgagtttcg tcgcggagat 2460 

gtttcgggta tcgttttttc gttacgtttt ttttttgcga taggtttcgt ttttattata 2520 

gacgtttcgt ttttgttata gttttggtcg gtttttttta ataggtttgt gtttcgcggg 2580 

gttcgttttt tttgggacgt ttttttttcg gtatagaagg tttttttttt ttattagggt 2640 

tttcgttcgt tggttcgtta ggtttcgttt tttttgcggt ttttattgtg ttgttggatt 2700 

cgacgagtag tgttgatagt gaattttacg tagacgcggt ttcggtatcg ggggtttagg 2760 

tagtagagta ggtagacgtc gaaaaagcgt tttggtttcg tcgcgatttt cgcgggattt 2820 

atcgggtttt ggggttcggg aattatagta gggggttgat tgtcgtgatt tagggtttag 2880 

gggagggttg gatcgttagg gttgtagggt gtttgtttcg gaktaagttt tcggggatta 2940 

ttcggaggta gtttgtagga agtcgtacjjcg teggtacgtg tttttttttt gtttggaggc 3000 
gggtgtagaa gttcgatcgc ggaagttdga ttgttgttta gtcggfcgagc gcgta tta tt _3060 

seite 188 



seq_prot.txt 

tagtatcggt ttcgttcgag ttttattttt tttaggtttt gattggttga tatatggtaa 3120 

gcgcgaaaag tttggatcgt tggaaagttc ggttgtttga gggcggaagt attgcgttga 3180 

cgtacgtttt agtaagggcg gaagtgagtt ttttagcgga agtggttttt gtaaggtagt 3240 

aaggtagcgt ggtcggcgtt cgagttgggg ttgtgttttt gttgggttgt cgttttagtt 3300 

ggattgtcgt tatggaattt agcgtcgaat attttcgcga gaagttgtag cgggatttgg 3360 

aggcggagta tgtggtgagt tgagtcgtcg gataagggga aaagagttcg gggggcgggt 3420 

tcggtattat ttatattttt tcgttcggta ggaggtggag gatacgattt ttaatcgttg 3480 

tttttgtagt tttcgagttt tggtggtgtc ggttaagttc gaggggaaat cgttgtttta 3540 

gagatatagg taaggttttg ggagtatttt gtttagtttt tgtattataa ttttagttat 3600 

tttttttatt tttgtaattt gattttcgta ttttttttgg cgtggtttag ggattttgag 3660 

gatttcgttt tcgtgatttt tgtttcgagt atttcgatgt agattttttt tttaagtacg 3720 

tgtttttacg gtttggggat tataattttg gggatattag ttttttgggt tattttagtt 3780 

tcggggtgtg cgattatcgt tggttttttt ttaattttat ttttcgtttt aggttggtga 3840 

acgcgtgttt agtagaagag ttttcgtata tttatgtttt tgaatagaaa attttgattt 3900 

tagattagtg ggtacgtgag cgatagaaat gagggattgg gatttgtata gttattaaat 3960 

tataaatttg gattagtttt tgtgtcgatg tgtgtgtggg aggggtgttg aggtcgagtt 4020 

taggatttgg tttttttttg ttttgttttc gtttattgtt ttattttgga tacgtttttg 4080 

tttttattta ggtttttatt tttgtttgta aaatgaaagt tttatxgtaa aagttttttt 4140 

tagggtttat cgagaaagtt ttttttttgt aatttcgttt agatatttag gagtttgtat 4200 

tgttattatt tatttatttt ttagagagtt tagggttttt tttttttgtt agtttttaag 4260 

atgtagagat gagaaagtga tcggggtttt agggttttga ttagttgaat aaattttttt 4320 

ttgatattat cgtagtatta gaggtattat atagaatttt atatattttt tgtaggtttt 4380 

tttcggtttt rttaggttta agagggagtg gtgtttaggt aggggtttat taatttttgg 4440 

aatgttga 4448 

<210> 270 
<211> 4001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 270 

gagtttaatg tttgtgattt tttttttttt aaaaattttt gaggtaatat ttatatttag 60 

tgaaatgtat agattttaag tggattattt tattagtttg ataaatgttt gtaaggtgga 120 

atttaagttt ttattagtat ttgttaaatt tatgttagag gttgattaga gaagtatagt 180 

ttttaagatt tttattttgt tttgcggaga atggtaaaaa tttgcgaaat attttagtag 240 

agttaggaga gagtaatatt ggtaacgtga agtacgttaa gttaagaaag gaagtattga 300 

aattaattgt tttgaatgga atttgataag aatttgttat tttatttttt taagattgga 360 

tttaagttgg gagagaatat tgttttttgt tttgtttaag ttaatattat tgtttagatt 420 

tataaatttt tttatttttt aatgtttata tattaatttg tattataagt taaaataata 480 

tagagaattt ggaaaaagaa gaaggggaat gttttttacg aaaagtatat tttttaaagc 540 

gtttggtttg agagttatat agtatgtata aaatttgata ataagtatat tatgatttat 600 

attgtatata ataaaaagta gggtattttt gaagggtttt gagaaaaagt tttgggtttt 660 

taaaatttgt ttaagaaggc gattaacggt atattatttt taaacgtagt attgatttgt 720 

atgttttata gttattaata gtgaaaataa tagtttttaa taagagttgg aaatattgag 780 

ttgagggttt tttttttttt tttaaagttt gaggttaaaa tcgatattta tttttttgaa 840 

ttatgtttat attttttttt tggttaatac gtacgcggtg gtataatttt gaaaattacg 900 

taacgttata gaattattat ttagaaacga attttgttaa taaatttttt gtatttaaat 960 

tttttttatt gtatagtttt tgataatgtt ttttagataa tttttttttt agtaattagt 1020 

atttttaaat aaaaattata gagaatagta agtttttttt ttttttttcg gtagatttga 1080 

tttagaaatt gttatgggaa gaaagtgtta attatattaa aaaatagttt gatagaaagt 1140 

atttaaaaag agaaagggag aatattacgt ttttattttg gtgaattagt aataaagaaa 1200 

aagattagta tggacgggta ttttttaaaa atatattttt ttttttttgg ttttcgttag 1260 

ggtggaggaa gttgtttttt tgttagagat agggtggaag agagtgaaag gataaatgat 1320 

tgagaggttg ttttttttta ttggtgtagg cgtgcggggg tcggatgggg ggtcgcggag 1380 

gggggaggtg gttagtaggt gtttgggttt taggttttta tttttatttt cgtttttatt 1440 

tttatttata tttagggatt ggtttcgttt tttggcgggc gagcggtagg tgtcgaagta 1500 

ttggggtggg gggtgtaaat ttcgcgggta gcggaaaaga ggtcgtgggg ggttttttag 1560 

cgttggtaga tatcgtgagg ttggtagtcg tcggtacgta tatttagttc gtagtttcga 1620 

ggaatatgtt cgtagttagg gcgcggagta gagtttcggg taggagaatt aagggagggc ,1680 

gtgtgttgtg gcggcggcgg tagcggtagc ggagtcgtta gttttttttt ttttgagtga 1740 

gagaatgtta tatttaggaa tatagttaat tagggaagtg aa^gattttt ataatgagaa ,1800 

ttattaaata ttgtt£aaag aaagtataga cgatattaat taatggaaaa atattttatg |1860 

attatggata ggagagagta atattattaa aatggttata ttgtttagag taatttatgg 1920 

atttaatgtt atttttatta aattattaat gatattgttt t atagaattag aagaaattat 1980 
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tttaaaattt atgggtcgtg ttcggggcgt tgttttcgta gtttgcggcg ttttcgttcg 2040 

cgttttcgta gggttttttt tgcgtcgttg tcgtcggggg ttcgggcgtt tgcggttcgg 2100 

cgacggtagt ggttgtagcg ggtacggtcg aaggatcggg aggcggtggt tcggttcgaa 2160 

tcgtcgtgaa gaaagcgtaa ttacgtttcg tttcgcgggt taagaaattg gagaaattcq 2220 

gagtgtattt cgtttgtaag gtacgcgttc gtcgttttcg gatcgcggat gggtgttagg 2280 

ggtttagttc gcgggatttt tttttttttt tcgtttttat tttcgttttt cgttttttgt 2340 

ttttcgtttt tcgtttgggg tcgttgtatc gcggaagtgt ttttgtcgtt cgcgtttaat 2400 

SSSHSfi gttttgttgg gtttgaagaa gggggattat taagacggag 2460 

""^H 9 " tgaaaggagt agtttcggta ttcgagtttt cgggttattt 2520 

HSSS*?" 9 tt 9 tttt 9ta ggagttcgtt ggagttattt tttcgggagt ggagcggtgt 2580 

2a??2JS^ ^agggg^aag gaaaaatttt ttgggggatt gagtggtttt 2640 

ttttgtaatc gatttttttt tgtgttttat tgggaggaat tgggtgtaag gagtggqggt 2700 

ggagaagatt ttcggatttt tggcgttttg ggaaagcgaa ggggaggaaa gcgcggggtg 2760 

ggaaggtggg tagagttcga ggcgagggtt ttgtggtgtt tggcgttttc ggttgggggc 2820 

ggaggtattg tcgcgcgcgg tgatagtttt gttattattg tattattatt tgtttagtta 2880 

arnnnrlfrl SUSSSSS gaggtgtaaa taggatttgg taaagtgata gtttagcgcg 2940 

2S? gg 5 g £5 a 5 tg i" agtt tgaggatatt tattttttgt tatttatttt ttatgttigt 3000 

«HS§^3H t 9 tta i tttt agttgtagat ttagatgtgg aggttgtgtt ttcgaggtgg 3060 

gaggattgtt cgaggtcgcg gggtaggttg gttttgcgtt gttagtagcg gttttgtttt 3120 

gatagggtga tagttgaggg gggcggggat ttggttttgt tagtttttgc ggtttgtqqt 3180 

tgaggtaggg aagagatata tttatatata tttatatttt tttgtatttt ttttgtttqt 3240 

tg i aaagagg 9 ttaa 9tttt tagattttag attttcgatt gttataatta agttgtagtg 3300 

J" 3 "!"? aattttgata taattataat ttattttatt agaatattta ggttatattt 3360 

tttttggtat ttgtaggagt tttttagtta taatggaatg tttatgtttg tttttttttt 3420 

atattatatg tttaggttta tgttaagtat ttttgtattt tatattggtt taataaatat 3480 

StEXnSlE n?l%nT^l H a H" ta ? t gttttttaaa ttttaatgtg tatgggaatt 3540 

ttttggggaa gttggttaaa atgttgatta tgaattaggt ggtttgaggt qqaaqttqaq 3600 

a S" t i3 a i t tttgataaat ttttaagtga tgttatgtta gtttataglt tatattttal 3660 

tttagatttg gataatatat tatttgtttt ttttgatttt agtagaaaag atagaaataa 3720 

^ aa i aa ^ gt tat 9tggtta tttttgaaga aggaatgtaa aaaagatgaa gtggggagtg 3780 

3?nn»££?S aaa9gtg * aa Stttaggagt tgattatgaa gggtlttaga cgaitgagaa 3840 

IfSStlt- 9 a "9tagtag gtagaagggc gagttatgta tagaattaag aataaggtta 3900 

^QQftatagg gtatattgtt tgggtgatga ttatattaaa attttataaa ttattittaa 3960 

agaatttatg taattaaata ttatttgttt tataaaaatt t 4001 

<210> 271 
<211> 4001 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 271 

a 2H3 tt: J tgt ggfataggtg gtgtttgatt atataagttt tttaatggtg atttgtgaga 60 

22ii?*£ a g "attattt aagtagtatg ttttgtattt atgattttgt ttttiatttt 120 

?£ a * a £ ag " ^gtttttttg "5 a 3 ta H§ tcggtgtttt gtttttattc gtttaatgtt 180 

"" a * ga " aa HH tggg "tgtatttt ttttagaagt atatttttta ttttattttt 240 

tttgtatttt ttttttagaa ataattatat agtattatta tttgtttttg ttttttttqt 300 

H aaa H aaa ggg9a £ aaa * lf;ltl a ll gt tta ggtttga attggaatgt ggtttgtgga 360 

H ag J a ^ ag 3 a ttatttggg agtttgttag aaatgtaaat ttttagtttt tattttagit 420 

« a £^? a ~ tt: ataatta 9 ta ttttaattag tttttttagg agatttttat gtatattiaa 480 

gtttgaggag tattgggtta gattttatat ttatttaata aatatttatt gaattagtgt 540 

| gga t a taga ggtgtttaat ataaatttga gtatatggtg tggaaaaaga gtaagtatlg 600 

gtgat J gaga a 9tttttgta agtattaaga agagtataat ttagatgttt 660 

tagtagaata aattatagtt atgttaaaat ttagaagtgg ttattgtagt ttqqttqtqa 720 

taatcgggaa tttggaattt gggaatttga ttttttttat agtaggtagg aaga2a£ag 780 

ggaggtgtga gtgtgtgtga gtgtgttttt tttttgtttt agttataggt cgtaggagtt 840 

gataaggtta agttttcgtt ttttttagtt gttattttgt taaaataaag tcgttgttga 900 

SSSS2S22 « a f5^ gt I 5 cgcggtttc Mgtaatttt tttatttcgg gaatatagt? 960 

tttatatttg gatttgtagt tggaggtgat agggtgttta aattggtatg gagagtgqqt 1020 

o?^ 3 ?^ SSS^i^Ii aaa ttgagta gtagcgttcg tcgcgttggg ttgttattt£ Io80 

£2?S52 S»2«S2S cgagtagttg gg a tagcgtt ttaattaaat aagtaataat 1140 

ataataatga tagggttgtt atcgcgcgcg gtagtgtttt cgtttttagt cgaggqcqtt 1200 

885J22 a ? a H ttC ?^ ttcggatt:t t gtttattttt ttatttcgcg ttttgtttt 1260 

JSSSSIH ttaaagcgtt aggagttcgg gagttttttt tatttttatt ttttgtattt 1320 

agtttttfctt agtgggatat- aggaagggat -cgattgtaga -agggattatt -tagtttttta- 1380 — 
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aaagattttt ttttgttttt gggttattcg ttatatggag agtatcgttt tattttcggg 1440 

aaagtgattt tagcgagttt ttatagagta ataaaatttt aaggtagttc gggagttcgg 1500 

atgtcgaagt tatttttttt aagcggaagg aatgaagggt ttttcgtttt agtgattttt 1560 

tttttttaaa tttagtagga tttttattta agaagaagtt aattgggcgc gggcgataag 1620 

agtattttcg cggtgtagcg gttttaggcg ggaggcgaga ggtaggaggc gggaggcgga 1680 

ggtggaagcg ggagggggag gggggtttcg cgggttgggt ttttagtatt tattcgcggt 1740 

tcgagagcgg cgagcgcgta ttttgtaggc ggagtatatt tcgagttttt ttagtttttt 1800 

ggttcgcgga gcggagcgta gttgcgtttt ttttacggcg attcgggtcg agttatcgtt 1860 

tttcggtttt tcggtcgtgt tcgttgtagt tattgtcgtc gtcggatcgt aggcgttcga 1920 

gttttcggcg gtagcggcgt agggggagtt ttgcgggggc gcgggcggaa gcgtcgtagg 1980 

ttgcgggggt agcgtttcgg gtacggttta tgaattttaa aatagttttt tttagttttg 2040 

tgaataatgt tattggtagt ttaataggaa taatattgaa tttataaatt attttgggta 2100 

gtatggttat tttaatgata ttgttttttt ttatttatga ttatggaatg tttttttatt 2160 

agttgatgtc gtttgtgttt tttttgagta gtgtttagta atttttatta tagagatttt 2220 

ttattttttt ggttagttgt atttttaggt atggtatttt tttatttagg agggagggga 2280 

ttagcggttt cgttgtcgtt gtcgtcgtcg ttatagtata cgtttttttt tggttttttt 2340 

gttcgggatt ttgtttcgcg ttttggttgc ggatatgttt ttcgggattg cggattaggt 2400 

gtgcgtgtcg gcggttgtta gttttacggt gtttgttagc gttgggaggt tttttacggt 2460 

ttttttttcg ttgttcgcgg ggtttgtatt ttttatttta gtgtttcgat atttatcgtt 2520 

cgttcgttag ggggcgaggt taatttttgg gtgtgggtga gggtggggac gagggtgggg 2580 

gtaggggttt ggagtttagg tatttgttgg ttattttttt ttttcgcggt tttttattcg 2640 

attttcgtac gtttgtatta gtgggagggg gtagtttttt aattatttgt ttttttattt 2700 

ttttttattt tgtttttgat agagagataa ttttttttat tttagcggga gttagaggaa 2760 

aaaaaatata tttttgaaaa gtattcgttt atgttaattt ttttttttgt tgttaattta 2820 

ttaaaataaa gacgtgatgt tttttttttt tttttttaaa tattttttgt taaattattt 2880 

tttaatataa ttaatatttt ttttttatag taatttttgg gttaaatttg tcggaggagg 2940 

ggaaaaaaat ttattatttt ttgtaatttt tgtttgaaag tgttggttgt taaaagaaag 3000 

attgtttaaa aagtattgtt aaaaattatg tagtggaaaa agtttgaata taaaaagttt 3060 

gttaatagga ttcgtttttg ggtggtggtt ttataacgtt acgtagtttt taaaattgtg 3120 

ttatcgcgtg cgtattaatt aggaaaaaga tataagtata atttaaaaga ataaatatcg 3180 

attttggttt taaattttaa aaagaaaaaa aaaattttta atttagtgtt tttaattttt 3240 

gttgaaaatt gttgttttta ttgttgatga ttgtggagta tataagttag tgttgcgttt 3300 

gaaagtggtg tatcgttgat cgttttttta agtaggtttt aaaagtttag ggtttttttt 3360 

taggattttt tagaagtgtt ttattttttg ttatatgtaa tgtgggttat aatgtgttta 3420 

ttattagatt ttatgtatgt tgtatagttt ttaggttaga cgttttaagg ggtgtgtttt 3480 

tcgtgggaaa tatttttttt tttttttttt tagatttttt gtgttatttt gatttgtaat 3540 

gtaggttaat atataaatat tgggaggtgg gggagtttgt agatttaaat agtgatatta 3600 

gtttagatag ggtaaaaggt aatatttttt tttaatttga gtttagtttt gaggggatga 3660 

aatggtagat ttttattagg ttttatttag gataattaat tttagtgttt ttttttttag 3720 

tttgacgtgt tttacgttat tagtgttatt tttttttggt tttgttgaga tgtttcgtag 3780 

atttttgtta tttttcgtaa agtagggtga gagttttaaa ggttgtgttt ttttgattaa 3840 

tttttaatat ggatttaata gatattaata agagtttggg ttttattttg taggtatttg 3900 

ttaaattgat ggaatgattt atttgagatt tgtatatttt attggatata aatgttgttt 3960 

taaaaatttt taaaaaaaga aaaattataa gtattgaatt t 4001 

<210> 272 
<211> 4322 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 272 

ggtaagtcgt tttttttttt tttgagtttt agtttgttta ttattaaaat tggtattaat 60 

agtagttttt attttttgag aattgtttag attgtttata tttagtattt agtatatagt 120 

ttggtgttgg gtagttttta ttagcgatag ttatgattat ttattttttt tttttatggt 180 

gaattatacg gagtgggaat aaaggtttgg tggttggatt tggagttttt taagggttag 240 

tttttttttt ggttgtagtt tttggtatag ggtgggttta ttgtttatgt tgagggagag 300 

attgggttgg tttattgtgg atttgaagga ggagttagaa ggaaattcga ttttaagtgg 360 

gtaaaggtta tttgaggttt attttgttta tttttttttt tagtttaatt ttagggaagt 420 

tatggggt&t gatgggtggg gtgggatggg attaggatat agaatatgtt aatgaatatg 480 

gttttttatt tagggaagtg tcgtgtatta taattgggta ttagggagga gggatatttg 540 

aatggatata attagggtag gtagaggttt gggggtgggg tgggaaatgg tatttaggtt 600 

tagagaatag tatgtttaag gtttgggtgt gaagagaaag ttttttttaa aggatagttt 660 

ttgaatttat ttggtttggt tttttttttt tatttatttt tcgtaatttt aattagggtt 720 

tttttatttg aataacgtgg tagttttttt gttttttatt tagttttttt tttgttttgt 780 
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tgtttttttg ttttatttat ttcgtaggag tttttttgag tttgttattt tgtttggttg 840 

taatagtttt tattattttt ttattgtttt gtgagaagat taaagtttaa tggtttgttt 900 

ttgttataag tttttttttt agtttaggtt attttaggga ttgtttttta tttttatttt 960 

ttgtgtttta gttgggaaga gttgtttaga ttttttttat tttttttagg tggtttttta 1020 

ggatttggaa tttttttttt tgttttttgg gaagtgatat ttttttggat gagtataagg 1080 

ttaatgtttt ggttttttga gttttgggga taggggaatg gattttttag aattttaatt 1140 

ggtttttaga ttttgagttt aggggatatt cggtgaatag atgaatgagt gaatgaatga 1200 

atgaaggagt tatggttgat aattaagttt tatgtttgga aaggaggttt aggcgtttgt 1260 

taggagtaat agagaggttt taaaggtggg ggtaggaaag tagaagattg ttaatcgggg 1320 

taggggtaat tgttagttcg agtaggggta gtttttagaa tcgtttgagg agtagtgtag 1380 

cgttcgatta gcgagaattt tcgattttgg agtttaatta gtttagtttt tggggtagag 1440 

tcgggatagc gggtttggcg gcgggtatcg cgttcggtcg cgttcgagtt ttagttaata 1500 

ggaagagtag gagtggggtt aggagaggcg ggttagttgg ttagtgggga gtttcgggcg 1560 

tgggtgtggg gagcggggga attttcgtcg gtttttttta ggttttagcg ggcggaggga 1620 

gtttttttat cgggttttag cgtttttttt tgtaaaatgg gttttttttg agggttggtt 1680 

gcgcgttaat ggcgggagga tgcgaagagg gttaggtagg gtagagggat cgggagattc 1740 

ggtgtttcgc gtaggtagag gcggagggac gttgttttcg attgtagttg gcggggcgag 1800 

gtttggtagg gcgaggtttg gcgggggcgg ggtttggcgg gggcggggtc ggcggtattt 1860 

gggtgaggtc gttcggtttc gttttttcgt attttcgtgt gtcgcggcgt cggagttcga 1920 

ggcggttgta gtttatattt ttcgagcgat tttcggcgtt cgttcgtcgc gcggaggttc 1980 

gggttatatt ttattggtcg tttggtttat tttagttagc gtcgcgtcga atttcgttcg 2040 

cgcgcgtcgg ggagcggcgt tttcgtcgtt gtcgtcgcga tttttggcgt ttgtttttgt 2100 

aacgggaggt aagtgagggt cgggttcggg cgcgggatcg gggttatttc gagggcgcgg 2160 

cgtttttttt tttttttttt cgttttagtt ttcgttttcg aaattgagaa attgagtcgc 2220 

ggttagcgaa agtttcgtcg cgtcgagatt tgtcggtgat ttcgtttttt tgttcgtttg 2280 

tgtgtttttt cgttagatag tgggggttga ggtttttttg tttgttcgtt ttcggttttt 2340 

ttattcgttc gtgtgttttt cgttttttcg aggggttcgt tcgttttttt cgattttttt 2400 

tcgggttttg tgttcgttcg tttgtcgtta tttattttgt gttttttttt tgcgcgtttt 2460 

ttcgagattc gatttttttg tttttcgtta ttcgattcgt tcgttttttg tttttcgcgt 2520 

gttttatcgt tttattttta tttatatttt tattttcgtt cgtcgttttt tcgatttcgt 2580 

tttttcggag tcgcgttttc ggtttacgtt cgtcgttttt tgttagttta gtcgtttttt 2640 

cgttttatcg cggggtttag gttaggtcgt tcgtcgggtt tcgtttttcg tttttgggtt 2700 

tttggaggtt cgattcgcgc ggcgtttttt tttcgttgtt ttttttttcg gttttaagag 2760 

tttttttcgg tttttttatt cgtttttggg ttgggcgtgg ttttagtttt tattattttt 2820 

ttttttattt tattttttcg tttttttgag atttttttta gggtggtttt tgttcgatac 2880 

ggtttgtatt cgtttttaat ttagatgtta ttttcgttgt gggggtttta ttagatgagg 2940 

ggaggaggtc gggggggggg gtttttttgg ttggtggtgg agcggtagaa ggagagtatt 3000 

atcggaattg tttttatttt tttagttttt ttttttttgg ggtaggggag agttttatcg 3060 

agtttcgttt taaatttggt tagaaggatt tgttgtgagt gggcgggggg tttttttttt 3120 

tgacgtttat tttgtatgcg tggaagttgt ttttttgtgt taagttattg gtatttgggt 3180 

gtgtgggtat gttgtgtgtt tgtgttagtt tttgtgttgt ttgttattgt gggatttgtt 3240 

tgtgtatgtg tgtttttttt cgtatgtgtg atttgtttgt gtgtatgtgt gtatgtttgt 3300 

ttgtgttttg aggttttgtg tgtgtttttt gtaattgtgt gtattatttt tggtgtgttt 3360 

gttgttatta gattcgtgtg tgtgtgtgtg tgtgtgtgtg tttttgtttg ttagtgtgtg 3420 

tgtttttgtg tgtttggagg tttcggtgtt gtatattggg tatattatgg tgattaattg 3480 

gttatgagta cgttagattt gttaattgtg gaagtttttg gtgttgggtg tttatttatt 3540 

agttttatgt gtgagtgtat ttgtataagt gtgtgtatgt gtgtgtattt acgtgtggtt 3600 

ttttttttgt tattgagttt ttgtgattag gtttatttgg ttattaaggg aggggattga 3660 

tttttggggt tttttcgttt ttgttgtgga gtgagttggg gtttggtgag taggttagta 3720 

gagggaggtt gtatggtgta tgtgtgatat gtgtggttgt gtgtgtgcgt gtgtgtgtga 3780 

tagagaggtg tggtggtgtt tgggtagttg tatttggttg atgggtggtg tgagggttat 3840 

gcgatgggat agtgttgggg taagtttggg agtgtgtggt tacggtagga gtgtgtgatt 3900 

ggtgggatat agtttatgaa attgtcggtg tatgtttggg gagtgattgt gattgtgggg 3960 

aatcgagagt gtttgggcgt gggtgttagc ggtaggaggg ggtgtgataa ggtgtgtggt 4020 

gatagtgttg ggagggaagg tgagagttta gtgtgggggt gtgtgtgtga gtgtgtggga 4080 

gattatgagg atgtgggtgt gtatatggag agggttagag ggtaagtttt ggggtttgta 4140 

gtgttgtgat taagatttcg ggttttgtat ttaagaatta tttgggtttt aatatttgtt 4200 

tgtgtttgtt aagagtaaga gatttaagtt aggtttttta aggttgagtg tttttatttg 4260 

tagaatggag agagtgagga taaaagtttt tgtgttaaag gttttttaat ataattttgg 4320 

tt: 4322 

* 

<2lb> 273 
<211> 4322 

<212> DNA 4 - 

<213> Artificial Sequence , ! 

<220> ' ' ** " v ' • ...'■.»•;"!. > » • 

<223> chemi cal 1y ' treated genom i c dna (Ho mo sa pi ens;) ^ i _ 
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<400> 273 

ggttaagatt gtgttaaagg atttttaata taggaatttt tgtttttatt ttttttattt 60 

tgtagataag gatatttagt tttgagaggt ttagtttggg ttttttgttt ttaataggta 120 

taagtaagta ttggagttta aatzaattttt gggtatagag ttcgaggttt tggttataat 180 

attataaatt ttaagatttg ttttttagtt ttttttatgt gtatatttat atttttatgg 240 

ttttttatat atttatatat atatttttat attgggtttt tatttttttt tttagtattg 300 

ttattatata ttttgttata tttttttttg tcgttaatat ttacgtttag gtattttcgg 360 

ttttttataa ttatagttat tttttaagta tatatcggta gttttataga ttgtgtttta 420 

ttagttatat atttttatcg tggttatata tttttaggtt tattttagta ttgttttatc 480 

gtatagtttt tatattattt attagttagg tatagttgtt tagatattat tatatttttt 540 

tgttatatat atacgtatat atataattat atatattata tatgtattat atagtttttt 600 

tttgttggtt tgtttattaa gttttagttt attttatagt aggaacgagg aaattttagg 660 

gattagtttt tttttttagt gattaggtgg gtttggttat agaaatttag tggtagaggg 720 

agagttatac gtgagtatat atatatatat atatttatat agatatattt atatatgaga 780 

ttggtgggta ggtatttaat attaggggtt tttataatta atagatttaa cgtatttatg 840 

gttagttggt tattatgata tatttagtgt gtaatatcga aatttttaga tatataggga 900 

tatatatatt agtaggtaag gatatatata tatatatata tatatacgag tttggtgata 960 

ataaatatat taagaatggt atatatagtt gtaaaagata tatatagagt tttaggatat 1020 

aaatagatat atatatatat atataagtaa gttatatatg cgaaaaagga tatatatgta 1080 

taggtaagtt ttatagtgat agatagtata gaagttgata tagatatata atatatttat 1140 

atatttagat attagtgatt tagtataagg agatagtttt tacgtatata gggtaggcgt 1200 

tagagaaggg gatttttcgt ttatttatag tagatttttt tgattaggtt taaggcgggg 1260 

ttcggtgggg tttttttttg ttttaagaga gagggaattg gggaagtggg agtagtttcg 1320 

atggtgtttt ttttttgtcg ttttattatt agttaggagg atztrtttttt tcgatttttt 1380 

ttttttattt ggtggagttt ttataacggg agtggtattt gggttggggg cgagtataga 1440 

tcgtatcggg taggggttat tttgggaggg gttttagagg ggcggagggg tggggtggga 1500 

agggggatgg tggagattgg agttacgttt agtttagaga cggatgaagg gatcgagaaa 1560 

gatttttgag atcgaaagaa gagatagcga ggaggaggcg tcgcgcgggt cgggttttta 1620 

gggatttagg ggcgggaggc gaggttcggc gggcggtttg gtttgggttt cgcggtagag 1680 

cggagaggcg gttggattgg tagggggcgg cgagcgtggg tcgggggcgc ggtttcggga 1740 

ggacgggatc gagggggcgg cgggcggggg taggggtgtg ggtgggggtg gggcgatgga 1800 

gtacgcggag ggtaagggac ggacggatcg ggtggcgagg gatagaagga tcgggtttcg 1860 

gagaggcgcg tagggggaga atatagagtg ggtggcgata gacggacgga tatagggttc 1920 

gggagagagt cgggagggac ggacgggttt ttcggaggag cgggagatat acgggcgggt 1980 

ggaggggtcg ggagcgggta gatagaagga ttttagtttt tattgtttag cggagggata 2040 

tatagacgga tagggggacg gagttatcgg taggtttcgg cgcggcggga ttttcgttga 2100 

tcgcggttta gttttttagt ttcggaaacg ggaattgggg cgggaagggg gaaggaaggg 2160 

cgtcgcgttt tcggggtagt ttcgatttcg cgttcggatt cggtttttat ttatttttcg 2220 

ttgtaggggt aggcgttaag ggtcgcggcg gtagcggcgg gggcgtcgtt tttcggcgcg 2280 

cgcggacggg gttcggcgcg gcgttgattg ggatgggtta agcggttagt gaggtgtggt 2340 

tcgggttttc gcgcggcggg cgggcgtcgg gggtcgttcg ggagatgtgg gttatagtcg 2400 

tttcgggttt cggcgtcgcg gtatacggaa gtgcgagggg gcggggtcga gcgattttat 2460 

ttaggtgtcg tcggtttcgt tttcgttagg tttcgttttc gttaggtttc gttttgttag 2520 

gtttcgtttc gttagttgta atcggggata gcgttttttc gtttttgttt gcgcgggata 2580 

tcgagttttt cggttttttt gttttgtttg gtttttttcg tattttttcg ttattagcgc 2640 

gtagttagtt tttagaaggg gtttatttta tagaggagga cgttgaggtt cggtggggag 2700 

gtttttttcg ttcgttgggg tttgggaggg gtcggcgggg gttttttcgt tttttatatt 2760 

tacgttcggg attttttatt ggttagttgg ttcgtttttt ttggttttat ttttgttttt 2820 

tttgttggtt ggagttcgga cgcggtcggg cgcggtgttc gtcgttaagt tcgttgtttc 2880 

ggttttgttt taaagattga gttggttaga ttttagggtc ggaggttttc gttggtcggg 2940 

cgttgtattg ttttttaggc ggttttggga attgttttta ttcggattgg tagttgtttt 3000 

tatttcgatt ggtagttttt tgtttttttg tttttatttt tgagattttt ttattgtttt 3060 

tggtaggcgt ttgagttttt tttttagata tggaatttgg ttgttagtta taattttttt 3120 

atttatttat ttatttattt atttatttat cgagtgtttt ttaggtttaa agtttaggga 3180 

ttagttgggg ttttagggag tttatttttt tgtttttaga gtttagaaag ttaagatatt 3240 

agttttatat ttatttaaag aaatgttatt ttttaggaga taaaaaagag ggttttaggt 3300 

tttggggaat tatttagaag ggatagagaa agtttagata gtttttttta gttgagatat 3360 

aagggatgga agtgggggat aatttttgag gtggtttgag ttgagaaagg ggtttgtgat 3420 

agagataggt tattggattt tggttttttt atagagtaat gagaaggtgg tgaaagttat 3480 

tataattagg taggatgata ggtttaaggg ggtttttgcg ggatagatgg ggtagaggga 3540 

tagtagggta gaggagaggt tggatggggg atagaagggt tgttacgttg tttaggtggg 3600 

aggattttgg ttgagattgc ggaagatgga tggggagaaa aggttagatt aggtgggttt 3660 

aaggattatt ttttgggaaa gatttttttt ttatatttag gttttgagta tgttgttttt 3720 

tgggtttgaa tgttattttt tattttattt ttaaattttt atttgttttg gttgtgttta 3780 

tttaaatgtt tttttttttt gatgtttagt tgtaatgtac ggtatttttt tQagtgdabg 3840 

attatattta ttagtatgtt ttgtgttttg attttatttt jattttattta ttatgtttta 3900 
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tggttttttt ggggttggat tggaggaaag ggtaaataag gtgagtttta agtgattttt 3960 

gtttatttgg gatcgggttt ttttttggtt ttttttttag gtttatagta agttaattta 4020 

gttttttttt taatataaat aataggttta ttttgtgtta gggattgtag ttagagggag 4080 

gattgatttt tgaggagttt taggtttagt tattagattt ttgtttttat ttcgtgtggt 4140 

ttattatgga agaaggaaat gaataattat ggttatcgtt gataggggtt atttagtatt 4200 

aggttgtgtg ttgggtgttg gatgtgggta gtttgaatag tttttaagag gtgagggttg 4260 

ttgttagtgt tagttttggt gatgaqtaaa ttgaaattta aagaaaaaaa aaacaattta 4320 

tt 4322 

<210> 274 
<211> 8467 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 274 

atgaattaaa tttttttttg gtattaagaa gggatatata gtatttcgag ttttgttttt 60 

gtttttttat taagtttttt ttaatagttg gaagtggttg aagggggttg ttttaaggag 120 

attttttgtt ttttttttga tagttgatta aaagaaaata ggttacgttt tttaaattta 180 

tgtttatttt gtttttaatt attgtttttt taaataaagg taatatttgc gtaattttag 240 

ttagtttgaa ttttcggagg taaatagagc gtaatttgtt ttggaagaat tttgttacgt 300 

ttaaggtttt atgtcgggtg ttggtttttt tgtttttaat tattttttta tttcgattcg 360 

gattgtagaa atttttaatt ttttgttttt tagttttatt tttattttta gatttggggt 420 

tgttaaataa cggaaatttt aggaagacgt aggtgcggtt tttaaagttt tggtttgtga 480 

gcgtttttag gttggtcggg gcgttggttt ttttagaaat aaggtttgaa tatgtaggtg 540 

ttagttagga ttttttgtcg tttgtttatc gtcgtttttt tttcgggatt tgagagaagc 600 

gggagtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgcgcgt gtgtgtgttt cgtgttagga 660 

gaaaggttcg tggttgtcgt tttagttggt ttgtttattt tttgttttta ggagaagttt 720 

gtttatttgg gatattagcg atttttttta tttttatttt atcggtttta tatttttttt 780 

attttttttt aagtcgttga tgtagaaagt atttggttat tttgaaacgg tagttttcgg 840 

aattttagtt ttgttttgta atttagtagg gtttttcgga agttttataa ttgagattta 900 

tgggtttatc gggtaattaa atgatgttat tgtgttaatt ttgtatgtat tattgttgtt 960 

cgcgtcgttc gtgggtcgtc ggtcggtcgt agttcgttgg cgacgagggt attatagttg 1020 

ttttgatcgc gtagattatg tatgtttcgg tttcgggaat ttattatgta ttagaatata 1080 

ttagcgtttg tattttaaaa ggttaaatta ttggttttta gttagggatt ttcggtaagt 1140 

ggtttgttag tgacgagtgt ttgtttatat tggtatatag cggagttttt tgttttcggt 1200 

ttattcgttt ttaggaaagt tttggggtga ggcgaaggcg attgaagtaa tgtttttttt 1260 

tttagatcgt agttgtttag gggggatata gtacggtatt ttttatcgaa tttttttcgt 1320 

tcgttcgtat tttagttttt ttttttttag cgattttttt attttttttt tttttttttt 1380 

ttttgacgtt tgattttagt agtaaaggag gtaaaaaagg tatcgagtcg ttagttaaat 1440 

ttgaaaagtg cggtttcgtt ttttttatag ttattggtag tttttcgtgg aaggttcgtt 1500 

tttcggggta gttgcggttt cggagtggtt gcgtttggcg ttcgtcgggc gtggtttcgt 1560 

tttaggttcg ggagggtagg ttggttgttt cggcgagcgg tagagttttt ttggatagtt 1620 

ttcgtttatt taaatagaag acgtcggcgt cggagcgggt tcggatatgg cgaggttgcg 1680 

agtcggttcg agcggcgggg ttcggtgatt tttttttttt ttttcgtttt tttttttttt 1740 

tcgtacgtac gtttcgttcg tttttatttc gtttttattt cgggcgagtt cgttcgtagt 1800 

tcggggcgta tattcgtacg cgtatttttt tttatttatt ttcgcgttcg tttttatttt 1860 

cgtagtcgag tttcgttacg cgcgttttgt tcgttcgtcg gtcgttttcg tcgttttcgt 1920 

cgttttcggg ttttgatgga ttgaatgaag gttgtttata tcgtttatcg atgttttatt 1980 

aaagatttag aaggttgcgt tatgaattcg gagttgataa tggaaagttt gggtattttg 2040 

tacgggtcgg tcggcggcgg tagtggcggg ggcggcggcg ggggcggcgg gggcggcggc 2100 

gggggttcgg gttatgagta ggagttgttg gttagtttta gtttttatta cgcgggtcgc 2160 

ggcgtcgttg gttcgttgcg gggtttttcg tcgtttttaa tcgcgtatta ggagttgggt 2220 

acggcggtag cggcggtagc ggcggcgtcg cgttcggtta tggttattag tatggtttcg 2280 

attttggacg gcggcgatta tcggttcgag ttttttattt cgttgtatta cgttatgagt 2340 

atgttttgcg attcgttttc gtttggtatg ggtatgagta atatttatat tacgttgata 2400 

tcgttttagt cgttgttatt tatttttatc gtgtttgata agttttatta tttttattcg 2460 

tattattatt cgtattatta ttattattat tattattagc gtttgttcgg taacgttagc 2520 

ggtagtttta tttttatgcg cgacgagcgc gggttttcgg ttatgaataa tttttatagt 2580 

ttttataagg agatgttcgg tatgagttag agtttgtttt cgttggtcgt tacgtcgttg 2640 

ggtaacgggt taggcggttt ttataacgcg tagtagagtt tgtttaatta cggttcgtcg 2700 

ggttacgata aaatgtttag ttttaatttc gacgcgtatt atattgttat gttgattcgc 2760 

ggtgagtaat atttgtttcg cggtttgggt attttatttg cggttatgat gtcgtatttg 2820 

aacggtttgt attattcggg ttatatttag ttttacgggt cggtgttggt atttagtcgc 2880 

gag cggttat- tttcgt tttt atcgggtt eg t aggtggtta cgt egggtta gtt ggaagaa 2940 
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attaatatta aagaggtggt ttagcgtatt atagcggagt tgaagcgtta tagtattttt 3000 

taggcgattt ttgcgtagag ggtgttgtgt cggttttagg ggattttttt cgatttgttt 3060 

cggaatttaa aatcgtggag taaatttaaa tttggtaggg agatttttcg taggatgtgg 3120 

aagtggtttt aggagttcga gttttagcgt atgttcgttt tacgtttggt aggtaaggtc 3180 

ggggttagtt aggggttagg ttgttgggaa gagggtttcg ggttcggtgt ttgtggttta 3240 

agtttgcgcg tcgagttatt ttttttgatt tttttttttt tttttttata tacgtttttt 3300 

tttttttcgt ttttattttt ttttttattt tttttttttt tttcgttttt tttttatttt 3360 

tttttttttt tttttttttt tttttttatt ttttttttgt ttttgagttt ttattgatcg 3420 

attttttttt attttattcg tttttttttt aatgtgttaa tttttgtttt attttcgatt 3480 

tttttaggta ttgggaggcg ggatgggggt gtgcgttttt ttttaggagt tttgtttttt 3540 

taagatttat agaaattagg atttgttttt atttaaaatt ttatgtattt taagtttttt 3600 

ttagataata tattttaatt tttcgggttg attagttttt ttgtgtagag gtagttgaga 3660 

ggttttgttt tgtagaggga aaagagtttt ttattttttt atttattata taggtaaatt 3720 

tatttggtta ttggttgaag gtatagtttt gttttcgcgg ggaatcggcg gttaggatat 3780 

aatagcgttt ttggagttta tttttggttt tggcgttggc gtagggattt tttgatcggg 3840 

tttgaggggt tcgggttagt tttaatgtta ttatttatag cgagggtagg gtgtaaggtt 3900 

gagaaggtta tatttatcgt tttgggagga cgtgggagaa gagattgagg tggaaagcgt 3960 

tttgttttgt ttatcggtcg tttttgtttc ggttttagcg tttgttggga tttgttagga 4020 

tttgtcgggg tttcgggaga ttttgagtat tcgtaggaag aggtgttgag aaattaaaaa 4080 

tttaggttag ttaatgtatt tttgtcgtcg gttgtaggtt tcgtttttgt attaagcggg 4140 

cgttgattgt gcgcgtttgg cgatcgcggg gaggattggc ggttcgcggg aggggacggg 4200 

tagaggcgcg ggttatattg ttttggagtc ggttcggttt tttgtgtttt ttttagcggt 4260 

taagttgcga ggtatagttt tttattgttt taggagtata gaaatttttt gtgtgggcgg 4320 

cgggtgcgcg agttagaggg aaagatgtag tagttattgc gattggtacg tagttgcgcg 4380 

tttttgtgcg tacggatttc gcgcggtgtg cgtggcgatt gcgttgtttt taggagtaag 4440 

ttacgggttt agaggggtaa aatgtttagg tttttcgttg ggaaggatat attatatttt 4500 

atggtaagtt agggtgggcg attttttatg gatcgggtgg aggggggtat tttttaggat 4560 

cggcgggcgg tttaggggaa taattcgtgg tggcgatgat ttgtatagcg cgggttttgg 4620 

gatgcgcgcg gtttcgagtt agtttcgtat agttcgtttt cggagttgcg agtttaggtt 4680 

tttattttcg atttttcggg tttttttcgt atcgttgagt ttagtttgtg gggtgtattc 4740 

gattaacgtt cgatagggtt ggggaatgtg ataggtagta ggtttattcg ggtttgggga 4800 

gggggagttt tcgttttgat agtatttttt tttgtcgttt gttggtggat ttttattttt 4860 

agtcggtaat cgtttcgtag tgttgattta agaaggtaaa gaaaattagg tttttttgta 4920 

aagagttttt tttaaatcgg cggatttcgg atattttgag tggatttaga aatttatgta 4980 

attttttttt tttagtttat tttttatttt tttttatagt tttttttgat ttgttgttgg 5040 

ttcggggtaa gataaagtag ttagtagaga gcgataataa tagcggcggg aaatgaattg 5100 

gagattggtt gatagttttt aatattttgt tatagatttt ttcgaatgtt ttaggttgtt 5160 

tttggtgggt tttagtattc gtcggttttt tgggtatcgg ggattagaag gaatttggta 5220 

gttggtttta ggggtatagt taaaggtagg atgatagtta tttttttgtt tattttagag 5280 

cgttgtcgtt tttttatgtc ggtcgcgtaa agaatatagt ttttaaaaaa tacgtgtttt 5340 

ttgtttatat aggtttgaaa gtgatgagga aagtaatgtt tcgtttatta gcgagtttta 5400 

gtttttaaaa tgattttaag cgttgttgag atgagaaagc gtggtatttc gggggttttt 5460 

agttttattc gcgtttatgg tgtaagtttg tagggatagg ttcgggatag tattgtttac 5520 

gttgttagat ttttcgtaga ggatcgttga agttgttttc gtgggagata gaatgttttt 5580 

tttagcgagt ggaaaaggtt tgttgaggat ttcgttttgt tcgagtattt aaatgtgtgt 5640 

ttgttttatt attttgggtt gaaaagggat aagagtttta gtttttttat ttggttattt 5700 

tattagtaat tataagtgtg ttgagtggtt attattatat aggaggtttt ttagtttggg 5760 

gttagtagat tagttttttt agatattgat gtagaagttg ggattggtaa gtaggtatta 5820 

tgtgttcgga gcgttagggg ataggagtaa atggagaaga aaagcggagg tttttttcgt 5880 

tcggagtatc gatcggaatt ttcgtcggta cgtcgtagag ggttttcgtc gttgggtttc 5940 

gggggtttaa taagtttagt cgtttcgtag gcggttcggt cggattttta gatcggtgtt 6000 

tggaagatat cgtttttgtt tttttttcgt taaatttgtt tttttttttt tttttatagg 6060 

ttataggttt tttttttttt ttattttggt ttcgttttcg ggttttgtta aatagttaag 6120 

taggtcgggg tttagggggt ttagaatgaa gaggtttgat ttggttagcg tcggtaaagt 6180 

ttatttttag gcgaggttat aatagaggta ggtttttttt gtttagtttg tcggtgtagt 6240 

tatagttaag ggtggtattt gaaaggaaaa gggagaaaat ttcggagaaa tttagattgt 6300 

tttaacgtta gattttagag aaattgattt taaatgtacg gattcgttcg gaaagggcgg 6360 

ttaagtggta ggtggttgta atttcgtteg gtcgggtttt cgtagaggtt ttttaagatt 6420 

agtttttgta gggcggtttt tagtaatttg ataagaggcg gttaagataa atttttgcgg 6480 

gttcgagtat atattttcgg gcgttgggtt ttagagattt ttaaattaag tataaataag 6540 

aagggagtga gagaatttag gttagaattt gtacgggtat tttattgagg aaaagcgagg 6600 

tttcggtggt aggtatgttt tttttcgacg ttcgaaaatc gagtcgagcg ttcgattata 6660 

tttattgtag aggttttcgt ttttagtgag ttcggatttt ttagcggttt gttcggagtt 6720 

ggtttttagr tttcgtcgta gttcgacgta cggttttttt ttggtagtaa gtttttagcg 6780 

gttagtttga agttaattrttr gtttaggcgg tcgagggttt ttagttaatt tattatgatg 6840 

tcgtttgggt tatttgatgt tcgtagcggc gggatacggt tcgggtagtg cgtagtggtt 6900 

tttgttaggg gtatcgcgtg cgtgtttgtt tttcgttgcg tcggggacgt ttttgggtga 6960 

tacgggtcgt tgggtatttt ttaagtcgag gaaacggatt tttttcgtag agtttcgcgt 7020 
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ttatttttta attttttatt tcgtttttcg ttgttagggt tttcgattta gtttattttt 7080 

tttggcggtt tagttaggga ttagagttgg agaggttgaa cgtaattcgt gttagtacgg 7140 

aatagacgat atgtttgttt gttagttgtt tggatgaata attgaaaagt tcgttgtagt 7200 

ttgtgtttcg ttaagtttcg ggtgtcggga gaatattttt ttaatacgta ttagggtggg 7260 

cgggagcggg tagaggaggc gggattcgag ggaggagagt gaattcgagt aggagaagta 7320 

gtttaggtag ttaggcgttt tcgatgcgag aggttgggta tttattttta rtttaggttt 7380 

tttattgtgt ggttatgtta tttttttaaa taaatgtgta tatggaggga gatcgatgtt 7440 

gataatgttt agaagattaa aagagtatta atgttggtaa taataacgta aacgtgtgga 7500 

tttagatttt attgatttgg aatttgattc ggcgcgtttt tagtaagttc gacggcgcgt 7560 

tttttttagt agagcgttta ttagcgttac ggtttcgcgg ttttttagcg gtgtcgtttc 7620 

gttagttttg cgcgggtttt ttcgtttgat cgtagttttt ttttcgcgag gttttagttc 7680 

gttttatttt ttcgaggttt tttttttttt tcgcggggtt ttttgttttt tgtatttttt 7740 

ttttcgattt ttgtattatt cgtttttgtg cgtatatatc gttatttgcg ttttcggcga 7800 

ttcgtttggg cggttgggtt cgcgaagtta atgcgttgaa cggtgttcga gtttttttaa 7860 

ttattttgtg tttggtcgtt gttattgggt rttggtgatt aagtttaagt ttgaaaatga 7920 

cgtggttaaa gttgtttgtt aacggtagag tttaattagt cgtagatttt tttttatttt 7980 

tatcggtttt cgtattaaaa aggtttacgc gtagattttt tacgtaggtg tattcgttgg 8040 

ataattaaac gtggttttag taataaaagt ttgagtttta ttttttcgag tagtaggttt 8100 

tgcgttggat tggtcgggtt taatagggag aattttgtgt tttattttgg ttgggatagg 8160 

aagtaagaga agtaaattgg ggaatttttg ttttattttt ttttattttt tggacgtttt 8220 

gggtcgaggt tcgggttatt ggtttatcgc gggtaaggag gtgttttgtg gaatatttgt 8280 

attgattgga aaagaaagaa ttttaaagtt tttttttttc gtttgtgggg gatattatta 8340 

tattttgtta attatttttt tggttaaatg gtggtttgtt tttttagaag gagtattaag 8400 

gtgaatttta tggaaaaaaa ataaaagatt agggatttaa gtgggagtag gatgagaatt 8460 

aattttt 8467 

<210> 275 
<211> 8467 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 275 

aagaattagt ttttatttta tttttattta ggtttttgat tttttatttt ttttttataa 60 

aatttatttt agtgtttttt ttggaaaaat aagttattat ttggttaggg gagtggttgg 120 

tagggtgtgg tggtgttttt tataagcggg aaggaagagt tttaaagttt tttttttttt 180 

agttaatgta ggtgttttat aggatatttt tttattcgcg atgggttagt gattcgggtt 240 

tcggtttagg acgtttaggg ggtgggaagg ggtggggtag gggtttttta gtttgttttt 300 

tttatttttt gttttagtta aggtaggata taagattttt tttgttagat tcgattagtt 360 

tagcgtagag tttgttattc ggagggatga agtttaggtt tttgttatta gggttacgtt 420 

taattgttta acgaatgtat ttgcgtaaaa agtttgcgcg tgagtttttt tagtacgaga 480 

atcgatgagg ataggagggg atttgcggtt gattaagttt tgtcgttagt aggtagtttt 540 

agttacgtta tttttaaatt tgggtttagt tattagggtt tagtggtaac ggttaagtat 600 

aggatagtta ggaagattcg ggtatcgttt agcgtattgg tttcgcggat ttagtcgttt 660 

aggcggatcg tcggaagcgt aagtagcggt gtgtgcgtat aggggcgggt ggtgtagagg 720 

tcgggggagg gggtgtagag ggtagagagt ttcgcgagag gaggagaaaa tttcggggaa 780 

gtaaggcggg ttggggtttc gcgggggagg agttgcggtt agacgggaga attcgcgtag 840 

agttggcgaa gcggtatcgt tggaaaatcg cggggtcgtg gcgttggtga gcgttttgtt 900 

gggaagagcg cgtcgtcggg tttattggaa acgcgtcgga ttaagtttta gattaatgaa 960 

atttgggttt atacgtttac gttattgttg ttagtattaa tgttttttta attttttaaa 1020 

tattattagt atcgattttt ttttatatat atatttgttt gagaaagtga tataattata 1080 

tagtggaaag tttggaataa aaataaatgt ttagtttttc gtatcgaggg cgtttggttg 1140 

tttgggttgt tttttttgtt cgggtttatt tttttttttc gggtttcgtt ttttttgttc 1200 

gttttcgttt attttgatgc gtgttgggaa ggtgtttttt cggtattcgg gatttgacga 1260 

agtatagatt gtagcgaatt ttttaattat ttatttaagt agttagtagg taaatatatc 1320 

gtttgtttcg tattggtacg ggttgcgttt agttttttta gttttgattt ttaattaaat 1380 

cgttaggaga ggtgggttga gtcgggagtt ttagtagcga gaaacgaggt gggaggttgg 1440 

gggfltgggcg cgagattttg cgaagggggt tcgttttttc ggtttgggag gtgtttagcg 1500 

gttcgtgtta tttaaggacg ttttcggcgt agcgggagat aagtacgtac gcggtgtttt 1560 

tagtaggagt tattgcgtat tgttcgggtc gtgtttcgtc gttgcgggta ttaggtggtt 1620 

taggcgatat tatggtgggt tggttaaggg ttttcggtcg tttgaataga attggtttta 1680 

gattggtcgt tgggagtttg ttgttaggag agggtcgtgc gtcggattac ggcggggatt 1740 

ggagattagt ftcgggtagg tcgttggggg attcgggttt attggaggcg gaaatttttg 1800 

tagtaaatgt agtcgggcgt tcggttcgat tttcgggcgt cggaagaaaa tatgtttgtt 1860 

atcgaggttt -egtt-t_t^^^^ 1^20- 
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tttttttttt gtttgtgttt ggtttagagg tttttggggt ttaacgttcg agagtgtgtg 1980 

ttcgaattcg tagaaatttg ttttggtcgt tttttgttag gttgttgaaa atcgttttgt 2040 

aagggttggt tttggggaat ttttgcgaag gttcgatcgg gcggggttgt aattatttgt 2100 

tatttagtcg tttttttcga acgaattcgt gtatttggag ttaatttttt tgaaatttaa 2160 

cgttggggta atttaaattt tttcgaagtt tttttttttt tttttttaag tgttattttt 2220 

ggttgtgatt atatcggtag gttgggtagg aaggatttgt ttttgttgtg atttcgttta 2280 

agggtgagtt ttgtcggcgt tggttaaatt agattttttt attttgagtt ttttaaattt 2340 

cggtttgttt ggttgtttgg taggattcgg gggcggggtt aaaatgagag agaaagggaa 2400 

tttataattt atgagagaaa gagaagaggt aggtttggcg ggaggggggt agggacggtg 2460 

ttttttaggt atcggtttga gagttcggtc gagtcgtttg cggagcggtt gggtttgtta 2520 

aattttcggg gtttaacggc gagggttttt tgcggcgtat cggcggggat ttcgatcgat 2580 

atttcgggcg gagaaagttt tcgttttttt tttttatttg tttttgtttt ttagcgtttc 2640 

gagtatataa tatttattta ttagttttag tttttgtatt agtgtttgaa gagattgatt 2700 

tattgatttt aaattgaaaa gttttttatg taataataat tatttaatat atttgtagtt 2760 

gttgataaaa tggttagata aaaaggttaa agtttttgtt tttttttaat ttagggtaat 2820 

aaaatagata tatatttgaa tgttcgagta aagcggggtt tttagtaggt tttttttatt 2880 

cgttggagga ggtattttgt tttttacgaa ggtagtttta gcgatttttt gcggaaaatt 2940 

tagtagcgtg ggtagtgttg tttcgggttt gtttttgtag atttgtatta tgggcgcggg 3000 

tggggttgag gattttcggg atgttacgtt tttttatttt agtaacgttt gggattattt 3060 

taaaagttga aattcgttaa taggcgaagt attatttttt ttattatttt tagatttata 3120 

tgggtagaag gtacgtgttt tttaaaagtt gtgttttttg cgcgatcggt atgagggggc 3180 

ggtagcgttt tgagatgagt aggagaatag ttgttatttt gtttttaatt gtatttttaa 3240 

gattagttgt taagtttttt ttggttttcg gtgtttaaga agtcggcggg tattaaaatt 3300 

tattagagat agtttgggat attcgggggg atttatgata aaatgttaag aattgttagt 3360 

tagtttttag tttatttttc gtcgttatta ttatcgtttt ttattggttg ttttattttg 3420 

tttcgaatta atagtaaatt agagaaaatt gtaggagagg atgaaaaata aattaaagga 3480 

gaaagattat ataaattttt aaatttattt aaagtattcg gagttcgtcg atttggggga 3540 

ggttttttgt agggaaattt agtttttttt atttttttag attaatattg cggggcgatt 3600 

atcgattggg aataggaatt tattagtaga cggtaaagga aaatgttgtt aaagcggaaa 3660 

tttttttttt ttaagttcgg gtgaatttgt tgtttgttat attttttagt tttgtcgggc 3720 

gttggtcgag tgtattttat aagttgggtt tagcggtgcg aggaaagttc gggggatcgg 3780 

gggtggaaat ttgagttcgt agtttcggaa gcgagttgtg cgaggttggt tcggaatcgc 3840 

gcgtatttta agattcgcgt tatgtaaatt atcgttatta cgaattgttt ttttagatcg 3900 

ttcgtcgatt ttgaaagata ttttttttta ttcgatttat gggaagtcgt ttattttggt 3960 

ttgttatagg gtatagtgtg ttttttttag cgggggattt ggatattttg tttttttggg 4020 

ttcgtggttt gtttttaggg gtagcgtagt cgttacgtat atcgcgcggg gttcgtgcgt 4080 

ataaaagcgc gtaattgcgt gttagtcgta gtaattattg tatttttttt tttagttcgc 4140 

gtattcgtcg tttatatagg aggtttttgt gtttttagaa taatagaggg ttgtatttcg 4200 

tagtttggtc gttagaggag gtataaagag tcgagtcggt tttagaataa tgtaattcgc 4260 

gtttttattc gttttttttc gcgggtcgtt agtttttttc gcggtcgtta ggcgcgtata 4320 

attagcgttc gtttaatgta aaggcggagt ttgtagtcgg cggtagggat gtattaatta 4380 

atttgaattt ttaatttttt agtatttttt tttgcgagtg tttagggttt ttcggagttt 4440 

cggtaaattt tggtaaattt tagtaaacgt tgggatcggg gtaaggacgg tcggtgagta 4500 

aggtaaagcg ttttttattt tagttttttt ttttacgttt ttttaaagcg gtgaatgtga 4560 

tttttttaat tttatatttt gttttcgttg taggtagtga tattggagtt ggttcgagtt 4620 

ttttaagttc ggttagaaag tttttgcgtt aacgttaagg ttagagatgg gttttaggag 4680 

cgttgttgta ttttggtcgt cggtttttcg cgggggtaag gttgtgtttt tagttaatga 4740 

ttaaataagt ttgtttatat ggtgggtggg agagtagaga gttttttttt ttttgtagag 4800 

taaagttttt taattgtttt tgtataggga gattagttag ttcggaaaat tgaaatgtgt 4860 

tgtttaaaag agatttgaag tgtatggggt tttgaataag ggtaggtttt ggtttttgtg 4920 

ggttttggaa agatagggtt tttagaggaa aacgtatatt tttatttcgt tttttagtat 4980 

ttgggaagat cggaaatagg gtaaaggttg gtatattgag gggagggcga atgaaatggg 5040 

ggggggtcgg ttaatgaaag tttagggata aggagagagt aagaaagaaa aagaaaaggg 5100 

agaagggaaa gtaggggaag agcggaagag aaagagaaaa tggaagaaga aataaaaacg 5160 

agaagaaaga ggacgtgtat aggaaagaga aggaaagaat taagagaagt gattcggcgc 5220 

gtagatttgg gttataagta tcggattcgg agtttttttt ttagtagttt ggtttttggt 5280 

tagtttcggt tttatttgtt aggcgtaagg cggatatgcg ttggaattcg ggtttttgaa 5340 

gttattttta tattttgcgg aaggtttttt tgttagattt gagtttattt tacggttttg 5400 

gatttcggag taggtcggag agagtttttt gagatcggta tagtattttt tgcgtaaaga 5460 

tcgtttgggg gatattgtag cgttttagtt tcgttgtgat gcgttgggtt atttttttgg 5520 

tgttgatttt ttttagttgg ttcgacgtgg ttatttgcga gttcgatgag gacgagggtg 5580 

gtegttcgcg attgggtgtt agtatcggtt cgtgagattg agtgtggttc gggtggtgta 5640 

ggtcgtttag gtgcgatatt atggtcgtag gtggggtgtt taggtcgcgg gataggtgtt 5700 

gtttatcgcg ggttagtatg gtagtgtggt gegcgtegaa gttggggttg agtattttgt 5760 

cgtggttcgg cggatcgtag ttgggtagat tttgtt£jcgc gttgtggagg tcgtttagtt 5820 

cgttgtttag cggcgtggcg gttagcgggg ataggttttg grttatgtcg ggtatttttt 5880 

tgtagggatt gtagaggttg tttatggtcg ggagttcgcg ttcgtcgcgt atgagggtga 5940 

agttgtcgtt gacgttgtdrg gataggcgtt ggtggtggtg gtggt^gtgg tggtgcggat 6000 
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ggtggtgcgg gtgagggtgg tggaatttgt tagatacggt ggagatgggt ggtagcggtt 6060 

ggagcggtgt tagcgtggtg taggtgttgt ttatgtttat gttaggcgga gacgagtcgt 6120 

aggatatgtt tatggcgtgg tgtagcggga tggagagttc gggtcggtag tcgtcgtcgt 6180 

ttaggatcga ggttatgttg gtgattatgg tcgagcgcga cgtcgtcgtt gtcgtcgttg 6240 

tcgtcgtgtt tagtttttgg tgcgcggttg gaggcggcgg agggtttcgt agcgagttag 6300 

cggcgtcgcg gttcgcgtgg tgggggttgg ggttggttag tagtttttgt ttatggttcg 6360 

ggttttcgtc gtcgttttcg tcgttttcgt cgtcgttttc gttattgtcg tcgtcggtcg 6420 

gttcgtgtaa agtgtttaga ttttttattg ttagtttcgg gtttatggcg tagtttttta 6480 

ggtttttggt gaggtatcga taggcggtgt aggtagtttt tatttagttt attagggttc 6540 

gggggcggcg ggggcggcgg gggcggtcgg cgggcgggta aggcgcgcgt ggcggggttc 6600 

ggttgcggga gtgggggcgg gcgcgggagt gagtggagag gagtgcgcgt gcgggtgtgc 6660 

gtttcgggtt gcgggcgggt tcgttcgggg tgggggcggg gtgggggcgg acggggcgtg 6720 

cgtgcgggag aggggagggg acggggaggg agggagggat tatcgggttt cgtcgttcgg 6780 

gtcggttcgt agtttcgtta tgttcgagtt cgtttcggcg tcgacgtttt ttgtttgggt 6840 

gagcgggagt tgtttagaag ggttttgtcg ttcgtcgggg tagttaattt attttttcgg 6900 

atttggggcg gggttacgtt cgacgggcgt taagcgtaat tatttcgagg tcgtagttgt 6960 

ttcgggaggc gggtttttta cgggaagtta ttagtggttg tggagggggc ggggtcgtat 7020 

tttttaggtt tggttgacgg ttcggtgttt tttttatttt ttttgttatt ggaattaaac 7080 

gttaaggaga gggagggagg agaaggtggg ggggtcgttg gggagaggga ggttggagtg 7140 

cgagcgagcg agagagattc ggtgaaagat gtcgtgttgt gttttttttg agtagttgcg 7200 

atttggggga aggggtattg ttttaatcgt tttcgtttta ttttaaagtt tttttggagg 7260 

cgagtaagtc gggagtaaaa gatttcgtta tgtgttagta tagataaata ttcgttatta 7320 

ataagttatt tatcgagagt ttttagttgg gagttaatag tttagttttt taaaatgtag 7380 

gcgttaatgt attttaatgt atggtaaatt ttcgaggtcg ggatatgtat aatttacgcg 7440 

gttagagtaa ttgtagtgtt ttcgtcgtta gcgagttgcg gtcggtcggc ggtttacggg 7500 

cggcgcgggt agtagtaatg tatgtaaagt taatataata atattattta attattcggt 7560 

gagtttataa attttagtta tgaggttttx gaagagtttt gttagattgt aaaataaaat 7620 

tagaatttcg ggagttgtcg ttttaaggtg attaagtgtt ttttgtatta gcgatttgga 7680 

aagaagtggg gagagtgtag aatcggtgag gtggaggtga ggagggtcgt tggtgtttta 7740 

agtgagtaaa ttttttttgg gggtagagag taagtaaatt agttgagacg gtaattacgg 7800 

attttttttt taatacgaga tatatatacg cgtatatata tatatatata tatatatata 7860 

tattttcgtt ttttttaaat ttcgggagag aaacggcggt gggtagacga taaggaattt 7920 

tggttgatat ttgtatattt aaattttgtt tttgaaaaga ttagcgtttc ggttagtttg 7980 

agaacgttta taagttaagg ttttaaaaat cgtatttacg tttttttaga attttcgttg 8040 

tttggtaatt ttaggtttgg aggtgggagt ggaattgggg gatagagagt tggagatttt 8100 

tgtagttcga gtcggggtgg aggggtagtt gaagatagag aggttaatat tcggtataaa 8160 

gttttaaacg taataagatt tttttaaagt aggttgcgtt ttgtttgttt tcgaagattt 8220 

aagttagttg aggttgcgta gatgttgttt ttgtttaaag agatagtaat taaggatagg 8280 

gtggatataa gtttgagaag cgtgatttat ttttttttga ttagttgtta aaagaagagt 8340 

agggggtttt tttaagatag ttttttttaa ttatttttag ttgttagaga aaatttgata 8400 

agaaaataag ggtagagttc ggaatattgt atgttttttt ttggtgttag agaaaggttt 8460 

gatttat 84 67 

<210> 276 
<211> 6456 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 276 

taaaattttt taaaattgaa aagaagaaag gggtaattgg agaattttta ttttttttgg 60 

ttgttttttt taagtcgttt agtttttatg aatagtattt agttttattg ttattattaa 120 

taatttttaa aattagttaa tgtttcggtt tttagtattg gaaagttttt taaataggat 180 

attggaaatt tttatttata agtttggggt ggcgggcggg gcggggaggt ggagagagag 240 

ttgttattta taggttttta ttttggtttg aagatttaat tgtagttatt agagtaaggg 300 

aatgtttatt ttttggtatt tgttcgttat tttttttttt tttagagata aatatttttt 360 

cgtttttttt aaaaaagtat atattttaaa gtaagaatgt gattttattt tttttttttg 420 

agtttatgtt tgttattttt aggaatagcg tgtggattag ggttagatga attttaattt 480 

gggttgtaga tttacgaggt tttgttttag tgttaaaggt ttttggtagt aaatagtgag 540 

taaaatagat atttgttttt tgatggattt tgtcgttttt tttttttttt tttaagttat 600 

ttattaaaat tatatatatt ttgtaaagaa aaagggaaat tggtagtttt tgtagaggaa 660 

gtcggtggta tcgtttagag ttataaattg tatttttaaa tagttttttt tttggttttt 720 

ttttttttgt tttatttttt ttaaaattta gattgtaaaa aatatattta ttgatattta 780 

ttttatttta aaaaaagaag agaaaaagta aagcgttata agattttttt tttggaaatt 840 

ataaa ttgaa aaaaaaat tt ataaaagatt aaat tttggc gggttgtggg g tgg cggggg _ _ 9>00_ 
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tcggcgggga 
tagtttggga 
ttatttttaa 
ttttttttat 
taaaatttat 
ataaaggtaa 

ggggtggttt 

agtcgatttt 
ttttttattt 
ttatttttgt 
gtcgggtttt 
aggaacgcgg 
tttgaagttt 
atgggaaaaa 
gagttacgta 
gtttttttag 
aggtgtagtg 
ttttttcggg 
gaaagaataa 
tagggaatat 
agatacgata 
gagttcgttt 
agatcgaggt 
ttttcggagt 
agggagggaa 
gagggaaaaa 
agaaagagcg 
attagttcga 
tttttttttt 



gggggcgcgg 

tttttagtat 

attttagtta 

ttttttaaaa 

tattgtaata 

cgtaatttat 

ttagtttttg 

tgaggtttta 

ttattttttc 

aaaattattt 

tttagttacg 

gttggggttt 

ggcgtttatt 

ataaaataaa 

gcgtttaaaa 

gttgttttgt 

cgtatttttt 

atgagtaggg 

tttttaaagt 

taaaatattc 

ttttatttta 

gtattttttt 

cgatggaaga 

tatttagaag 

gcggaggtta 

atatagtcgt 

agagcgcgag 

attatttttt 



taaaagtttt 
aataatttat 
tttgatagtt 
ggagttaaag 
tgagtagcgg 
gtagcggcgg 
tattattatt 
agtttcggat 
ttttattgtt 
atatttcgag 

aggggtttga 

attattacgt 

ttggaatttt 

tttcggggtt 

tgtataaaag 

gttagagagg 

gaatattagg 

tatttttatt 

tggtttggga 

tgttttttgt 

tgtacgtgtt 

atattaatat 

tttttttttt 

agtgtgtttt 

tttttttagg 

tgttgtaaga 

tttttataaa 

agatttttgt 

ttaagagatt 

tjgtatttaga 

ttgaagttga 

tttttttttt 

tgggttttag 

tttttttttt 

taaaagaagg 

tttggattag 

ttaiagcgatt 

ttaattttta 

tataatggag 



tttttttttt 
tttaaatcgg 
attttatggt 
agaataaata 
tatttggtaa 
cggcggtagc 
tagtaatagt 
tttaagaatt 
tttattattg 
tttggttaat 
gtttttataa 
aggtggtttt 
gggttttttt 

gagaattaat 
tggggttaat 
ggagcgagag 
cgagtaagag 
tattacgaga 
taggataggg 
tttttggttt 
aagtttgtgg 
tttttttttt 
tattgaaaga 
tttttttttt 
ttttgtattt 
atagtaagtg 
aacgtgtgtg 
attttgtgag 
tttcgaggtt 
agtcggaggg 
atagattgtt 
gttggggggt 
ttttattatt 
tttgtttgtt 
ttatttattt 
gtgtgagggg 
aagtagtttt 
tattaaatta 
ataaagatat 
gagtgattag 



agtggagatt 

ttggtttggg 

ttaatataat 

aatcggtttg 

ttagttttat 

agtttttaag 

ttgttttttc 

attttattta 

gaggagagtt 

ataaagattt 

gttataaagt 

tgttcgtttt 

acgtttaggt 

atttaggaaa 

tataaatatc 

cgaggttttt 

ttttatttat 

agagcgcgcg 

gtttaatatt 

ggtttttatt 

tttggattta 

tttttaaatt 

gagttagcgg 

gataggggaa 

ggagcgacgg 

acgaatttag 

cgcgcgtttt 

aagtattgtt 

attttttttt 

tcgatttatt 

gggagagacg 

aatatatata 



gagcggaaaa 
ggtattagcg 
aataattatt 
ttagttttaa 
ttacgtggta 
taaaagtttt 
gtagagcgtt 
tttttttcgg 
tttgtgacgt 
ttgttattcg 
tttttttttt 
gtttgaatta 
aaaaataaaa 
aaaatatttg 
gtttttgttg 
ttgtattttg 
tgttacgtaa 
ttatttaatt 
agtttaaatt 
tttggtgtaa 
tagagaaatg 
gtgggtttaa 

tgggggggag 

ttaaatattt 
cgttttaagg 
aattttaaag 
ataattataa 
ggggtggggg 
ttttttttaa 
tagagttgtt 
ttgaataagg 
aaaataatat 
taaaggttag 
tagatgaatt 
gtttaaaata 
agttgtgaak 



seq_prot. 
ggttttttga 
gttatttatt 
tatcgtagaa 
gttgtgtttt 
ttttcgttag 
atatttatta 
gttttttttt 
tttttttttc 
ataggttgta 
ttttttcgcg 
gttttttttt 
ttttttcgtt 
tcgtagtttt 
gggaggtaat 
gtagcggtta 
tgagtttttt 
tttaagaagt 
gaggttttcg 
ttcgttttta 
aggaagatta 
tgattaaaaa 
tcggttcggt 
ggcgagcgag 
gggaaggaag 
agtaaggaaa 
agagattata 
tttcgcggtt 
tcgttttttt 
taaaaattaa 
taaagataat 
tgggattaat 
tttatattaa 



txt 
ggtggttagg 
tgtttaattg 
gggaatataa 
tgttttttat 
ggttttaata 
cggttattat 
ttcgttcgtt 
gggtcgtcgt 
aatttaatta 
ttcgcgttcg 
ttttttgagt 
taaggtaagg 
ttttttatag 
agttattggg 
gaaatttcgt 
cgtatattga 
tttgtttcgt 
attttttcga 
agttttttaa 
cggttttgaa 
aataaaaata 



aaaaagaatt 

gtaatagcgg 

tggtgttcgg 

gaattaatag 

ggggaggatt 

taaagttata 

tcggatttgg 

gtttaagacg 

cggcgttttc 

gtgatataag 

ttatttataa 

ttgggaaaag 

taaaataaat 

ttaagtagtt 

ttgttttggg 

atggtttatg 

attataggat 

ttttaagcgt 

tttggttata 

tttttttttt 

tttaaaggat 

tttgttattg 

ggtgggggtg 

gtttgtgttt 

tcgttgtaaa 

gtttgtcgtg 

tgtattatac 

gggggtagag 

aggtaattgt 
agttttttta 
gagatttttg 
tttagatttt 
gttgttttga 
tttaaagttt 
atttatttat 
gattgttgtt 
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tcgaaggtag 
cgggtagttt 
aagaggcgag 
gtagtttgta 
agtcgtacgt 
tggggttaga 
tgtttcggtc 
aataatataa 
aataataata 
tttcggtatt 
tagatatata 
aaaaggtaaa 
cggcggcggt 
ttttttttag 
agtttaattt 
tggtgttagt 
agattttttt 
aggtttcggt 
atggtatggt 
gttgtagtaa 
agggggagtg 
atttagtaga 
tatgttatat 
attatagttt 
ttcggtgggt 
tttttttttt 
gatttttgtt 
cggtatcggt 
tttaaagatg 
tgtaataatt 
tttttttgga 
ttgttttggt 
ttgatttttg 
gcggggtggt 
tgtttttttt 
aaaatttttt 
ttagtattat 
gtagtatagt 
aagaaggaga 
agtgtagaat 
tttaaatgtt 
ttttttttat 
ttaagaatta 
gttttggtta 
ttataaaatt 
tttaatttaa 
tgtt^ggata 



ggttttgtga 
ttaggatttt 
tatagagggt 

gggggagttt 

atacggtatt 
attcggtagc 
ttcggagttt 
cgtcgcgttt 
atttcgtaat 
tttttttcgc 
tttgtatata 
atttcgggaa 
agtttgtatt 
agttaatata 
tatttttttc 
aaggtatcgt 
tattagtttt 
ttataattaa 
aatatagggg 
aggaaatagt 
aaaatttaaa 



ggaatttaaa 
ttttttttgt 
gaaaaagagg 
agcgagaaaa 
aagtagtagt 
tagtttttag 
gttttttaat 
gggagggtgg 
ataaatatat 
tattttaata 
tagaaaattt 
ataatgatta 
agtagtagta 
aaattttttg 
tcggaatttg 
ttttcgagat 
attggtcggt 
tcgaggttcg 
ttgtttcgtt 
agttcggttt 
cgttttgttt 
tcgagttaaa 
atatagtagg 
tttttaatta 
ttatttgttt 
tttggtgtgg 
tttgattttt 
tggatttttt 
tattatttta 
ttagttttta 
tttttgttga 
ttggtttgtt 
ggaaattttt 
atgtgtgtgt 
taaggttttg 
tagtttgtgt 
agatattgtt 
ttttttagtt 
aatttttttt 
tattttattg 

ggttcgggtt 
gttttttttt 
gtttaaaatg 
tgagagtttt 
tcgttatttg 
aatttg^gtt 
tttcigaaa^t 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

^980 



seq_prot.txt 

tttttttatt gatagtttcg taagaagttt tatttattat cgaaaatgtt gttggtaatg 5040 

tattttttta ttttataata tatatatatt tttaaaggtt gtattttagg ttttaaaatt 5100 

tagaaaggtg taaagaggtt gattattttt tataagggtt ggagtttagg aaatatgcgt 5160 

taaagtgttt gtttttgtga aatttgaata ttttaatatt tattgataga tggattattg 5220 

tagtttttgt agaaaagttt gttttgtgaa cggtttttga aaaatatata taggtttaat 5280 

tttgttattg ttgttgaaat gtacgtttta tttttaaaga tatatgatat tgttaagttt 5340 

agatttattt tgttagaagg tagattattt gtttttttat ttgagttttt ttttttttta 5400 

tttttaaaga aattaaaatg aatttcgttg gatgtaaaat gatatttcga ttttgtggaa 5460 

gggatttttt ttgtgtgttt atatgtgtaa tttttttttt tatgagtaat tagtatttta 5520 

ggatttttta gacgtattta tatggtttgg aatgtgaatt gtatggtttt gttttattat 5580 

tgggttttta aaagattatt tttatgattt ttggttattt ttttttttgt taaaattgtt 5640 

atttagtttt atttatcgtt ttttttaatt tgaaatatag tttgtttttt ttgtgttttt 5700 

tttgttattt tatttttagt tttatggggg cgttgtagag taaagggaaa tttgtttttt 5760 

aaggtagtag tagaattagt tagagtgggg gttgggaagg aatttatttt ttttatattt 5820 

tagttttttt taattttttg tagtgagttt ttagagggag aatttattag ggttggttta 5880 

tatataatta attttattat tagggatttt aaattttttt agtatattat ttttgggatt 5940 

aagtgttaga gtgttatatt tttttattgt tttttgttaa attaatgtgt ttatattgaa 6000 

agaaattgat tagaatagaa gattagttag aaagaaagag aaaattatta ttgaaaattt 6060 

tttaaagata ggtatttatt tttcggtgag ttttttttat tttgaattga atgaatggtg 6120 

aggaaaattg tgtattcgtt ttttaatttt tttttaattt ttaggaaagt aattttgtta 6180 

aatattatat tagttgaatt ttttttagaa tgttattttt ttttttattt ttgatttttt 6240 

tttagtattt agttggtatt attatgtggg ttttttttat tttcgtagtg tcgtgttata 6300 

ggattaaaga aagggttgat ttgaacgtat tttgattttt ttaaattatt tttaagaagt 6360 

tttaaagagt gattttatgt ttggggtcgt atgaataatt tttgtagatt tcggggtatt 6420 

atattttaat tagttttatg atttaatggt tattgg 6456 

<210> 277 
<211> 6456 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 277 

ttagtaatta ttaaattatg ggattagttg aaatgtagtg tttcgaagtt tgtaggaatt 60 

atttatacga ttttagatat ggaattattt tttagagttt tttaaggatg atttaaaaga 120 

attagaatac gtttaagtta gttttttttt taattttgta atacggtatt gcgggagtga 180 

gggaggttta tatagtgatg ttaattggat attgaggaga ggttaagaat gaaagaagaa 240 

atgatatttt ggaagaaatt taattggtat aatatttgat aaagttattt ttttaggaat 300 

tgaaaagaga ttgagaggcg ggtgtataat tttttttatt atttatttag tttaaagtaa 360 

aagagattta tcgaaaagta agtgtttatt tttagaaaat ttttaataat gatttttttt 420 

ttttttttaa ttggtttttt gttttggtta atttttttta gtgtaaatat attgatttgg 480 

taaaaagtag taggaaaatg tggtattttg gtatttggtt ttagaaataa tatgttggga 540 

agatttgagg tttttggtga tgaggttaat tatatatgaa ttagttttgg tgggtttttt 600 

ttttaggggt ttattgtaga gaattaaaga gggttgaggt atgagaaggg tgaatttttt 660 

tttagttttt attttggttg gttttattat tgttttagag agtagatttt tttttgtttt 720 

gtagcgtttt tatggggtta agagtggagt ggtaaaggga atataggagg gataagttgt 780 

gttttaggtt gaggggggcg gtggatgagg ttgaatggta gttttgataa agaaaaaagt 840 

gattaaaaat tataaaaata attttttgag ggtttaatag taaggtagag ttatataatt 900 

tatattttaa attatataga tacgtttgag aaattttaaa gtgttaattg tttataaaag 960 

aaaaaattat atatataaat atataaagaa aatttttttt ataaaatcgg ggtgttattt 1020 

tgtatttagc gggatttatt ttaatttttt tgaaaatgag aaggaagggg atttaaatga 1080 

aaaagtagat agtttgtttt ttggtagaat aaatttgaat ttgataatat tatgtgtttt 1140 

tgggggtaaa acgtatattt taataatagt gataggatta ggtttatgta tattttttaa 1200 

aaatcgttta taagataggt ttttttgtag aggttgtagt aatttatttg ttaataagta 1260 

ttaaaatatt tagattttat agggatagat attttaacgt atatttttta agttttagtt 1320 

-tttgtggaaa ataattaatt tttttgtatt tttttgggtt ttaaaattta aaatatagtt 1380 

tttaaaaatg tgtgtgtgtt gtggggtagg ggggtgtatt gttaataata ttttcggtga 1440 

tagatggaat tttttacggg attgttaatg aaagagattt tttaaatatt ttagtaaata 1500 

gtaatttttt atagttttga ttattttttt attataaatt taaattttgg gttgagatag 1560 

gtagattatt ttagatatat ttttattaga aattaataag tgacgagatt ttgtggaagt 1620 

tttaagaatt tatttgtaat ttaataagtc gtttgaagga tttttatagt taaggtttag 1680 

aatagtttga tttttgaaag ttgtttttgg tttaaatatt ttgggttaat ttttgaggaa 1740 

tttgaaatat tatttttttt ttatattttt tttttaagag agagatataa aagaaataag 1800 

agtttttttt atttagggat gagtaggagg ggaaaaaatt cgaattaata tttaaataag 1860 

gaaattagta gttttgaata aataaattag gatttataat gaaatgattt tgtattgtaa 1920 
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ttgtttttaa aaagaaagta atagagaaaa agagaaggaa agaatttttt tttttttttt 1980 

attttttttt tattttattt tttaatttag ttttaaagtt aagaagattg tgttgcgtgt 2040 

agtgtattgt agttgtggta gtttgtttta aatataggta gtatttgtga tattggtacg 2100 

gtaggrtttt agaatttttt tcggttgatt tttraaatat agattgaaga gattttttta 2160 

taacgatttt gaaacgagtt tcgaaaataa aaattttaag attttaagag aaaataaaat 2220 

ataaataggt atttggttta tagaattttg tagaaaatat atatatatta tttcgttatt 2280 

tttatttttt tttttatata tacgtttttt gtaataagaa attttttaag agttaataat 2340 

aatagattaa atttattatt tgttgrtttg gaaagaaata aattaaatta aaataaattt 2400 

tttgaatatt tttttgaagt gtaggagaga tatattttag taaaagttta agggggaaaa 2460 

agaaaattgt attaaaggaa aaaaaaaaaa aaaaagtggg ggttgggatt gttatatatg 2520 

gttaaaaatt taagtttttt ttaatagtat tagtattgaa ataatatatt tttaaaacgt 2580 

ttgagggatt agatagggaa agaaaaggta cgtataaaaa aatttaatcg atgtcgattt 2640 

tgtgatttac gtaatattat aaatttgtaa aaggtaaaaa attagaagta aaaatttata 2700 

aattattaaa atatagaaat taaaaatttt aagttattat attagaaaga aaaaaattta 2760 

gaataatagt aaaaattttt gttttaaata aaaataaagt aaatgaattt atcgaaaatt 2820 

gtttggtaaa tatttttttc gtggtgttta atattttagt tggaaagagt tgtgatgttt 2880 

attttatttt attttttttt tattcgtttt tttaatttta ttatatatat aatatatttt 2940 

ttrtagtggt ttaaattttt cgtttttttt tgtgtattta gttcgatttg ttgagtttat 3000 

gggtaagaaa gaaggaatta gttttagatt tcgggaaagt aaagcgtatt ttttttttta 3060 

tgttatcgaa tagtaaatta gtttttagaa ttttagaggt cgagttttgt tatagcgaag 3120 

gcgtcgacgt tatagaggag gagtttacgt gatggtggcg gagtaggtta tattatcgtt 3180 

ttgggttcgg ggagggagag ttatttttag gtttttcgag tttcgaatcg gaatttttaa 3240 

attcgagacg ttttgtttat gaggatttcg aaatatgtcg gttagtgaaa aaattttgtg 3300 

gttttgaggg tttttggttg gttaggggta gtaaaaattt cggagagttg atattaagtt 3360 

tttttttgtt acgtagtagt ggtaaagttc gaagtttaaa tttcgagaat tgagttttgt 3420 

tgatttttag aattggggtt tttagaagtg gtgatgtaag aagtttttag gaaaggtcgg 3480 

atattaggtg attattgttg ttgttgtcgt cgttgttgtt gttattgtcg tcgtcgtcgt 3540 

tgttgtcgtt ggtgtcgttg tcgtcgtcgt tgtttatgat tattatttta ttttttaatt 3600 

tttttttttt tcgtttttgt taaatgtttt ggttttaagt tttttatgtg tatttattga 3660 

tataaatgta tatatttatt tattttagtt gttaggtgtt aaaataaatg tcgaagatta 3720 

gttttacgtt ttttttatta taggatatag attgttatgt atttattatt attattgttg 3780 

trtttgagtg aatcggtcgg tttggggagg tttttgttat ttttttttgt gttgttttgg 3840 

tttttggaaa ggaggtggag gagaggaagg aggggaatta gggggcggtc ggagtagaga 3900 

ggacgagata gtgtttgggg ggtgattcgg gttagtttgg gggttgtttg gttttagatc 3960 

gcggagagga cgcgcgttcg cgttttcgtt ttttttgttg ttgtttgcgt acggtttgtg 4020 

atttttttgg atitcgtgcgg ttgtgttttt tttttttttt ttcgtttgta aattgttttt 4080 

tttgtttcgt cgtttttggt tttcgttttt tttttttttt ttttttttcg tttttttttt 4140 

tttttttttt tttgtttttt tgggtaattt cgggaggtaa aaagggaggt tgttcgttcg 4200 

ttcgtttcgt tggttttttt ttatcggttt cggtttttta gattttttgt tttcgagtcg 4260 

aatcgagatt tggaaggaaa aaatgtaagc gaattttttg ggtttttgtt tttgtttttt 4320 

tggttataaa tttagatgag atgaagtatc gtgtttatta ttttttttta gagtcgtgat 4380 

ttttttaatg agagtcgagt gttttggtgt tttttgtttt tgtgttttgg ggagtttggg 4440 

ggcggggatg ttgaatattt tgaaaattat ttttttttgg ttgtagtcga gggagtcggg 4500 

aattttcgcg cgtttttttt gtttatttcg aggagagaga ttgatggcgg gatagggttt 4560 

tttggggtgg gtgggaaagg ggtgcgtatt gtatttgcga tgttttttag tgtgcggggg 4620 

gatttaggga atttcgatag gatagtttgg gagaacgaga aaggtggcgg gatttttggt 4680 

cgttgcggtg tttgtatttt gggcgttgcg tgattttgtg ttggttttta atggttgttg 4740 

tttttttttt ttaaatttta ttttattttt ttttatggtg taagttttgt aaaaagggaa 4800 

ttgcgggttt gagcgtagtg gacgttaggt tttagatttt cgaggttttt attttgggcg 4860 

aggagaaaga cgaatagagt tttagttcgc gttttttatg tgtaagattt aggaggagag 4920 

aagggtattt tgtggtcgtg gttgagggga ttcggcgcgg gaggagcggg cgcgggcgcg 4980 

aaagggagat ttttgtgagt gattttgtaa aaatagattg cgaggttggt tggatttgta 5040 

atttgtggtt tttttcgagg gagtaagaat gggggaaggc gcggcggcgg cggttcgggg 5100 

agggagtggg tagagttgga gttttagaaa tcggttgagt ttcgggggcg ggcggggaga 5160 

aagggcgggg gggtagtagg agttaggggt tatttcgttg tcggatgtag tgatcgtggt 5220 

aaatgttttg agaattgtgg gttgcgttgt ttttatgatg tcgtgttatt ggaattttgg 5280 

cgaaaaatgg aattagtgtt gtaataatga gttttaaagt ttttttatgg aaaataaaaa 5340 

tataattaaa tcgatttttt aaaaaaatga aaaaagattt tttgtgttgt gttttttttt 5400 

acgatggttg tgttggtggt tggggtttgg gggtgggggg ttttgatagt tgagtagata 5460 

aatgatttta aattaggtgt tgaaagtttt aagttgttat agggtttttg attattttag 5520 

agagttaatt tttatttcgc gttttttttt cgtcggtttt cgttatttta taatrtcgtta 5580 

ggatttaatt ttttatggat ttttttttta gtttatagtt tttaggagga aagttttgta 5640 

acgttttatt tttttttttt tttttttaaa gtaaagtgag tgttagtaga tgtgtttttt 5700 1 

atagtttgga ttttaaaaaa agtggggtag gagagaggga attagggaaa gggttgttta 5766 

gaaatalragt ttgtggtttt gagcgatgtt atcggttttt tttatagaga ttgttagttt 5820 

tttttttttt ttgtaaggtg tgtgtggttt taataaataa tttaaaaaaa aaaagagggg 5 880 

gcggtaagat ttattaggag ataggtattt attttgttta ttatttatta ttaagagttt 5940 

ttggtattag aatagaattt cgtaaatttg tagtttaagt tgaagtttat ttggttttag 6000 
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tttatacgtt atttttggag gtagtaaata tgagtttaaa gaagagagat gaaattatat 6060 

ttttgtttta aaatatatat ttttttaaaa aagacgaaaa gatatttgtt tttgggggag 6120 

gaggaaatgg cgaataagta ttaggagatg ggtatttttt tattttaatg attgtagttg 6180 

agtttttagg ttaaaataga aatttgtaga tggtaatttt ttttttattt tttcgtttcg 6240 

ttcgttattt taagtttata aatagaagtt tttagtattt tatttggaag attttttaat 6300 

attagaggtc gaagtattgg ttagttttaa gggttgttga tagtgatagt aaggttagat 6360 

gttgtttata aaaattgaac aatttgaaaa aaataattaa aaaaaataaa aattttttaa 6420 

ttgttttttt ttttttttta gttttgaagg gtttta 6456 

<210> 278 
<211> 4001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 278 

tgtatttttt ttttttttta gtttttatta atttttgaaa aaaaattttt ttaattagaa 60 

attttgtgag attgtttttt ttttagtttt gattttaatg tttaagaaag atggtattat 120 

tttttttgat tatgaaaatt tagattttta ttatttttta gaaatgtgat gagtgtgaag 180 

attattttga atatttttaa gttattagtt tagttttttt tttatggatt atagtattga 240 

ttttgttaat tgataaaaat aattagaaaa tgttttaacg atatggtata aattatgaat 300 

attagaattt attagttttt tttttataat ttttttttat taatatgtaa ttgtaaaaga 360 

agaataagaa gggaatgggg gtaatggatt taggtttaaa atatatattt tatttttgtt 420 

tttttttatg gttttttttg gtatgtatgt tgtatatttt taatattttt aagatattaa 480 

aaatttatta atttaagtat atttaattta gatttagttt tagtttagat ataatgtttg 540 

atttttaaga gtttaaaata taattaaaaa tgtaaatatt tgaaattagt gttatattga 600 

gagtttttta aaatttaaat attaagatat atattttttt tattaagatt tgatttttaa 660 

tatatatttt ttaaggtaat taaagggaat atttattttt gtttgaggaa aaaatatttt 720 

gtaagttaaa atttttatta atatagtatt ttattatatg attttttttt tttattttaa 780 

aataatgggg tttaattttt tgaagcgtag tttttttagt aataaaggcg gataaatttg 840 

tttttttttg taagtagata ttttttaaag gtaagaaagg gaagattttg ttagtttagt 900 

ttattatagt taatgttttt tttattaagt gtttaattgt ttcgaaagtt agatgtttta 960 

gaaagttaga tgttttatag ttgttttggt ggtaattatt ttaagatagc ggtattaaag 1020 

gttttgattt ttagtttttt tatttaaaaa atggggatat taattatttt ataataaagt 1080 

tttgatgata taatgagatt ataaataaaa tattttgtga tgtttgattt attatttaat 1140 

taaaattttt ataatggatt tatatttttg gtttagttat tatatatttt gttaagtggt 1200 

ttttttttaa gaaaaattgg ggatgggggg gttagggggg tgggagagta gtagggagta 1260 

ggaggtaggg aaggaggaaa ataagtaaaa ttagtatata tataagtttt tttatttttg 1320 

attgtttgcg ttttttgtgt tttttttttt taagtttttt taggtttatg ttttgaattt 1380 

ttttttttat aaatgaaata taacggatac gaattataga gaattttacg gaagatattg 1440 

gagtttaagt ttgtagatta gttttttgtt tataagcgga ggcgattgga aaattaagtt 1500 

aaataagcgt nggataattt taaatagtta aagaagtttt aatgggatag ggtttaggtt 1560 

tattaaagga ggtaatatta gtaattgacg aaattatagc gacggttgat cgattatgat 1620 

tcgaggtata agacgttaaa tttaaggtta ttaattaacg tgcgtaagaa gcggttttaa 1680 

tcgattcggg cgatgttaat cgcgtaatta ttggtcgtcg cgttatttta ggtttaggaa 1740 

tagtttagtt cgtaataggg agattttaat tagggagagg aggagcgggg taaaggggtt 1800 

gggtgttttt taagtcgttg cgcgggacgt ggaagggacg aaagaggtga ggaagagtag 1860 

taaattaatt taatatattt atatttttgt tgtgttttta ggattacgag aagcgggtag 1920 

ggggcgattt tcggtttttt ttatttcgta aaggttttta attgggtttt tcggttaatt 1980 

ttttagttat cgttattttg aaatttcgcg ttttttcgta ttttttacgt tattttttat 2040 

ttttttgttt cgtttttttt tagcgtcgtt tgattttgga attatagaga atatttaaga 2100 

attttaggat tagagttttt ttcggtttta tataatattt aattggttgg ttgtcggtta 2160 

agagttcgtt ttattgcgtt tatttagtat aataggaatt agaagaagat agttgttaat 2220 

tgagtggttg acggttttgt ttttcgagtt ggaggtattt tgatttagtt gttgggagtt 2280 

tggtacgttg agtcgttcga gaacggtata ttttgtagat aaagaagcgt tttgcgtttt 2340 

ttgttttttg gaattatttt agtttttgta tttttttgtt ttttatgatt taagtaggtg 2400 

cgatttttta ggggtgtttt cgtgtgtaaa cgaatattta gataatattt gttattgagt 2460 

ttaattcgtt ttgtaagtat taggttattt tatgtgagcg tttagtacgt gtggaaattt 2520 

aatagatagg tagaatagaa cgtttgtaga ttatttgaat aaataaatta gtattcgtat 2580 

tagagtcgta gattgtttat atgggcggtg atttattatt tgttagtagt gttttgaaga 2640 

agaaatacgg ggatatatat gtagtcggtt ttaagttata tgagtggtaa ggttgatata 2700 

tattggttat attttcgartt ggttttttta ttagttaaaa gttttttatg atgttatttt 2760 

acgtacggga ttttttattt aattttagtt ttcgtttttt ttttattttt tatttttttt 2820 

tttcgtgagt tagtttttat tjtattttttt taaaggtaag tttaaatatt taaaaaatta 2880 

ttttagagga tacgtttggt tetttttaat tttaatttga tarattttatg taggaaggtt 2940 
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atttttatta ttagttataa atattttaaa atattgatgg agatatatat taatttttag 3000 

tagaattttt aaattgtttt tgaaggtata agatcggttt tatatttaaa tttatttttg 3060 

tgttaggaaa tttttgattt gattttagat tgaatttgag tgttagaaag taatgacgag 3120 

atattgtagg ttaaagttga taaattaggt agtatataaa aaagaaaata attttgaatt 3180 

gcgaaggtga gttaagttgt aatgttttgt aattagtttt tttttttttt taatatatta 3240 

gtgattatag ttttgtaatt cgttttgaaa aataatgtaa gaagagaaag agaaaatagt 3300 

gtatgttgga aatattaaag ttttttatgt agttaaaatt ttatatgtag ttattatttzt 3360 

taagatatgt tgagtagttt ttattttaat atattttttg tatttggatg ggagaattaa 3420 

ttataataaa tatagtttgg tattaatagt atattaataa ttttgttagt atagattaaa 3480 

agagttttta tgggaattat tggttggtaa aggatttaga aaataaatat atttgtagag 3540 

atttagcgga attattaata ggtttttata gtttttggcg ttttttaaaa ttaggtaatt 3600 

agtgaaatgg tttatttaat gcgttatgta gtaaaggtag ttgttttttt tttgttagtg 3660 

ttgaaaatga aattatattt ttttttttga aagtttataa taaattatga ttttttttta 3720 

aatgataaat gttttgagta gtgatttttt attttgtatt ttaaatgtaa tatgtgtagt 3780 

tgtgttttgt ttttrttgttt tttaaatgtt ttagtgtttg aatatttatt tgagcgatat 3840 

ttaattatta tagttggtta gggttgattg tagtgattga aataggattt tgatgttttt 3900 

agttttgatt tttattattt ttaaatttat tcggtttata ttgtgtaatt agtagacgtt 3960 

tatagattag ttttttgttt taattgaagt atattgttat a 4001 

<210> 279 
<211> 4001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 279 

tgtaataatg tgttttaatt aggataggaa gttgatttgt aaacgtttat tgattatata 60 

atgtaaatcg gatggatttg aaaatggtaa aaattaaaat taaaaatatt aaggttttgt 120 

tttagttatt gtaattaatt ttgattaatt gtgataatta aatatcgttt aagtgaatgt 180 

ttaaatattg aaatatttag aagataagaa agtaaaatat agttatatat attgtattta 240 

ggatgtagaa tagaaggtta ttatttaaag tatttgttat ttaaaagaaa gttataattt 300 

attataagtt tttagaaaaa agaatataat tttattttta atattagtaa aaaggaggta 360 

gttgtttttg ttatatgacg tattaggtgg gttattttat taattatttg attttagaaa 420 

acgttaaagg ttatggaagt ttattaatgg tttcgttgaa tttttgtaga tgtgtttgtt 480 

ttttaaattt tttgttaatt aataattttt atgaggattt ttttagttta tattgataaa 540 

attattaata tgttgttgat gttagattgt gtttgttata gttgattttt ttatttaagt 600 

ataagaaatg tattgaaata aaaattgttt agtatatttt aaaaatagta gttatatata 660 

aaattttagt tatatgaaag attttaatat ttttaatata tattgttttt tttttttttt 720 

tttgtattgt tttttaaaac gagttgtaaa attatgatta ttaatgtgtt aggaaagaaa 780 

aaaaattagt tgtaaaatat tataatttag tttattttcg tagtttaaga ttattttttt 840 

ttttatgtat tatttggttt attagtttta atttataata tttcgttatt gttttttgat 900 

atttaaattt agtttaaaat taaattaaga attttttaat ataaaggtaa gtttaggtat 960 

aggatcgatt ttatattttt aagggtagtt taaagatttt gttaaaaatt agtatgtatt 1020 

tttattaata ttttgaaata tttataattg gtggtaagaa tagttttttt atatgggata 1080 

gttaggttaa gattagaaat aattaagcgt attttttaag atagtttttt aaatgtttaa 1140 

atttattttt aagggaaata agtgggggtt gatttacgga aagaggggat agaaggtagg 1200 

gaaagggcga aggttagagt taagtgaaga atttcgtacg tagagtgata ttatagagga 1260 

tttttggttg gtgaaaagat tagtcgggaa tatagttaat atatattaat tttgttattt 1320 

atataattta aaatcgattg tatatgtatt ttcgtatttt ttttttaaaa tattattaat 1380 

aaataataaa ttatcgttta tataagtaat ttgcgatttt aatacgggtg ttaatttatt 1440 

tgtttaggtg gtttataaac gttttgtttt gtttgtttat taaattttta tacgtattag 1500 

gcgtttatat gggataattt gatgtttata aagcggatta aatttagtgg taaatgttat 1560 

ttgggtattc gtttatatac ggagatattt ttaaaggatc gtatttgttt gggttataag 1620 

aaatagaaaa atgtaaaaat taaggtaatt ttaaaagata gaaaacgtag aacgtttttt 1680 

tatttgtaga atgtatcgtt ttcggacggt ttaacgtatt aggtttttag taattaagtt 1740 

aagatgtttt taattcgaga gataggatcg ttagttattt agttagtagt tatttttttt 1800 

taatttttat tgtattagat gagcgtaatg aggcgggttt ttagtcgata attaattaat 1860 

tgagtattgt atggaatcgg aagagatttt ggttttggaa tttttgggta ttttttatgg 1920 

ttttagggtt aagcggcgtt aagagaaggc gggatagagg aatgggaaat gacgtgagga 1980 

gtgcggaggg gcgcgaggtt ttaagatggc ggtagttgag gggttgatcg agagatttag 2040 

ttgaaggttt ttacgaagtg aaagaggtcg ggagtcgttt tttattcgtt tttcgtagtt 2100 

ttgggagtat agtagaagtg tgagtgtatt gaattgattt attatttttt tttatttttt 2160 

tcgttttttt tacgtttcgc gtagcggttt gggggatatt tagttttttt gtttcgtttt 2220 

tttttttttt ggttgaggtt tttttgttgc gjgattgggtt atttttgggt ttggggtggc 2280 

gcggcgatta gtggttgcgc ggttggtatc g^tcgggtcg gttggggtcg tttttt^cgt 2340 
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acgttgatta 

cgttatggtt 

tgaagttttt 

tttcgtttgt 

ttttgtaatt 

gaggaaattt 

atatgtattg 

atttttttaa 

ggtagttagg 

attataggat 

gatgttttta 

aagtaattgt 

ggtagttgga 

ttttttttta 

tattggaaag 

ttatgtaata 

taaaaataag 

aagaaaaata 

aaatatttgt 

agaattgaat 

gagaatgtat 

ttttgagttt 

aataaaggga 

tcgttaaagt 

gaaaaaagtt 

ttgaagaatg 

atattaaagt 

tttttagaga 



gtagttttag 

tcgttagttg 

ttggttgttt 

agataaaaaa 

cgtattcgtt 

gagaaaggaa 

attttgtttg 

tttttttatt 

ttaggaatat 

attttatttg 

ttttttagat 

tattagaata 

tatttggtga 

tttttaaaaa 

attgcgtttt 

gggtattgtg 

tatttttttt 

tgtgttttaa 

atttttagtt 

ttgaattaaa 

agtatatatg 

agatttatta 

ggttatagaa 

attttttagt 

aaattgatga 

atagagattt 

taaaattaag 

ttggtaggga 



gtttgacgtt 

ttggtgttgt 

agagttattt 

ttaatttgta 

gtgttttatt 

agtataaaaa 

tttttttttt 

tttagttttt 

ggatttatta 

taattttatt 

gaggaagtta 

gttgtgaaat 

aagagatatt 

gtgtttattt 

aaaaaattaa 

ttaataaaag 

aattatttta 

tatttaagtt 

gtgttttgaa 

tatgtttaaa 

ttaagaaaaa 

tttttatttt 

gaaggattaa 

tgtttttatt 

tttgaggata 

aaatttttat 

gaaagaatag 

ttagaaaaag 



seq^prot . 
ttgtatttcg 
tttttttggt 
aacgtttatt 
gatttaagtt 
tgtgggggag 
acgtaggtaa 
tttttatttt 
tttagaagga 
taaagatttt 
gtattattag 
aaaattagag 
atttggtttt 
aattgtaata 
gtaaaaggaa 
gttttattat 
ttttagttta 
agaaatatat 
ttggggagtt 
tttttgagag 
ttaatgagtt 
ttatggagag 
ttttttattt 
tgaattttga 
agttgataaa 
tttaaagtgg 
ggttaagaaa 
ttttatagaa 
gaagaaatat 



txt 
agttatggtc 
gaatttggat 
tggtttgatt 
ttagtgtttt 
gagatttaga 
ttaaagatgg 
ttattttttg 
aattatttga 
agttaagtag 
agttttgtta 
tttttgatat 
ttgaaatatt 
aattgggttg 
gtaaatttgt 
tttggaataa 
taaaatattt 
attgggagtt 
tttagtatag 
ttaagtatta 
tttagtgttt 
aggtaaaggt 
tttttttata 
tgtttatggt 
gttagtgttg 
tttttatatt 
ggtgatgtta 
tttttaatta 
a 



ggttagtcgt 

tttattttat 

ttttagtcgt 

tcgtaaaatt 

atatgagttt 

gagagtttgt 

ttgttttttt 

tagggtgtat 

tgagttaaat 

taaagtaatt 

cgttgttttg 

tggttttcgg 

ataaaatttt 

tcgtttttat 

gaggaagaag 

tttttttaaa 

aggttttagt 

tattaatttt 

tatttgaatt 

tagaagtatt 

gaggtatgtg 

attatatgtt 

ttgtgttata 

tgatttataa 

tattatattt 

ttttttttgg 

aaggaatttt 



<210> 280 
<211> 4341 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA CHomo sapiens) 
<400> 280 
ttatggaagt 

gtggggtgtt 

gaatgggagt 
tttttttatt 
aatagaattt 
taaagaaagt 
tttagataaa 
ttatattgat 
gtgataagtt 
ttaaaatttt 
tgtgcgttta 
ttatttgggt 
ggtcgtttta 
ggtgggtagg 
ttttttaagg 
attataatgg 
tattttaggt 
gatgcgatgg 
tattattatt 
aggttagatg 
ggaagatggt 
aggggttttg 
ttagtgagtt 
cgacgggggg 

tattagggag 
tttagtaggt 
gtgttgtagg 
atagggtttg 
aggcgaagcg 



ttttagcggg 
aatcgttttt 
ttttaaaggg 
tttgtagtaa 
ttaattcggg 
agttagttta 
gatagtaaat 
tagttatttt 
tttgagataa 
agtttatgtg 
tgtgaagaga 
gtaggcgggt 
taggatttgg 
ggtgaatttt 
gtggggaaga 
tggaatgtta 
tatttggatg 
tttggtttga 
ttatttattt 
agttagattt 
atgggtgtgt 
tagttattgg 
ttgggaggga 
aaggatatgt 
ttttcggagt 
ttgggttcgg 
tgcggggtta 
tgagttttat 
agagcggagg 
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ggtttgtaag 
aaagggaaat 
aggattattt 
aattttagtt 
ttaatagtag 
tatgtcgttt 
tatttaattc 
ttaagttgga 
gtttgtattt 
ttatttttaa 
ttattaaata 
tgagttcgaa 
gaaggtaatg 
ataaagtata 
ttataaagta 
ttagttaagg 
tatatttgta 
taattattat 
tattttattt 
tagggaggtt 
ttggttaggg 
tatataatgt 
attgaagatt 
ttaggtttaa 
attttgattt 
gtCcgagaat 
ggagttaggt 
tttttcgc^g 
tcgtatttta 
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ggattgtgag ttgagttaag 
gttatttttt atttatagtt 
atgggtttgt ttgatttagt 
aaggaagata aagagatttt 
gtttatgttt aaaatggttg 
gagatgggaa aaataaggta 
gtgataattt tgaggaaatt 
ttagggttgg aggttggggt 
gtaatttgtt ataatgttgg 
atagtttata ttaaatgtta 
ggttttgtgt gagtaatatg 
aagagagtta gcgaagggag 
gaaaattata gttaaagggg 
tttttaaggg tggggagaat 
tattgattag ttagggtggg 
ttgtttttat tttttttgtg 
agttataggg gatgcgatgg 
ttatgttatg tttattattt 
tttttttata tattcgtttt 
tagaagtggg taaggggaaa 
gtgggagtgt tggacggagt 
ttgggagttt ttgttggtgt 
tttaattatt aatgtatttg 
ggatacgtgt aggtttggat 
ttagacgggt ttgatgaaac 
ttgcgttttt cgcgagtttt 
ttcgtttttg cgttcggagt 
cgcggggcgg ggttgggcgc 
gtattgcgta gggatcggtg 
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gagaatggag 

agttagtcga 

tttttttaat 

tggagattaa 

ttaattttaa 

ggatataggt 

ggtaatttag 

ttcgaggtag 

gaggggttgt 

attgttttta 

gttgtgtatt 

ataggggtgg 

gttgtttttt 

tataaataat 

gtaggaataa 

gatttttagt 

tttggtttgg 

taagttttat 

tattttggag 

cgggaaagga 

tcgggataag 

tgggattatt 

tttttaaaat 

gatttcgggt 

gagtatttga 

cgcgaggtaa 

cgtttttagt 

ggggtgaaag 

agtgtcgttt 



2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3 540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4001 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



seq_prot.txt 

"Oggggtag cgtttagtaa tcgcgttagg agcgcggaga agggtattgg gagagcggcg 1800 

ttcgtggcgg agattagcgt ttcggagtac gggtacgacg ggggtatttt ttcggttgtt 1860 

agtaattaat aataataata attataatta tagtaagggc gttgatgggc gggttcggag 1920 

tacgtttgat tttggttttt attaggttgt ttaggttttt gatgacgtat tagaaatatt 1980 

tttttaattx gcggtttttt tgtaggagag gttgggaagg ggtgggggac ggggttcggg 2040 

ggaggttttc gagggatttt agtaagcggg gaagggcgtc gggaaagttt tagatttacg 2100 

gttgcgcggg ttacgagttt attcgaacgt cgattattgt ttttcgtcga tttttatttt 2160 

ttgggaacgc gcgaaagtaa atttaagtta gattgcggag gtcgttgggg agggaaggtt 2220 

taaggagttt tcgtcgattt tgttgaataa agggggtttc gagttgggtc gagatggggt 2280 

atgcgcggga agatttttgt tcgttgtttt tttttatcgt tttagtggat gttatgtttg 2340 

gggttttttc ggcgcgtggg gttgacgtat tttcggggtt tatcgtagtt ggtcgggatc 2400 

gtggagtggg tgcggtggac gaagggaggt aggatagttt cgggggtggt agaaggagtt 2460 

cgggtatagt tgagatttgc gtttttattt tattaatatt tatagtaggt gttgtcgagt 2520 

tgggtaattg ggatggttta agttatttgg ttaaatttta aattacgttt gttattggga 2580 

agtagagttt agtgatgtta atcgcgtttt tatttttatt atcggtgtta gttttaaagg 2640 

gtttttaaaa tggttgtgtt attttttagt tttggatcgt agttgtcggt taggaatttt 2700 

agtgttacgg tggatacgtt tttttcgcgt ttttgtcgtt tatttgttta tttagtttag 2760 

gggttttggt aggtagtagt gtatttggtt taaagggtaa gatgtttttt tttttattta 2820 

taataaattt aaatattagt agggtttggg gggaaaaacg tttttagaag aaaaggtgaa 2880 

tgttagtttt gtaagagtta gttttaaaat tagattgaat tggtatatgt atatttatgt 2940 

aataaatttg tacgttttgt atatgtattt tagaatttaa aagtttataa aaaaagaaaa 3000 

aattagattg gattatgttg ggaaagtgta gtttttttta ttttaggtat tttttagaac 3060 

gtaggtagta ggtggttttt attaggagtt ttgggagagg aagggggttg aattttattt 3120 

tttatttttg tttttttatg gggtttgagt tgaggaagtt ttattataag gagagaattt 3180 

tttgataatt ttggatgtta tttttatttt tattgtagga attttgtggt tatattgcga 3240 

ggagatcggt tttgggtcgg aggttatagg aagattttta ttttttgaaa tttggagtga 3300 

agaacgtcgt tatttagtta ttattttaag gtaaggtaga aatgaagtgg gtcgttgggt 3360 

tttttttttt tttttttttt tttttttgag ataaggtttt attttgtcgt ttaggttgga 3420 

gtgtagtggc gttattttag tttattgtaa ttttcgtttt ttaggtttaa gcgattttgg 3480 

tgttttagtt ttttgagtag ttgggattat aggtatatat tattatattt ggttaatttg 3540 

tatgtgttta gtagagatag tattttatta tattggttag gttggtttta aatttttggt 3600 

tttaaatgat ttattcgttt tggtttttta aagtattggg attataggta cgagttattg 3660 

tatttagtta ggtttatttt agttgttatg ttaattaggt tttttgtatt tgtgcgttaa 3720 

tttttatttt tttaaaaggt tttagggtga tttagtaggt aatgagtgat ttttaaattt 3780 

aggatttatt gtgagagatt tatatatata attgagtaga tatttatagt ataatgatta 3840 

aagggagtga tagggtaagg atttatagtg gaggttttgg aggttagttt attgatagtt 3900 

attttaggga gtttagaagt ttcgttttag tgttgggtgg tggagggaaa tttgtttttt 3960 

tagggatttc gttttcggtt gtttagttgt taaagttagg aataagtttt tagaaatttt 4020 

attgttaaga tttcgaaaac gttttagata ttgttagttt tttgtcgttt ttgcgatttt 4080 

ttataggtgt gcgtgttatt gggtttttat ttattggggt ttttggtggt attgggttat 4140 

agtaagtgtt tttttatttt tttagtttat tatatatatg tttattattt tgaagaaaaa 4200 

tttttttatt atgagcgaaa gtgagaaata cgtatgttta ttgtttttaa agaaagaaat 4260 

ttaatatggg tttaatgtta tttagtgagt gtttttattt tgagatatta gggttataag 4320 

ttattattat atattatggg t 4341 

<210> 281 
<211> 4341 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 281 

gtttatgata tataataatg atttgtgatt ttaatgtttt aaaatggagg tatttattag 60 

gtagtattaa gtttatatta agtttttttt tttagaaata ataaatatac gtgtttttta 120 

ttttcgttta tagtaaaggg gttttttttt aaagtggtaa gtatgtgtgt ggtagattaa 180 

ggggatgagg gaatatttgt tgtggtttaa tgttattaga gattttagtg aataaggatt 240 

tagtggtacg tatatttgtg gaggatcgta aaagcgataa gggattagta atgtttgaag 300 

cgttttcgga attttggtag tgagattttt gaaagtttat ttttgatttt ggtagttggg 360 

tagtcgagga cgaggttttt ggaggaatag atnttttttt attatttagt attagagcgg 420 

gatttttgga ttttttggag tggttgttag tgggttggtt tttagagttt ttattgtggg 480 

tttttatttt attatttttt ttaattattg tattgtgaat gtttgtttaa ttgtgtgtgt 540 

gaatttttta taataaattt tgaatttgag aattatittat tgtttgttgg gttattttga 600 

aattttttgg gaaagtgaaa gttagcgtat aggtgtagga afeitttggtta atataataat 660 

taaaatgaat: ntgattgggt gtagtggttc gtgtttgtaa ttttagtatt ttgggaagtt 720 

?aaggcgggtg gattatttga ggttagpagt ttgaaatrtag ttnggttaat atggtgaaat 780 
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gttgttttta ttaaatatat ataaattagt taggtgtggt gatgtgtgtt tgtaatttta 840 

gttatttagg agattgaggt attagaatcg tttgaatttg ggaggcgggg gttgtagtga 900 

gttgagatgg cgttattgta ttttagtttg ggcgatagag tgagattttg ttttaaaaaa 960 

aagaaagaaa gaaaagaaag aatttaacgg tttattttat ttttgtttta ttttggaatg 1020 

gtggttggat ggcggcgttt tttattttag atrttaggga gtgggagttt ttttgtagtt 1080 

ttcgatttag aatcgatttt ttcgtagtgt ggttatagaa tttttgtagt gagggtaaag 1140 

gtggtattta gggttgttag ggggtttttt ttttgtggtg aagttttttt agtttagatt 1200 

ttatagagga gtagggatgg gaggtaggat ttagttrttt ttttttttta aaatttttaa 1260 

taaggattat ttattgttta cgttttagga aatgtttaag atggaagagg ttatattttt 1320 

ttaatataat ttagtttagt tttttttttt tttataagtt tttaagtttt ggggtatatg 1380 

tgtagaacgt gtaggtttgt tatataggta tatatgtgtt aatttagttt agttttaaaa 1440 

ttaatttttg taggattgat atttattttt ttttttgaaa gcgttttttt ttttaaattt 1500 

tgttggtatt taaatttgtt atgaataaaa gagagaatat tttgtttttt agattaaatg 1560 

tattgttatt tattaaagtt tttgagttgg gtgagtaggt gggcggtaag ggcgcgaggg 1620 

aggcgtgttt atcgtgatat tgggattttt ggtcggtaat tgcggtttaa ggttgaaaga 1680 

tgatatagtt attttaggag ttttttggga ttgatatcgg tggtggaggt agaggcgcgg 1740 

ttagtattat tggattttat tttttagtaa taaacgtgat ttaaaattta attaaataat 1800 

ttgggttatt ttaattgttt agttcggtag tatttgttgt gagtgttggt gggatgggag 1860 

cgtaggtttt agttgtgttc gggttttttt tgttattttc gggattgttt tgtttttttt 1920 

cgtttatcgt atttatttta cgatttcggt taattacgat ggatttcgag ggtgcgttag 1980 

ttttacgcgt cggggaggtt ttaggtatgg tatttattgg ggcggtgggg gggaatagcg 2040 

ggtaggggtt ttttcgcgta tgttttattt cggtttagtt cggaattttt tttatttagt 2100 

aggatcggcg agaatttttt gaattttttt tttttagcga ttttcgtagt ttgatttggg 2160 

tttgttttcg cgcgttttta ggaaatagaa gtcgacggaa agtagtggtc ggcgttcggg 2220 

tgggttcgtg gttcgcgtag tcgtggattt gaaatttttt cggcgttttt tttcgtttat 2280 

tagagttttt cggagatttt tttcgagttt cgttttttat ttttttttaa ttttttttgt 2340 

aggaaggtcg cgggttaggg ggatgttttt gatgcgttat taggagtttg ggtagtttgg 2400 

tgggaattag aattaggcgt gtttcgagtt cgtttattag cgtttttgtt atgattatga 2460 

ttattattat tgttagttat tagtagtcga gaaggtgttt tcgtcgtgtt cgtgtttcgg 2520 

agcgttagtt ttcgttacga acgtcgtttt tttaatgttt tttttcgcgt ttttagcgcg 2580 

gttattgggc gttgttttta gaagcgatat ttatcggttt ttgcgtagtg ttggagtgcg 2640 

gttttcgttt tcgtttcgtt ttttttattt cgcgtttagt ttcgtttcgc gtcgcgaaga 2700 

aatgaaattt atagattttg tgttgagggc ggtttcgggc gtagaaacga aatttagttt 2760 

ttggtttcgt atttgtagta tttgtttcgc gggaattcgc gggagacgta ggttttcggg 2820 

ttcgaattta ggtttgttga attagatgtt cgttttatta ggttcgtttg aggattaagg 2880 

tgtttcggag gttttttaat ggttcggagt tatttaagtt tgtacgtatt tttgaattta 2940 

ggtatgtttt ttttttcgtc ggttttgaaa atagatgtat tggtaattgg gggtttttag 3000 

ttttttttta gggtttattg ggatgatttt agtattagta gggattttta ggtattgtgt 3060 

gttaatggtt gtagagtttt ttttgtttcg aatttcgttt agtattttta tttttaatta 3120 

ggtatattta tattattttt tttttttttc gttttttttt gtttattttt aggttttttt 3180 

ggagtttggt ttatttggtt tttttagggt ggaaacgggt gtgtggaagg aaaataaaat 3240 

aaaataaata aaatagtaat aataaagttt aaaataataa atataatata ggtaatagtt 3300 

gttaggttag gttatcgtat tttaggttag gttatcgtat tttttgtgat ttgtaggtat 3360 

atatttagat ggtttaaagt aattgaagat ttataaaaga agtaaaaata gttttaattg 3420 

atgatatrtt attattgtga tttgtttttg ttttatttta attgattaat gtattttgta 3480 

atttttttta tttttaagaa ggttatttgt aatttttttt atttttgaga atgtattttg 3540 

tgagatttat ttttgtttat tagagaataa ttttttttga ttgtaatttt ttattatttt 3600 

tttaaatttt ataaaacggt tttattttta ttttttttcg ttgatttttt tttcggattt 3660 

agttcgtttg tatttaggtg aaatatatag ttatgttgtt tatataaagt ttgtttggtg 3720 

gtttttttat atggacgtat atgaaaatag ttaatattta gtataagtta tttagaagta 3780 

atatataggt taaaatttta agtagttttt tttagtattg taataagttg tagatgtaag 3840 

tttattttaa aagtttatta tttgtttcgg gattttagtt tttaatttta gtttagtttg 3900 

gaggatgatt agttaatata attgaattgt tagttttttt agaattgtta cgggttaagt 3960 

aatttattgt ttttgtttga aatttgtatt ttattttgtt ttttttattt taaacgatat 4020 

ataagttagt tgtttttttt gttggggttg gtagttattt tgggtatgag tttgttgttg 4080 

gttcgaatta aaagttttgt tttggttttt aaaggttttt ttgttttttt tggttagagt 4140 

tttattgtaa aggtgaaagg ggttggggag ggttgagtta agtagattta tgggtggttt 4200 

tttttttggg agtttttatt ttcggttaat tggttgtgag tgggagatgg tatttttttt 4260 

tgggagcggt tggtatttta tttttatttt ttttagttta gtttatagtt ttttataagt 4320 

tttcgttaaa gatttttatg a 4341 

<2i0> 282 > . 
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<213> Artificial Sequence 

- " > • 

<220> i 

<223> chemically treated gen omic D NA (Homo sapiens)^ LJ 

Seit^206 " 



seq_prot.txt 



<400> 282 

ggtagaatcg gtgtttatag ttttcggggt ttaattttat taggtaggag gagttgagtt 60 

ggttttgtaa tcgaagttta aggttagtga tggtagcgag gggtagagtt tagggtaggg 120 

gaaattagag tttttaggac gcggttagta gagttttgtt tttgttgtag cgtcgggcgg 180 

gggtttggtc gttatggttc ggtttttatt taagacggtt gtcgaggagg ttcgtagaga 240 

ggtattagga tagtaacggg gtagtgttat tttcgcggtt tttcgagcgg aaggaaagga 300 

gttttttagg ttaggtattg tttttttggg taggagcgaa gtagggggga tgttcgtttt 360 

ttttttgatt tattttattt ttcggagttt tggtagcgaa gcggaggcgg agataggtgg 420 

tgttagggcg ttttttcgtt aggtcgtagg tttcgcgggg taataggata ttggtttttg 480 

gtaggcgggc gcggggtttt cgggttcgat ggggagaggt cggggtcggc ggcggcggat 540 

ggacgttggt tcgggagata gagttcgtcg ttttcggaaa ttggattcgt ttcgtttggg 600 

cgtcgcgggg agcgtgtggg acgcggtgga cggggtcgtc gttttggacg tttacgcgcg 660 

cggttttttt ataggatttt tattcgttta ggagttcgta ttttcggcgt tgttcgtttt 720 

tcgcgttttg tagtttgggt tttagacgga aggttttggg gttaagggtg gttggagtag 780 

ggaggtttcg ggggtttttg ttttcggagg aggttggttt tgggttagta gggaggtttc 840 

gggtattcgg agttttggtt tagttttaga ttttacgcgt ttttggttag agagttcgtt 900 

ttaaggtcgt ttattttcgt tttaagggtc gggtgttata ggggtttacg atgtcggcga 960 

ggtcggggtt gatagttttc gagataatag ttttgtcgtt gatttggggt ttattcggtt 1020 

ttcggagtaa gtaaagtcgg ttgtagtcgg ttatagtcgc gcgttttttc ggagtcgtga 1080 

gtttcgttcg cgtttcgttt tttcgttcgt agttttcggt tcggggtttt tatttgaagt 1140 

tttgattttt tttttttttt tagatttttt gtttttggag gttatttagg attttttcgt 1200 

gggcgaaaat tttagagcgg cgttagtttt aagtttagtt ttagtttaag ttttaatttt 1260 

agttttagtt ttagttttag ttttagtttt ggttttattt tttagtttag tattaatttt 1320 

agttattatt ttaattttat tttttatttt agttttagtt ttagttttaa ttttagtttt 1380 

aattttaatt ttagttttag ttttaagttt agttgtagtc gtaattttag ttttagtttt 1440 

agttttagtt ttaattttta tatttttatt tttagtttta attttagttt taattttagt 1500 

tttaagttta gttttaattt tagttttaat tttagtcgta tttttagttt taattttagt 1560 

tttaatttta gttttaagtt tagttttaat tttagtttta attttagtcg tattttttgt 1620 

tttagttttt attttagttt taattttaat tttagtcgta ttttttgttt tagttgacgg 1680 

gttaaagttt taggagagtg tggttttttt taggcgttat taggaggggt aggttttagt 1740 

tagttgggga aattttattg ttatggtttt tagaagttat ttttttttta ggtaagatag 1800 

gttttaaggg agtgtttcga gggcggtttt cgggagtttc gtgggtttta gtatttttat 1860 

atattttgag gatagatacg gttttttttt tttagtgggg atagttatag ggataggtat 1920 

agggggtttg gttgaggttg gaggttagtt atagttaaga agtttcgaga ttaacggatc 1980 

gtttagttta gatttttttt taggtttaag aggagaggga attagggaga aaagtttaga 2040 

ttcgttgttt taagtcgcga tgtttagggg tttcgtatag tttttcgcgt agaggtcgtt 2100 

tggtttcgcg gttttttttt ttgcgaggcg tttatagtcg gtattttagt tacggaaacg 2160 

tagtaggtgc gaaatcgttt cgagttcgga gtaggaggtt aggtcggtcg tttcggggga 2220 

tttttaaggg gaggcgtcgg gggagggggg tagttttgtc ggtcgtagcg gggcgtttac 2280 

ggaaaagtag gaggaggttc ggaagtttat ggtgtttttg tagaggttcg ggggttgggg 2340 

ggtggtggag gggtttcgga agtttagttt tcgggttttg gagttcgtta cggcggtagt 2400 

tttgcggcgg cggttggatt tgggtagttg tttggacgtg ttggtttttg tttagtagta 2460 

cggagagttc ggtttggcgt aggagattta cgcgttgatg agcgataatt tgttgcgagt 2520 

gttgggagiat tcgtgttttt atcgtcggtt gagcgcggtc gatcgcgagc gtatttttag 2580 

tttgcggatc ggtcggggtc gggcggtgtt gggcgttttc gtattgttta gtttttatta 2640 

ggggggtcgt ttagggtttt ttaggggttt tcgtggcgag gagttttttg cggcggtttt 2700 

tgtgtttttg tttttatttg cgtatttgta tgtgtttaat tttcgggaga atatttggcg 2760 

gtttttgatt taggtgttcg aggaggtttc gtttcggggt tgcggttttt gtattatgta 2820 

taattatttg tttttggcgg ggggtattcg tggtttcggt gttaaggtcg tttgttttaa 2880 

cgaggttttt tgttataatt ttttgattaa tatttggagt taggttcggt ttatgtagta 2940 

ggttcgagtt tagtttaagt tggtggtttt ggacgggttg ttttatgtta tcggtggcga 3000 

atgtttgtat agtatggagt gttacgattc gcgaatagac gtttggattt tacgcgcgtt 3060 

atttttcgta ggtatttttt ttgtggttta cgaggttgtg gtttgtcgtg gggatattta 3120 

cgttatcggg ggttattttt tttatcgttt gtttaggtat agtttcgtga aggatgtttg 3180 

ggacgagtgt ttatatagtg ttagttatcg gcgttttagc gatatcgtgg tattgggggg 3240 

ttttttgtat cgtttcgatt tgttgcgggg cgtgggcgtc gtcgtgatgc gttataatat 3300 

agtgatcggt ttttggagta gggttgtttt tttgtttttg ttcgttttcg ttttattgta 3360 

ttgtattatt ttgggtaata ttatttattg ttttaatttt taggttattg ttatttttac 3420 

ggtttttggg gggattgttt agttttaggt taaggagttg tagtttttpt ttttggggag 3480 

tatcggggtt tttagtttat ttattttgat tttgtttttt gaggatcggt tgtagatttt 3540 

attttgagtg gtaggtagag aattaaagtt gtttcgttgt tttt£aggga gatttttttg ,3600 

ggatgggttt gagaggtcgg ggtttaggga aggggttggg atcggaattt tttgtttttg 3660 

tttttggata attttttttt tttgttttaa aggttgtcga tt^ttttgaa gtttagattt 3720 

ttttagtttt tttttgtttt ttattttata tttagattgt tttttgattt aatttcgtat ,3780 

ttattta^ag atttttttag tttgttgata tttttttgtt tgtgggattt tttatttttt 3840 

agagttaggg gttgatgcgt ttttatagat aaggatttgg ttcgttggag titttgttgag ,3900 
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tcgagagagg 
gtaaaaacgt 
tttaggtatt 
agaatttttg 
atatttttag 
ggggtttttt 
tgaggtttga 
tttgattttt 
agttgtggga 



agggggtaga 
ttttaggtaa 
aatttcgtag 
tttttgaaat 
tttatattag 
tagttttttt 
gcgttatttt 
tttttttttt 
gataagatta 



aaatatttat 
cgttttcgtt 
ttatttgggt 
tggttcgttg 
gttaggttta 
agaagtttat 
agttttgttt 
gtttataaaa 
tagttatgag 



seq_prot. 
attttttatg 
tttgggattt 
ttgtttggta 
gtgtagtttt 
tttcgggttt 
tttatttttc 
ttttttagtt 
tgtggatagt 
tatttcgtac 



txt 
ttttgttagt 
tttgtttgtt 
ttgtggattt 
gttgtttgta 
attatttttt 
gttaattttc 
gtgtagattt 
ggacgtttgt 
ggtgtttagg 



aggataggga 
ttaaggtttt 
ttaagggttt 
gtttttgttt 
gtagttttgt 
gatttttaaa 
ggacgagata 
tatttaagag 
atg 



<210> 283 
<211> 4433 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 283 

tattttggat 

gtgatagacg 

tttaagttta 

atcgggggtt 

tgtagagggt 

agttgtagat 

tgagaattta 

taggataggt 

tttgttgata 

gagttttagc 

agggagtttt 

attgagttag 

ggttttaaaa 

aggaagtttc 

gtttttttgg 

gtagtcggtt 

aggggaaggg 

tggtagtgat 

gggcgggggc 

agcgtattac 

gtgttacgat 

tttttacggg 

ttttacggta 

gtggggttta 

cgatggtata 

tgggtcgaat 

agacggtttt 

tgtagagatc 

tgttttttcg 

tcgtaggagg 

ggttgggtag 

tgcgttcgcg 

gtaggttgtc 

gggtaaaggt 

tcgtggcggg 

tttcgggttt 

tttcgttgcg 

aggcggtcgg 

gtagttgggg 

tttgcgcggg 

tttttttttt 

tggtttcgga 

tttttatgat 

tgttgggatt 

gtttggggaa 

tttgtttttt 

ttggggtagg 

atgcggttgg 



atcgtgcgag 

tttattgttt 

tatagttaaa 

ggcgagaagt 

ggtgagttcg 

agtagggttg 

tagtgttaaa 

agaaagtttt 

gagtatagga 

gagttaagtt 

atagataggg 

gaaatagttt 

taatcgataa 

gattttagtt 

agagtagcga 

tttagggggt 

ttgtagtttt 

"ttgggggttg 

gggtaggggt 

ggcggcgttt 

gtcgttggaa 

gttgtatttg 

ggttatagtt 

ggcgtttgtt 

gagtagttcg 

ttggttttag 

ggtatcggag 

gtagtttcga 

ggggttgaat 

tttttcgtta 

tacgaggacg 

gtcggtcgcg 

gtttattagc 

tagtacgttt 

ttttagggtt 

ttgtagaaat 

gtcggtaggg 

tttgattttt 

tatcggttgt 

gggttgtgcg 

ggtttttttt 

gttttttggt 

tgtttttatt 

tacggggttt 

ggggtggttt 

ttggtagcgt 
ggatgcggtt 
ggttggggtt 



gtgtttatag 
atattttata 
aaggggtaga 

ggggtgggtt 

gagtgggttt 
tattagcgga 
tagatttaga 
agaggcgagg 
agtgtgaatg 
tttgtttgtg 
gggtgttagt 
gggtgtggag 
tttttaaagt 
ttttttttga 
agtagttttg 
agagttagga 
ttggtttgga 
aggtagtaaa 
agggaggtag 
acgtttcgta 
cgtcggtggt 
agtaggcggt 
tcgtgggtta 
cgcgggtcgt 
tttagggtta 
atgttggtta 
ttacggatgt 
agcggggttt 
atatgtaggt 
cgagggtttt 
tttagtatcg 
tttagtcggc 
gcgtaggttt 
aggtaattgt 
cgggagttgg 
attatgagtt 
ttgttttttt 
tgtttcgagt 
gagcgtttcg 
gggtttttgg 
ttttttaggt 
tgtggttgat 
gaagaagagg 
tcgggaatcg 
ttaagaatta 
ttggggagag 

ggggttgggg 

ggagttggag 



ttgtgatttt 
gataaggaga 
attagggtga 
tttggagggg 
ggtttggtgt 
ttagttttag 
tagttacggg 
gcgttgtttg 
ttttttattt 
gagacgtatt 
aagttgagag 
tgaggggtag 
agaaggggaa 
gtttcggttt 
gttttttgtt 
tgaatggatt 
attgggtagt 
tggtgttgtt 
ttttgtttta 
gtaggtcgaa 
tggtattgta 
agaagaggtg 
tagggaaggt 
agtattttat 
ttagtttgag 
gagggttgta 
ttttcgttag 
tttcgggtat 
gcgtaggtag 
tggggagttt 
ttcggtttcg 
ggtagaggta 
tttgcgttag 
ttaggtttag 
gttttcgggg 
ttcgggtttt 
ttttcggcgt 
tcggggcgat 
tagaggagga 
gtatcgcggt 
ttgggggagg 
ttttagtttt 
ggtcgtgttt 
ttttcgggat 
tggtaataag 
ttatattttt 



ttggggttga 
ttgggtttag 



gttttttata 

aagggaagtt 

cgtttaggtt 

ttgggagagt 

gggttggggg 

aggtaagggt 

gttggtattt 

gggacgtttt 

tttttttttt 

agtttttggt 

ggtttgtaag 

aaagaggttg 

agttgtttag 

tttaggttta 

tgttatttag 

gaggatttcg 

ttttttagag 

tagggtggtg 

ggagtcggtt 

gcggtatagg 

tgggtattcg 

attttcggtg 

gtttgcgggg 

gttgtatagg 

ttgggttcgg 

gtagaagatt 

aaataggtag 

ttgggttagg 

aggtagggat 

tgagcggttt 

gtcggttcgt 

cgggtttttt 

gtcgggtttt 

tcgtcgtcgt 

tttttttatt 

tttttgtttt 

ttttttttga 

ttcgtatttg 

ggtcgcgggg 

ttggggtagc 

gtttgggttg 

aattaggttt 

gtttttagag 

atttttttgg 

gtttttttag 

ttgaggtttt 

ggtgggggtt 

ggttggggtt 



atttttttgg 
aaatgtttcg 
ttatttagag 
tttataaggt 
tatgggtagg 
tttaggtttt 

ggggaggttt 

tgttttttgt 

cggtttagta 

tttagggaat 

taggtacgga 

agggagtttg 

aaataagagt 

ttttaggagg 

agtgaggttt 

gtgtttttta 

atcgtgaagg 

tagtgtagtg 

attgtgttgt 

aagtttttta 

ttttaagtat 

acgtagatgt 

agtggcgcgc 

tattcgttat 

gtttgttgta 

tcgttggagt 

ttgtgtatgg 

ggtcgttagg 

ataggggtcg 

ttttggtaga 

aggttgagga 

agtattcgta 

tcgtgttgtt 

agggttgtcg 

attttttaat 

ttcgtgagcg 

gggtttttcg 

ttgcgttttc 

ttaggcggtt 

gggtttagat 

ggcgattcgt 

tttgtatttg 

tgtgtggaag 

ggtttgtttt 

ttggttgaga 

gattcgttag 

ggggtagggg 

gggattgggg 



3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4433 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 



208" 



ttagggttgg ggatgcggtt ggggttaggg 
ttgggattgg ggttagggtt ggggatgggg 
gggttggagt tgcggttgta gttggatttg 
aggttggagt tggggttggg gttgaggtgg 
gtgttgagtt gggggatgaa gttagggttg 
agatttgggt tgaggttgaa tttggggttg 
gattttgggt aatttttagg ggtaaaaagt 
gtgggaattt cgggtcggga gttgcgggcg 
ttcgggaggg cgcgcggttg tggtcggttg 
tgggttttag gttagcggta ggattgttat 
tatcgtaggt ttttgtggta ttcggttttt 
tttttagtta ggggcgcgtg gagtttgggg 
ttttgttgat ttagggttag tttttttcgg 
agttattttt ggttttagag tttttcgttt 
gtagcgtcgg gagtgcgggt ttttgggcga 
gggcgtttag ggcggcggtt tcgtttatcg 
ggagcgggtt tagttttcgg gggcggcggg 
gtcgtcggtt tcggtttttt tttatcgagt 
tagtgttttg ttgtttcgcg gggtttgcgg 
ttttcgtttt cgtttcgttg ttagggtttc 
atattttttt tgtttcgttt ttgtttagga 
ttttcgttcg gggggtcgcg ggggtggtat 

gggttttttc gatagtcgtt ttgagtggaa 
gcgttgtagt agggataggg ttttgttggt 
ttgggttttg tttttcgttg ttattattga 
rtttttttgt ttgatgaggt tgagtttcga 

<210> 284 
<211> 4494 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic 
<400> 284 

attataaatt agtattatta taaaggtaag 
agggaaataa aatttaagag gaagttagat 
gtagtgaaat attatataga ttagggtttg 
taaaattatt aatttagata ataattatgt 
tgtttttatt tatttttttt ttatagttat 
aaagatagtt aattcgtaat tatatttttt 
tttttaaatt ttagtagatt gaattattta 
aatgtgaatt ttatttttat tttttaaagg 
aagatggtag aaagagtata ttaatgtaat 
tttatatgtt tttttttttt atatagaaag 
gttttgataa aagttttaaa aatacggtta 
tttttaattt gtagtaagat tattcgggaa 
taagttatga gagaakttag aattttttta 
cgaaattttt taatttttaa aataatttgt 
atttgtgggt aaaaattaga agagaaaaaa 
aaattaatgg tgtgatagtt tgtatagttt 
aaattgtttt tttttttttg gaaaattatt 
ggtaaggaag gttgtaaatg ttaagatgtt 
aatatagttt tttttggttt aattgttatg 
agaaaaatag ttattgtgta taggcgttag 
agatttgtta tatatttagt ggtgggtgga 
ttttttgttg tattgtagag gcgttatttt 
titttatttgt attttaaaaa aaagattttt 
gggaattaaa ttagagtttt tagagttgtt 
cgtttttttg gatgtatagg atagagttcg 
tagacgtata ttttattttt cgttttattt 
tagaattaag tttcggaggt gattcgaggg 
agaagtttcg gtapcgtttt tgtttttttt 
fgggcgggag tttcggtggc gtattgagat 
gtgtcgcggt ttttgttttc gcggattttt 
gaatttt^cjgg gaccjtttaga tftggggaag 



seq_prot.txt 

ttggagttgg agttgggttt agggttgggg 2940 

gtgtgagggt tgggattggg attggggttg 3000 

gggttggggt tggggttgaa gttggagttg 3060 

gggatgaggt tgaggtggtg gttgaggttg 3120 

gggttaggat tgaggttgga gttgaagtta 3180 

gcgtcgtttt gagattttcg tttacggaag 3240 

ttgaaggaga ggggagagtt agggttttag 3300 

gggaggcgga gcgcgggcga ggtttacggt 3360 

tagtcggttt tgtttgtttc gggggtcggg 3420 

ttcgggagtt gttagtttcg gtttcgtcgg 3480 

gggacgagag tgggcgattt tgaggcggat 3540 

ttgggttaaa gtttcgggtg ttcgggattt 3600 

gggtagggat tttcgaggtt tttttgtttt 3660 

gagatttagg ttgtagggcg cggggagcgg 3720 

gtgggggttt tgtggggagg tcgcgcgcgt 3780 

cgttttatac gtttttcgcg gcgtttaggc 3840 

ttttgttttt cgagttagcg tttattcgtc 3900 

tcgagggttt cgcgttcgtt tgttaggggt 3960 

tttggcgaga ggacgttttg atattatttg 4020 

gaggagtgaa gtggattagg aggggggcgg 4080 

gggtagtgtt tggtttggga ggtttttttt 4140 

tgtttcgttg ttgttttaat gtttttttgc 4200 

gtcgggttat ggcggttagg ttttcgttcg 4260 

cgcgttttgg gggttttggt tttttttgtt 4320 

ttttagattt cggttgtagg gttagtttag 4380 

gggttgtgga tatcggtttt gtt 4433 



DNA (Homo sapiens) 



atgattgtga ttaggttaaa tttttaatag 60 

ttaaaaataa ataaataaat aaatattagg 120 

ttaaatttat atttttatta atatattagt 180 

ttttttcgat agttaaggtt ttattattat 240 

ttgtagtaga tttatttttt aagttttata 300 

tagtggatat agagagaatt agataaatta 360 

tttttttatt aatatttcgt aaaatggtat 420 

agttaataat agttattgaa aaggttttaa 480 

taaatagatt aaaaattaaa ttcgaatttg 540 

aaatattgtt taataaagtt atatatttaa 600 

ttatatgttt ttaatttttt aaatgtatgg 660 

aattggaata attttataaa taaatgtaag 720 

gttaagtgta tttttttaaa atatggaaaa 780 

tttaartagt tttaaggtga tttttgagtt 840 

gtttaatgat tttgtaaatt agaaattaaa 900 

tagtggaatt ataggaaaat atttgttcgg 960 

aggaggttgg gggaaaagag gggaaaaacg 1020 

aaataggaat ttaatcgaag ttcgaagtga 1080 

taaataattt ttaagtattt ttagatgtaa 1140 

ttgaatgttc gagagttgga tgaatgggga 1200 

ttttattagg ttgtagtttt tatatagttt 1260 

tttcgtttta atttttattt aattcgattt 1320 

cggatttttt ttagtatttt gtgtttttag 1380 

agggaagagg tcgcgtttta attagataaa 1440 

tgttttttta ttattcgtta ttgttttttt 1500 

ttttttcgtt aaggatttta tagtttagta 1560 

tgggaagtag aggggtttcg aggtttcgtt 1620 

taggcggacg cgtagtttcg ggaagggagt 1680 

taacgcggtt cggtttacgg cgagggcggg 1740 

tgj:t£tttag atagagttta gggtgcgagc 1800 

gfcgcpgg£tg gttatgggtt ttcgattttt 1860 
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tttatttttc gtaagttata taaaagggag acgcggatat gtataaagtt tgtttgtgga 1920 

aggcggtgcg cgatttcgac gtcggttcgg gttagggtcg tatttaggtt gtcgtcgtta 1980 

gtagtcgcga aaagaggttg gagtaaggaa gggggatggg ggtgagagag gaaggtgaaa 2040 

cgaggcggag aacgtaggga aaagcgaggg gtttttagtt ttttgggttg tttcgatttt 2100 

tttttttttt ttttagggtt taagagaaag gaaagggtaa cgatttaaga gcgaaggatt 2160 

ggttttaggg acgttgcgtt ttcggttttt tttacgtgtt tttttttttt ttttttttgt 2220 

ttgaatgttt ttcgtttttt tcgattttcg tttttaaggt agggattaag tcggggtttg 2280 

ggtttagggt cgttttttat ttttttcgtt tcgggttggc gcggaattag ggagattagc 2340 

gtttcgttcg ttttttttta gcgggtcgga gcgcgatttt ttggggtagg gttgggtcga 2400 

aagcggggat gcgttggacg ttcgtaagcg ggggcggaga ggagaggggt tttattgacg 2460 

gattttcggt ttttttgtat taaagatttg ggttaaggtt ttggggggta tcgcgttttt 2520 

atcgtgtagt tttttttcgt ttttcgaggc gcggtattta tttttagcgt tcgagtcgtt 2580 

taggcggtta gtaggagtag tgttaaatcg ggtagtatcg cgattttgcg cggggtatcg 2640 

agtgcgttgt tgtgcgagtg ggattcgtcg cgtttttgtt ttgttcgcgt cgttatcgtc 2700 

gtcgtttttc ggggttttcg cgtacgtttt ttcgcgtgtt ttcgtttatc gttgtcgagg 2760 

aaattgacgg agttcgagcg cggcggcggg gtttagagtt aggcgagtta gttgattcgg 2820 

tttatttcgt tcggtattcg agagagattt ttagcggcgt cgtcggggaa ttgcgttcgt 2880 

tcgcgtcggg aggggttttc gcgtttcgcg tttataggtg tacgcgtttt tggcgtcgtt 2940 

tgtattttac gcgttttttt cgtttttcgg tcgacggttt atttgggttt cgtgaatagt 3000 

gggagggaga gtttggggtt aggagaggga cggtgtagga ttagggaaag gtgagtttta 3060 

ggacgttgag gttttagaaa agtcgagagc gtttttgttc gttttcgtga gtttgaatta 3120 

ttcgatttcg taggtttttc gggggtgtcg tataaaggat tgttgttagg gtgcgttttt 3180 

attcgtattt cgtttttttt ttcggtttgg agaggtgggg taggcgtttt tggaagagaa 3240 

tgagaacgag tgaagtttaa aggaaatagg attttttttg tttgttgaga tagtaaaatt 3300 

ttatttttta atttttaacg ttaaaagtag gtacgagtaa tttggaattt ttatttttag 3360 

aacgaattaa aggagtaagg cgtaggattt ggtaaagaag cggttaattt atttattttt 3420 

ttttgaaagt agggttttgg gtttgggttt tgtttttttt ttatattttt gtttgttttt 3480 

ggttttgttt tgttaaaggg ggtggttagt gtggattttg cggggcgggg ggtttttttt 3540 

tgttgttaag tagggatgta atatttattt gagttgggga agaaggaaaa attaggagag 3600 

attattcgta ttcgattttc gtaaatgagg atttttgatt ttaaacgttg tttttgtttt 3660 

ttattgtgtt tgttttgaat tatagaaatg aattttttgt tatgttattt ttttttggat 3720 

taaatataaa tcgtttattg tttagttagt gtttagatag tttagaatgt tttagaattg 3780 

tttagatata ttttttgttt ttgatttgaa gtagtattta gttattaagt tatagtttat 3840 

tttatatagg gtttggagcg aaggattgaa gttagggagt gttttggttt ttttgagggt 3900 

tgtattgtag tttgtttttt tttttttgtt tattgcgttg ataagggtgt ttaggttcgg 3960 

gaaggatggt ttagttagcg gggtatagat tttttttttg ggaagttatt tttgtattag 4020 

tttttttaaa tgttgttttg tgtgtttttt ttcgtattag gtatttagtt ttgatttttg 4080 

gattattttt gtttttgaat gttttttgga tgtttttaat aaataaggat aaatatttat 4140 

tgttatgtag tatttgttta tacgttttat gttatttatt ttttataaaa gttaaagagg 4200 

ttgttattat tatttttatt tatatattag gaaaatgaag ttaatgaatt tttaaattta 4260 

atagtaagta agtgatatga agttgggata gtagggaaaa gtttaaagtg ataggataaa 4320 

tttagttgtt tattttttgt attatattta tttatcgtaa tttcgtattt gtttcgggtt 4380 

tgtatgagta aataaggtaa aaaagaaggt atttgaatgt aaagagaaac gtggtttaaa 4440 

gttataaatt ttgtagagtt tattgtaaaa tgtaaacgtg agattttttg ttat 4494 

<210> 285 
<211> 4494 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 285 

gtaataaagg gttttacgtt tgtattttgt agtggatttt gtaagatttg tagttttgga 60 

ttacgttttt ttttgtattt agatattttt ttttttgttt tatttgttta tgtagattcg 120 

gaataaatac ggaattgcgg tgggtaaatg tggtgtagaa agtgaataat tgggtttgtt 180 

ttgttatttt aggttttttt ttgttgtttt agttttatgt tatttatttg ttattagatt 240 

tgggagttta ttagttttat ttttttgatg tataaatagg aataatagta atagtttttt 300 

tggtttttgt aggaagtaaa tgatatgaag cgtataaata aatattgtat gataataaat 360 

atttgttttt atttgttgag gatatttaaa ggatatttag gggtaaaagt aatttaagag 420 

ttaagattga atgtttagtg cgggaaaaga tatataagat aatatttagg ggagttggta 480 

tagaaatgat tttttaggaa ggaagtttgt atttcgttgg ttgagttatt tttttcgggt 540 

ttaggtattt ttgttagcgt aatgagtaag ggagagaagg taggttgtag tgtagttttr 600 

agaagggtta gagtattttt tggttttagt ttttcgtttt aagttttgtg tggagtgggt 660 

tgtggtttgg taattaaatg ttatttt;agg ttaagagtag gggatatatt tgggtagttt 1 720 

tagagta ttt taaa ttattt ggatattaat tggatagtgg acggtttgtg tttaajttag* teQ_ 

SeTte ZXO 
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gagaaagtgg tatggtagaa ggtttatttt tataatttag gatagatata atgaagaata 840 

agggtagcgt ttgaggttag aagtttttat ttacgggggt cgaatacgaa tgattttttt 900 

taattttttt ttttttttta atttagatgg atgttatatt tttgtttaat aataaaaaaa 960 

gatttttcgt ttcgtaaaat ttatattgat tatttttttt aataaaataa aattaaaaat 1020 

aaataaaaat ataagaaaga aataaaattt aagtttagaa ttttgttttt aagaagaagt 1080 

aaatgggttg gtcgtttttt tgttaggttt tgcgttttgt ttttttggtt cgttttaaag 1140 

atagaaattt taggttgttc gtgtttgttt ttgacgttgg gggttaaaaa atgaggtttt 1200 

gttgttttaa taagtaaaga aaattttatt ttttttaagt tttattcgtt tttatttttt 1260 

tttagaaacg tttgttttat ttttttaaat cgagagaaaa aacgaaatgc ggataaaaac 1320 

gtattttagt agtagttttt tatacgatat tttcgggagg tttgcggggt cggatgattt 1380 

aagtttacgg ggacgagtag gagcgttttc gattttttta gagttttagc gttttaggat 1440 

ttattttttt ttgattttgt atcgtttttt ttttggtttt agattttttt ttttattgtt 1500 

tacgaagttt aggtgggtcg tcggtcgggg agcggagggg gcgcgtgggg tgtaggcggc 1560 

gttaagggcg cgtgtatttg tgggcgcggg gcgcgagggt ttttttcggc gcgagcgggc 1620 

gtagtttttc ggcggcgtcg ttaggggttt ttttcgggtg tcgagcgggg tgggtcggat 1680 

tagttgattc gtttggtttt gagtttcgtc gtcgcgttcg ggtttcgtta gttttttcgg 1740 

tagcggtagg cgagagtacg cggaggagcg tgcgcggggg tttcgggaga cggcggcggt 1800 

ggcggcgcgg gtagagtaag gacgcggcgg attttattcg tatagtagcg tattcggtgt 1860 

ttcgcgtagg gtcgcgatgt tgttcggttt ggtattgttt ttgttggtcg tttggacggt 1920 

tcgggcgttg gaggtgggtg tcgcgtttcg gaaggcgggg ggaggttgta cggtggggac 1980 

gcgatatttt ttaagatttt aatttaagtt tttaatgtag agaagtcggg ggttcgttaa 2040 

tgggattttt ttttttttcg ttttcgtttg cggacgttta gcgtattttc gttttcggtt 2100 

tagttttgtt ttagggagtc gcgtttcggt tcgttgagag ggagcgggcg aggcgttggt 2160 

ttttttggtt tcgcgttagt tcggggcgag aagggtaggg ggcgattttg agtttagatt 2220 

tcgatttagt ttttgttttg gaagcggggg tcgggggagg cgagagatat ttagataggg 2280 

gggaaggggg aaggagtacg tggggaaaat cgaaaacgta gcgtttttaa agttagtttt 2340 

tcgtttttga atcgttgttt tttttttttt tttgggtttt ggggaggagg aggaggagtc 2400 

gggatagttt aggaagttag gagtttttcg tttttttttg cgtttttcgt ttcgttttat 2460 

tttttttttt tatttttatt tttttttttt gttttaattt tttttcgcgg ttgttggcgg 2520 

cggtagtttg ggtgcgattt tagttcgagt cggcgtcggg gtcgcgtatc gttttttata 2580 

ggtaaatttt gtgtatgttc gcgttttttt tttgtgtaat ttgcgagaaa tgggaggggt 2640 

cggagattta tagttatttc gcgttttttt taagtttgga cgtttcggga gttcgttcgt 2700 

attttaagtt ttgtttgaga ggtagaaggt tcgcgggaat aaaagtcgcg atatttcgtt 2760 

ttcgtcgtgg gtcgagtcgc gttagtttta gtgcgttatc gaggttttcg ttcggttttt 2820 

ttttcggggt tgcgcgttcg tttgggaggg ggtagagacg ttgtcggggt ttttggcgga 2880 

gtttcggggt ttttttgttt tttattttcg gattattttc gaggtttaat tttgtgttgg 2940 

gttgtaaagt ttttggcggg gaggaaataa ggcggagagt gggatgtgcg tttggaaagg 3000 

gtagtggcgg atggtgggga ggtacgggtt ttgttttgtg tatttaagga ggcgtttgtt 3060 

tgattaaggc gcggtttttt ttttggtagt tttggggatt ttggtttagt ttttttgggg 3120 

gtataggatg ttggggaggg ttcgaagggt ttttttttta gggtgtagat aaaaggatcg 3180 

aattgagtga agattaagac ggagaagatg gcgtttttgt agtgtagtaa agaaaagttg 3240 

tgtggaggtt gtagtttagt gaaatttatt tattattagg tgtataataa gtttttttta 3300 

tttatttaat tttcgaatat ttagttaacg tttgtgtata atgattgttt ttttttatat 3360 

ttgaagatgt ttagaagttg tttgtatgat agttgggtta gaggaggttg tattttattt 3420 

cggatttcgg ttgggttttt gtttggtatt ttaatatttg taattttttt tattcgtttt 3480 

tttttttttt ttttttagtt ttttagtaat tttttaaaag aaaaaaagta attttcgggt 3540 

agatgttttt ttgtaatttt attgaggttg tatagattat tatattatta gttttttagt 3600 

ttttaatttg taaggttatt gaattttttt tttttttgat ttttatttat aagtggttta 3660 

ggaattattt tagggttagt taagataggt tgttttaaga gttaaagaat ttcgtttttt 3720 

atgttttaaa aaaatgtatt taattaagag aattttgatt ttttttatga tttatttata 3780 

tttatttgta gaattatttt agttttttcg aatggtttta ttatagatta aaaattatat 3840 

atttaaggaa ttaaaagtat atgataatcg tatttttgaa gtttttgtta aaatttaaat 3900 

atgtaatttt gttgagtaat attttttttt gtataaaaag aggaagtatg taggtagatt 3960 

cgaatttagt ttttggtttg tttaattgta ttaatatgtt ttttttgtta ttttttggga 4020 

tttttttagt agttattgtt aattttttta aagggtggag gtgagattta tattatgtta 4080 

ttttgcgaag tgttggtaga agggtaggta atttagttta ttgaggtttg agggtggttt 4140 

atttggtttt ttttgtgttt attgaaaaga tgtaattacg agttgattgt tttttataga 4200 

gtttagaaaa taggtttgtt gtaggtggtt gtagaaaaga aatgaataag ggtaatgata 4260 

atggggtttt ggttatcggg ggaaatatgg ttattatttg ggttgatgat tttggttggt 4320 

atgttgatag aggtgtgagt ttaataaatt ttaatttata tgatgtttta ttattttaat 4380 

atttatttat ttatttattt ttaaatttag ttttttttta agttttattt ttttttgttg 4440 

agggtttaat ttagttataa ttattttatt tttgtggtag tgttaattta tagt 4494 

<210> 286 : i ^ 

<211> 4489 ' 5 1 M ' • 

<212> " DNA ■ i * ■ • 1 i • > ' 4 ' • i : i iv ( " . . V ! 

<213> Artificial sequence ^ - * [i ■- 
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<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 286 

taaaaagatt taaaaagata atgtataaaa gatttaagaa ttaaaaagat aattttagaa 60 

agaggaaatt taaatagtcg gtaagttttt gaaaaagtgt ttaaattttt tagtgttttg 120 

ggtagaataa attaaaataa taataagata aagttatatt tattaagttg ggaaaaagga ISO 

aaaataattt ttggtgagat agtaaaagta aagggattta ttatatattg ttgtcggggt 240 

atatattggt gtatttattt tggaaaaggt ataattagtg aagttgaaga taaatatacg 300 

tattttataa aaatttatgg taatgtatat taagaatata tgtataagaa tatttataat 360 

agtattgttt ataattatta aaaacggaat ataatttaaa tgtttattaa tattagatta 420 

aattgtggtt tatttttatt aagatttttt atattatatt tattatatta tagattatta 480 

taatttaata taagaattag agttaaatgt attaatagag ttatatttta gttgtatgag 540 

agaaaaagat atataatgat atgtaatata taatttattt atatataatt aagaaatagg 600 

taaaacgaat ttaaattgtt taggaatgtt gatataggtg gttaagaaat attaagaaag 660 

aaagtagtaa atgattgtta taaaaattag tatagtgtta attattgaaa agaattgagg 720 

aaagggatga ttaggaatag aatggagagg ggagtttttt aggtgttata gacgttttag 780 

tttttgattg tggtagcgtt tatatagagt ttatatttta tagtttttta ttatattatg 840 

tatcgatgtt tgtatgttta aatattaaat gtatggaaaa agatataaaa tatttaaaat 900 

tgtagttaag gtatattaat tgttttgttg gtattttttt tttttattat aaagtggtag 960 

ataattggtt atattataaa gatagataat tggttatatt ataaagaaaa ttttaaagta 1020 

taggagttta gttttgattt tatcggatat gattatcgta tataatatag aagatattaa 1080 

gggttaagta atgatgtata aacggggttt attgtttaaa tttatagaaa gtaaaacgtg 1140 

gaaaattagt aattttaatt gatagtgtat atttaaagaa tttatatcgg tcgggcgcgg 1200 

tggtttacgt ttgtaatttt agtattttgg gaggtcgagg cgggaggatt atttgaggtt 1260 

aagagatcga gattagtttg attaatatgg tgaaatttcg tttttattga aaatataaaa 1320 

aaattagtta ggcgtggtgg tatatgtttg tagttttagt tattcgggag ggtgaggtag 1380 

gagaattgtt tgaatttagg agatagaggt tgtagtgagt cgagatcgcg ttattgtatt 1440 

cgtttgggag atagagcgag atttcgtttt aaaataaaaa ataaataaaa agaatttatt 1500 

taatagaatt aagtattaat ataataaata cgaagaattt tagatttttg gtttttaaaa 1560 

aatatataaa gatgatattt ttttaaaata tttttataaa atatattgag attgtgatgt 1620 

tttatattga ttgtatgaaa ataatgaaaa agaattagta ttgttttatt ataaaagttt 1680 

tattaatgta aatttataaa ttttttttta aatattttga gttaatttta attttatgat 1740 

agaaatttat tatttttagt aaaaatagtt ggtatttggg aaattaaagg tttaaaaatt 1800 

aagaatagta attaaagaaa tttgataaaa tagttttttt aaaattttta tttatattat 1860 

aaggggaaat tttgattacg tttttttttt tttattaatc gtagaattta atattaagga 1920 

ttatataatt ttatattttt tttcgagaaa aagtaaaggt tttgtgttgt agtaataacg 1980 

taagatatgg agggaagttt tatttaagat ttttttgttt gttttttttt taaagttatt 2040 

ttagaatatt agggagggtt gagaggtaag gtatgaaggg cgtaatattt aatatgagta 2100 

acgcgtgtga tgtatttggt taaaatgtat atagaggatt tgtttttgtt tttagataga 2160 

agtttttcgt tttgtagtta tgagggttaa ttgttgaggt tttatagttt tttttttttt 2220 

tatattcgga tcgttacgtt ttttatttat tatttcgatg tagaggtaga tttaggattt 2280 

ttgtatttgt taaggatttt tcggtaagtt tacggggcgg gagtggttat aagacggagt 2340 

tcgtttggtt ttggtttttt cggtttatat aagtttgttt tttttttaat ttttaatttt 2400 

tatagttttt tattttttta ttttcgattt attttgcgtt atcgacgttt ttggttttcg 2460 

tttgtagtaa gtttattttt attattattt ttcgtataaa agtttgtatt tattaggtta 2 520 

aagaggggaa ttaacgtttg taggaatcgt tttatcgaat cgtttggtcg cgttttttgt 2580 

tagattttat ttgtcgttgc ggatcgtata taattatttt cggtatgttt cgcgtacgta 2640 

ttattttttt tatttcgttt ttttttcgtt taaatacgtg attttttttc gttttcgttt 2700 

acgtttattt tcgttttttt attttttttt aggaaggagg agggagttgg gggtgttaaa 2760 

agcgtagcga tttttttttt tttttcgttt tcgtttttgt atttttcgtt ataatgtttt 2820 

tcgggtcgtt agcgtttcga cgttttttgg gaaaatagtt tatttttttt tttttttttt 2880 

tttttgtttt taattaatta gttattcgtt agagagggat atgcgtagtg agtgtttttc 2940 

gtttttttta ttcgaatttt tttttttttt taagtagaga gattttagta gtagtagtag 3000 

ttgatgatga agagagaggt agtggtagag ggggggtatt ttttattttt atttttaaag 3060 

ggataggata ttaattttat tttattttaa ttttgaattt aggggggtgg ggggaaggcg 3120 

gttgagtttt tttttttatt ttttagtttt gagttttgag agggggattg agtttgagag 3180 

aggagaagga gttttttttt tttcgaaaat ttttatttac gatttttatt tttttatttt 3240 

tttaattcgt tttttttttt ttattttttt tttttttggt cgtgagagga ggagagaaag 3300 

aaattaaaag ttttttagta atatagattt tttgttgttg ttgttgttgt tgttgttgtt 3360 

gttgttgttg ttgttgttat tgttgttgtt gttattgttg ttgtttggtt ttggttggag 3420 

atattttatt atatttagga gtagttattt ttttagtttt tttttttttt tttcgttttt 3480 

ttttttttta tttttttttt ttattatttt ttgtgttatt ttttatagtt ttagggaagg 3540 

tatttaaaag tggggggtag gaaaggtaag tgtgtttgtg ggggttttat tgtttttttt 3600 

tggttttgat ttttatttcg gggtaatagt agtattaaat tatatacgta gtggagtttc 3660 

ggggtgagga gggggtggtg ttgggggggg tgaaggaggg gttggagtag ggaggggtgt 3720 

gtflaflgtggg _g_tgfcttattt_ tgttagggga ggaagggtat 1 1 1 1 at tttt_atttg ttgtt— -3780- 
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gttaaaagtg gttttttttt tttatttaat atttgattgt gttttgtttt agtgggggag 3840 

aatataaaaa taattttttt ttttttttaa tgaggcgtta gggaaagaga tagaaagagg 3900 

tatatttttt agatgttatt taaaaaaaat ttattggaga gttttttttt tttaggaaaa 3960 

gttttattgt atttgttttt gagtgggaaa atgtcgggat ttggattgta ttggaattgt 4020 

ttatttattt gtagatttga gtgttttttt ttttttggtt ttgtatgaga tttggatatt 4080 

gattttagtg ttggaagatt tgattggttt ttgttttaag ggttttattt tttatttttt 4140 

cgttttgttt gatttttatg gaaaatttta aatttttagt tggtagaggt ttttggttag 4200 

gttttgattt gtattattgt tttttttttg gttaaatgtt tgttttatat attagtatat 4260 

aaagtttaaa gttttaaaat gtttaaagtt taaatgttat tgttattgtt tgtttttata 4320 

tacgtttttg atatttattt tttatatgat gagaatatta gttattttat aattaatttg 4380 

ggtatatttg taatttttgt aatttgtttg aataagtaat ttttattgaa aattttatgt 4440 

tgttttggtt gtatggaagg ttaaattttt ttattaatag tttgttgga 4489 

<210> 287 
<211> 4489 
<212> DNA 

<213> Artificial Sequence 
<22Q> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 287 



60 



tttagtaaat tattgatgga gaggtttggt tttttatgta gttagaataa tataaagttt 

ttaatgggga ttatttgttt agataagtta taaagattat aaatatattt agattaatta 120 

taaaatagtt aatattttta ttatatggaa aataagtatt aggaacgtat gtgggaatag 180 

ataatagtag tggtatttaa attttaagta ttttagagtt ttggatttta tatattgata 240 

tatgaaatag atatttaatt aaaaaaaaag taataatgta agttaaaatt taattaaagg 300 

tttttattaa ttaaaaattt gaaatttttt atggaagtta gataaaacga aaaaatggaa 360 

aatgaaattt ttaaagtaaa aattaattaa gttttttaat attgaagtta atatttaaat 420 

tttatatagg attaaaaagg ggaaaatatt taggtttata aatggataag tagttttaat 480 

ataatttaga tttcgatatt tttttattta gaaataaatg tagtggagtt ttttttggag 540 

aaaaagagtt ttttaatgaa ttttttttaa gtgatattta gaaagtgtgt ttttttttgt 600 

tttttttttt ggcgttttat tggaggaagg agggggttgt ttttgtattt ttttttattg 660 

gagtaagata taattagatg ttagatggag gagagaaatt atttttgata atagtaaata 720 

aaaataaaag tgtttttttt tttttgatag gatgggtatt ttattttata tatttttttt 780 

tattttaatt ttttttttat tttttttagt attatttttt ttttatttcg gagttttatt 840 

gcgtatgtgg tttggtgtta ttgttgtttc ggagtgaaag ttagagttag aggaaggtaa 900 

tgaagttttt atagatatat ttattttttt tgttttttat ttttgagtgt tttttttgga 960 

gttgtggagg atgatataaa gggtaataaa gggggggagt ggaggaggag gaggcgaagg 1020 

aggaggagga gagttgggga agtggttgtt tttgggtgta gtgagatgtt tttagttagg 1080 

gttaagtagt agtagtagta gtagtagtag tagtagtagt agtagtaata gtagtagtag 1140 

tagtaatagt agtagtaaag ggtttgtgtt gttaagaggt ttttggtttt tttttttttt 1200 

ttttttacgg ttaaagagga ggagggtgga gggagggagg cgagttggag gggtgagggg 1260 

gtaggagtcg tggatggggg ttttcgaaga agaagaaatt tttttttttt ttttaggttt 1320 

aatttttttt ttagggttta gggttggagg gtgggggaag gaatttagtc gttttttttt 1380 

tatttttttg aatttaaggt tgaagtgggg tagaattagt gttttgtttt tttaaaaata 1440 

gaaataaaag gtgttttttt tttgttattg tttttttttt tattattagt tgttgttgtt 1500 

gttggggttt ttttgtttgg gggggagggg gggttcgggt agaagagacg ggaggtattt 1560 

attgcgtatg tttttttttg acgggtggtt ggttggttga aggtaggagg agggggaggg 1620 

gaggaaatga gttatttttt tagaaggcgt cgaggcgtta gcgattcgga agatattgta 1680 

gcgggaggta taggagcggg ggcggggagg aggaggaagt cgttacgttt ttaatatttt 1740 

tagttttttt ttttttttta agggaaagtg gaggaacgga agtgggcgtg gacggagacg 1800 

aaaggaggtt acgtgtttgg gcgggagagg ggcggggtgg gagaggtagt gcgtgcgcgg 1860 

ggtatgtcgg gagtggttgt gtacggttcg tagcggtagg tgaagtttag tagaggacgc 1920 

ggttaggcga ttcggtgaag cgatttttgt aggcgttggt tttttttttt gatttggtaa 1980 

atgtaggttt ttatgcgaga ggtaatggtg ggggtaaatt tgttgtaaac gaaggttagg 2040 

ggcgtcggtg gcgtaaggtg aatcgaaagt gggaggatgg aaggttgtgg agattgggaa 2100 

ttgggaaggg ggtaggtttg tataggtcgg gaaggttagg attaggcgag tttcgttttg 2160 

tggttatttt cgtttcgtga gtttgtcgag gaatttttga taagtgtagg gattttgagt 2220 

ttatttttgt atcggggtag taggtgagga gcgtgacgQt tcgagtgtaa gagagaaggg 2280 

aattgtgaag ttttagtaat tgatttttat gattgtagga cggaggattt ttatttaggg 2340 

atagagataa gttttttgta tgtattttga ttagatgtat tatacgcgtt gtttatattg 2400 

gatattgcgt tttttatgtt ttatttttta attttttttg gtattttgga gtggttttgg 2460 

ggaaggagta ggtagg^aag ttttgagtgg agtttttttt tatgttttgc gttgttgtta 2520 

taatataaag tttttgtttt ttttcggaga gggatgtggg attgtgtagt ttttalatgtt 2580 

gagttttacg attgatgaag gagaagggac gtgaitta&ag ttttttttta tagtgtagat 2640 

ga^agtttta aaafcjgattgt tttgttaagt ttttttggtt attattttta gtttttgagt 2700 
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ttttggtttt 

agaattaatt 

gtgaaatagt 

ttaatatgtt 

aagaatttaa 

tttgtttgtt 

gcgatttcgg 

tttcgagtag 

tagtagggac 

attttttcgt 

cggtgtaggt 

ttttgtaagt 

tgtattgtgt 

tttttttgta 

taatgaaaaa 

tttgtgtttt 

gaaaaattat 

gtggtatttg 

tttagtggtt 

ttttttaatt 

attgtatgta 

gggatataat 

agtataataa 

taataaatat 

ttttgtatat 

ttttaatttt 

agtatatggt 

aatttgatga 

agagtttgag 

tttttttaat 



ttaaatgtta 
taaaatattt 
gttggttttt 
ttgtaaagat 
aatttttcgt 
ttttgttttg 
tttattgtaa 
ttgaaattat 

ggggttttat 

ttcggttttt 

tttttaggtg 

ttgggtagtg 

acggtaatta 

gtgtaattaa 

aaaaaatatt 

tttttatata 

aaagtgtgaa 

agaagttttt 

aatattatat 

atttatgtta 

agtggattat 

tttattagta 

gtatagtata 

ttggattgtg 

gtatttttaa 

attagttatg 

agattttttt 

atgtggtttt 

tattttttta 

ttttaaattt 



gttgtttttg 
aagaaaggat 
ttttattgtt 
attttgaagg 
atttattatg 
agacggagtt 
tttttgtttt 
aggtatgtgt 
tatgttggtt 
taaagtgttg 
tatattatta 
gatttcgttt 
tgttcgatgg 
ttgtttgttt 
agtaaggtag 
tttagtattt 
ttttatgtaa 
ttttttattt 
tgatttttat 
gtatttttaa 
atattatata 
tatttagttt 
aaaagtttta 
tttcgttttt 
tgtatattat 
ttttttttaa 
atttttattg 
attttattgt 
aaaatttatc 
tttgtgtatt 



seq_prot. 
ttgaaaataa 
ttataaattt 
tttatataat 
aatattattt 
ttggtattta 
tcgttttgtt 
ttgggtttaa 
tattacgttt 
aggttggttt 
ggattatagg 
gttaaagtta 
atgtattatt 
aattaaaatt 
ttatagtgta 
ttaatgtgtt 
aagtatataa 
gcgttgttat 
tatttttgat 
ggtaattatt 
gtaatttagg 
ttattatgta 
tagtttttat 
atggaaatga 
agtagttatg 
tatgggtttt 
agtagatata 
ttttattaag 
tgttttaatt 
ggttatttga 
gtttttttaa 



txt 

tgaattttta 

atattagtaa 

taatataaag 

ttgtatgttt 

attttgttgg 

ttttagacga 

gtagtttttt 

ggttaatttt 

cgattttttg 

cgtgagttat 

ttaatttttt 

gtttagtttt 

agatttttgt 

attaattgtt 

ttagttataa 

atatcgatat 

agttaagaat 

tatttttttt 

tattgttttt 

ttcgttttgt 

tttttttttt 

attggattgt 

gttatagttt 

aataatgtta 

tgtagagtac 

ttaatgtata 

agttattttt 

tattttgttt 

attttttttt 

atttttttg 



ttataaaatt 

agtttttata 

tattatagtt 

tttaaaaatt 

gtgggttttt 

gtgtagtggc 

tgttttattt 

tttgtgtttt 

attttaaatg 

cgcgttcggt 

acgttttgtt 

tagtattttt 

gttttaaaat 

tgttattttg 

ttttaaatat 

atagtataat 

tagaacgttt 

tttaattttt 

ttttttagta 

ttgtttttta 

ttatgtagtt 

agtaatttat 

agtttaatat 

ttatgaatgt 

gtatgtttat 

tttcgatagt 

tttttttttt 

aggatattaa 

tttagaattg 



<210> 288 
<2U> 4395 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 288 



ttttttttgt 

tttgtttttt 

tgtattaaaa 

ttttagttaa 

tagtaggagg 

agtaaatata 

tataattgaa 

aaaaatataa 

aattgttaga 

ttttgggatt 

ggtaaatata 

agtgggatta 

tttttgaatg 

gaatagtttt 

tttagataaa 

ttgtgggtat 

taatttcgtt 

tttaatatta 

gtattatatt 

atttgtaatg 

tttttataag 

ttttttattt 

tttttattta 

ttattatagg 

atattaatat 

tattttaaat 

ttaatttaat- 



atgttttgtt 

tgttaaaaaa 

atagaggata 

attaattaat 

aaaataggaa 

tttgtttttt 

atagaaaaga 

tataatttta 

agtcgattta 

tttttaagga 

ttatttattt 

taggagttat 

aagagagggg 

atgattgttt 

ttttatagat 

taagtattat 

ttttagtatt 

aattaatata 

gggttaggtt 

ttattataat 
ataaaatatt 
ttttttattt 
aaatgaattc 
tttatatatt 
ttaaaatatt 
taaggtattg 
ttgttatgat 



taagttttat 

aaaaaaaagt 

tttttataat 

tatatatatt 

aatataaaat 

ttttggttgt 

aaaattagag 

agataaaata 

gaatttgtaa 

attaatgtaa 

taaaggtaat 

attgtattta 

ttagaaggta 

ttgatatatt 

tgtggaggag 

tattggaatg 

ttttaagttg 

tagaaattga 

tttagtgtta 

ttaaatttgt 

agtaaaatag 

attttgttat 

gggggagaga 

tatatttaaa 
gttaattgtg 
gttgttgttg 
tttgtttata 



attgtattga tttttgtgtt 
tttaaaaaga gttattattt 
atttattagt aaattaattt 
ataataggtt tagatatttt 
aattcgaggt tttttattag 
tttagataaa tggaggatga 
ataaaatata ttgaaatgat 
atattaaagt agattaattt 
aaattaattt gaaatgtata 
gttaagaatg gatgagattt 
atgttaattt atttatgtta 
aattattttt agggaatggt 
aaaaggagga aaaggagagg 
gtagtgtggt tgtattttag 
gagaataagt aagaaataga 
ttgtttataa aaatgttttg 
tgaaattaat tttttttaat 
ttattaattt gagaaacgtg 
aagagattgt gatttgtaat 
tttattttcg tatttttgat 
taagtaagtt agtttttata 
ttaacgatat aaaaggttta 
ataatttata aatatttttt 
tattatttta agaaacgtat 
tgttatttta tgttttagtt 
ttattgtgtg tttttatagg 
1 1 1 ag a at a t a t ag g t a tg t 
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tttgaggtgg 

aattttttat 

ttaaatgttg 

ggaatattaa 

tattttttta 

ttaaatttgt 

taaattttat 

ttttattgtt 

aattaggaaa 

aaatttttaa 

tatatagtat 

tttgttagta 

taaataatga 

aaaataaata 

ttaatgtgga 

aatttttaag 

tttaatttga 

attttcgaaa 

ttaatagtaa 

ttaattaata 

atttgatttt 

tgtttttata 

gaattaaagt 

ttaggttttg 

agtttatttg 

aaaaaaaaaa 



2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4489 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



atatttttta 



1620 



seq_prot.txt 

gtattaggat gtgtatttgt ttatttcgtt ttttttgttt tgatgaaatt atattataga 1680 

ttacgtattg tgtttaattt agtagtaaat tttttttaat tgcgtggttc gcgattaata 1740 

tttttatttt tgtttagtaa tgatatattt ttattatata tttttttata ttgtttttta 1800 

ttcgatgatt cgtgatatta tttttttttt attttttttt atttttatat tgttcgcgtt 1860 

attatatatt tataaagcga tattagcggt ttttagggcg gaaaggggtg gaagttgatt 1920 

ttcgtttttt tttttagcgt tggttttagg tgtgtttttt gttatttttt gtattgcgaa 1980 

taggggttcg tcgagtttcg ggagttttta gaagaggaag atttttttgg ttttattagg 2040 

tattattcgc gttttttcgt tttttatttg cgttttcgtt tgggttaatt tttgtcgtac 2100 

gtgtttattt ttgaattgta cgttattttt tattttcggg gggtttttgc gtattgaaag 2160 

atcgtttttc ggtaggtttt gggattcggc gacggttgat cgcgcgtcgt ttttacgttc 2220 

ggttttacga tgttgtaata tagaaagttt acgtcggttt cgattcgcgc gggattgtag 2280 

ggttcgtcgg agcgcggcgt agaggttttt tttgcgcgtt cggtttcggg aaaggggcgg 2340 

gagggttggt ttcgggagcg tacgggcgcg gcggggaggg tatttattgt gaagtacgtt 2400 

gcgtttatgg attatgtttg tgcgttatat tagaggtttc gggttttatt aattttattt 2460 

agagacggga agatttttag tggcgggggg aggatagggt cgagaggtgt taaagacgta 2520 

aagtaagaag gaaataaagg ggggtcgaga gggagatcga gaggaagggg gagtttcgag 2580 

tttacgttgt agttagattc ggatgagttc gtttttcgtt tcgggcgggt tttcgttttc 2640 

gttggttttt agcgtcgcgt agttagtagt atttttatcg tgacgttcgt attatattcg 2700 

ggcgtcggtc gttattattc gcgtcgtcgt cgttaggatt tttttttcgg gtatcgtcgt 2760 

cgtcgcgggg tcgggaggac gcggcgcgcg ggaggcggcg gtcgtagggc gagtttcggg 2820 

acgtttcgag tcggggtcgg ggtcggggag agggcgtagc gaggtggggg ttagtttaga 2880 

tcgacggtag cgacggagcg ggcggcggcg gcggcgtcgg cggcggcggg gtggtttagt 2940 

ttttagtttt agacgcgtcg cgtagtaggt cggagtagtt ttttcgggag gatgtttagc 3000 

ggtagcgttt ttcgttttag tttttgggga tttttcgttg aggtattgaa ggtaggaaga 3060 

aggggttcgt tatcggttcg tcgggttgcg cgttattttt gttattttgc ggaaagagga 3120 

gcgggtgggt gggcgtttgg gaggcgggtt ggagggcggt gtaggggagc ggggcggtcg 3180 

gggggggggt cggggggcgg ggaagggagg gaggagaaag gagtcggaag agggtagagt 3240 

tattaaatgg gttttttagt tatggtttgg ggttttacga tttttttgga agttcggagt 3300 

ttgggtggga tagcgaggtt gcgcgcggtc ggcgtttcgg ggttggtgcg cggtagaatg 3360 

gggtcgcggc ggcggtagta aggatatttt agtcgcgcgg atttggggga ggggcgggga 3420 

gggggtgagg attcggttgg gattcgcggt tcggttcgtt agggcgtaga gagaggatgt 3480 

agtcgtaaat ttcgagtcgg attttcgtgt cggacggaag gcgtggaagc gggaggggtt 3540 

ttcgtgtgaa aattttttgt ggggtttggt gttttatttt ttaaaggtta gattttgcgg 3600 

gttttttgtt ttttattttt ttttttttat tcgcgtaaag gaattgggcg tttttttttt 3660 

tttttttttt ggggcgtagg tttcgtcgcg gatttcgcgt ttagtttggg agatacggta 3720 

ggggcgcgtt ttagggaaag gcggtcgtaa aagtttcgcg gttgagtatt gggtttgatg 3780 

tttagttttt ttattaaatt atttttgtaa agacgcgggt tttttgtaat tgagtttttt 3840 

atttcgaggt atttaaaatt attttaaggt atatacggat tttcgttttt tcgcgttatt 3900 

tttttttata ggttcgcgcg gcgcgttaaa gtttgggaga tacgagttgc ggggaaatag 3960 

tatcggaaga gttcgggttt gtaaaatgcg aattaatgaa tacgaaataa gggtaatcgc 4020 

gaggtagcgt cgggaagggt tggagcgcgc gggggtgtag ggagtttttt tttgggcgtc 4080 

gtaacggtat tttttttttt tttcggtttc gtttcgtttt ttttcggttt ttgcggacgc 4140 

gatttacgta gattatagtc gagttcgtcg gatttcggga ttacggaagt tatttcgttc 4200 

ggttatttat attcgcgtcg cggttggggt ttgggttggg gattgtggcg gcgaagaagt 4260 

cggggtagga agagttaatg gtaatggcgg ggtcgtcggg gggcgggggg ttagtagtag 4320 

acgttgagtt gtgaagatta gggtggtttc gaaaggtcga ggaaaggaga aagggttatg 4380 

agaagagttt ggcga 4395 

<210> 289 
<211> 4395 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

> 

<400> 289 

J! * 

> <■ ■ 

tegttagatt ttttttatgg tttttttttt ttttttcggt ttttcggggt tattttagtt jBO 

tttatagttt agcgtttgtt gttgattttt cgtttttcgg cgatttcgtt attgttatta 120 

gtttttttta tttcggtttt ttcgtcgtta tagtttttaa tttaagtttt agtcgcggcg 180 

cgggtgtggg tggtcggacg ggatggtttt cgtagtttcg gggttcggcg ggttcggttg 240 

tggtttgcgt gggtcgcgtt cgtagaggtc gagggagggc gggacggagt cgggaaaggg 300 

aggaggtgtc gttgcggcgt ttagggaggg attttttgta ttttcgcgcg ttttagtttt 360 

tttcggcgtt gtttcgcggt tgtttttatt tcgtatttat tgattcgtat tttataagtt 420 

cggcjtttttt cggtgttgtt ttttcgtaat tcgtgttttt tagattttaa cgcgtcgcgc 480 

g^dtttgt&g gaggaagtgg cgcgggggaa cgggggttcg tgtgtgtttt ggggtggttt 540 

taaatatttc gaggtggggg' gtttaattgt aagaagttcg cgtttttgta ggagtaattt 600 
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seq_prot.txt 

ggtgggggga ttggatatta ggtttagtgt ttaatcgcga aatttttacg gtcgtttttt 660 

tttggggcgc gtttttgtcg tgttttttaa gttgagcgcg gagttcgcgg cgaaatttgc 720 

gttttaggga gggagggaga gggggcgttt agttttttta cgcggatgga aaagaagggg 780 

tgggaggtag agagttcgta aggtttaatt tttaaaaagt gaaatattaa attttataag 840 

ggatttttat acgaaggttt ttttcgtttt tacgtttttc gttcggtacg aggattcggt 900 

tcgggatttg cggttgtatt ttttttrtgc gttttggcgg gtcgagtcgc ggattttagt 960 

cgggttttta tttttttttc gttttttttt tagattcgcg cggttgggat gtttttgttg 1020 

tcgtcgtcgc ggttttattt tgtcgcgtat tagtttcggg gcgtcggtcg cgcgtagttt 1080 

cgttatttta tttaggtttc gggtttttag gagggtxgtg gagttttaag ttatgattaa 1140 

ggagtttatt tggtaatttt gttttttttc ggtttttttt tttttttttt ttttttcgtt 1200 

tttcggtttt tttttcggtc gtttcgtttt tttgtatcgt tttttagttc gttttttaga 1260 
cgtttattta ttcgtttttt ttttcgtaag atagtagagg tggcgcgtag ttcggcgagt . 1320 

cgatgacgga tttttttttt ttgtttttaa tgttttagcg gaagattttt aagggttgga 1380 

gcgaggagcg ttgtcgttgg atattttttc ggggaggttg tttcgatttg ttgcgcggcg 1440 

cgtttgagat tggggattga gttatttcgt cgtcgtcggc gtcgtcgtcg tcgttcgttt 1500 

cgtcgttgtc gtcggtttgg attggttttt atttcgttgc gttttttttt cggtttcggt 1560 

ttcggttcgg ggcgtttcgg ggttcgtttt gcgatcgtcg tttttcgcgc gtcgcgtttt 1620 

ttcgatttcg cggcggcgac gatgttcggg aggagggttt tgacggcggc ggcgcggatg 1680 

gtggcggtcg gcgttcgggt gtgatgcgag cgttacggtg gggatgttgt tggttgcgcg 1740 

gcgttgaggg ttagcgagag cgagagttcg ttcggggcgg aggacggatt tattcggatt 1800 

tggttgtagc gtgggttcgg agtttttttt ttttttcggt ttttttttcg gttttttttt 1860 

attttttttt tgttttgcgt ttttaatatt tttcgatttt gttttttttt cgttattgga 1920 

agttttttcg tttttaaatg gaattagtgg agttcggagt ttttggtgta acgtatagat 1980 

atgatttatg ggcgtagcgt gttttatagt gagtattttt ttcgtcgcgt tcgtgcgttt 2040 

tcggagttag ttttttcgtt tttttttcgg ggtcgaacgc gtaggaaaag tttttgcgtc 2100 

gcgtttcggc ggattttgta gtttcgcgcg ggtcggggtc gacgtaaatt ttttgtattg 2160 

tagtatcgtg gaatcgggcg tgggggcggc gcgcggttag tcgtcgtcgg attttaaaat 2220 

ttgtcggaga acggtttttt agtgcgtagg aatttttcgg gggtggagga tagcgtgtag 2280 

tttagggatg gatacgtgcg gtagagattg gtttaagcga gggcgtaggt ggaaagcggg 2340 

agagcgcgga tgatatttag tggggttaga ggagtttttt ttttttaggg gttttcggag 2400 

ttcggcgggt ttttgttcgt agtataggag gtagtagaag gtatatttga agttagcgtt 2460 

ggagggaagg gcgagggtta gtttttattt tttttcgttt tggagatcgt tgatgtcgtt 2520 

ttatgggtgt gtggtaacgc gggtagtgta gaggtgggaa ggaatgagag gaaaatagta 2580 

ttacgagtta tcgagtaaaa ggtagtataa gaaaatgtgt ggtggaaatg tattattatt 2640 

aagtaaagat gagagtgttg atcgcgggtt acgtagttga gaggggtttg ttgttgagtt 2700 

aaatatagta cgtggtttat aatgtgattt tattaaaata ggaagagcgg agtaaataaa 2760 

tatatatttt gatgttgaag gatgtgtata tttgtgtgtt ttgggtgtga gtagaattat 2820 

gataaattaa attgattttt ttttttttgt ggaaatatat agtaataata gtagttagtg 2880 

ttttaattta aaatgtaaat gaattggtta gagtataaag taatatataa ttgataatat 2940 

tttaaatatt aatattaaga tttagatacg tttttrtagag tgatgtttaa gtataaatgt 3000 

ataagtttgt ggtgagtttt aatttaggaa atatttgtga attgtttttt ttttcgaatt 3060 

tattttaagt gaaaatgtga aaatatagat tttttatgtc gttggatgat aaaatgaata 3120 

agaaaaaata gaagaaaaat taaattatga agattagttt atttattatt ttgttagtgt 3180 

tttgttttgt aaaaatgttg attagattaa gaatacggaa gtaaaataag tttagattat 3240 

aatggtattg tagatttatt gttaagttat agattatagt ttttttgata ttggaaattt 3300 

gatttaatat aatgttttcg gaaattacgt tttttaaatt aataattaat ttttatatat 3360 

tgatttggta ttaaattaag ttaaggttag aagagattaa ttttatagtt tagaaagtgt 3420 

tgagggacgg ggttatttgg aggtttagaa tatttttgta ggtaatattt taataatggt 3480 

gtttaatgtt tataatttat attaatttgt tttttgttta tttttttttt ttatagtttg 3540 

taggatttgt ttaagtattt attttttaaa atatagttat attgtagtat gttaggaata 3600 

gttatggagt tgtttttatt gtttgttttt tttttttttt ttttttattt tttggttttt 3660 

ttttttattt agagatgttg gtagagttat tttttaaggg tagtttagat atagtgtgat 3720 

ttttgtagtt ttattgtatt gtgtgtgata tgagtaaatt gatatgttgt ttttagagtg 3780 

aatgatgtat ttgttttagg agtttgggtt ttatttattt ttaatttgta ttagtttttt 3840 

aagggagttt taaggttttt tgatttgtat attttaaatt ggtttttgta agttrtgaat 3900 

cgatttttaa tagttaataa tgaaaaaatt aatttgtttt gatattattt tattttgaaa 3960 

ttatgttatg tttttatggg atttgattat tttagtgtat tttattttta attttttttt 4020 

tttgttttag ttgtagtaag tttaattatt ttttatttgt ttggggtaat taagaaagga 4080 

ataggtgtgt ttatttagaa aaatgttagt ggaagatttc gaattatttt gtattttttt 4140 

attttttttt tattgttaat attttagaat atttgaattt gttataatat gtataattga 4200 

ttggtttggt taagataata tttaaaaatt aatttattaa tggatattat aagaatgttt 4260 

tttattttta gtataataga aaattggatg atggtttttt ttaaaatttt tttttttttt 4320 

taatagaaaa gtaaattatt ttagaaatat agaagttagt gtagtgtgga atttggataa 4380 

aatatgtaga gggag • ■ m. ■ . 4395 



<210> 290 
<211> 10490 

<212>-DNA — 
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<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 290 

agaaggtgtt tgtttttttt tcgttttttg ttatgattgt aagttttttg aattgggagt 60 

cgattaaatt tttttttttt ataaattatt taggtttaag tattttttta tagtagtgtg 120 

aaaataaatt aatatttttt ttttgaggcg tttttttttt aggtaattcg ttgtttttat 180 

gttttttttt tgttttttgt tttttttttt ttttttatga ggtttaagtg ataaacgggg 240 

ttagttttag ttttagtttt agttttagtt ttattttatt gtaggtttgt gtggttgttg 300 

gagaggtcgt gttttttttt ttttttcgag tttgtttgat atgttttttg gattttggag 360 

gaaattgatt ttttattttt atattggtgt aatatttttt aagattttaa agttgtatta 420 

tttgagttag tttttttttt tatttttatt tgttagggtt gttattggga tagttttaga 480 

gggtggtgtt aatggatgaa tggatggatg gatagtagtt tagggatgat gtttttgttt 540 

gttttgaatc gggttttttt tttaatgaga agtttttttg agtgagtata tatagttatt 600 

ttttggtatt tatggaggat tagttttagg gttttcggga atgttaaaat ttatggatgt 660 

ttaagttttt gatataacgt ggtttagtat ttacgtataa gttatgtata ttttttcgta 720 

tacgttagat cgttattaga ttatttatga tgtgtaatat aatgtagatg ttatataaat 780 

ggtcgtgata ttgtattttt tagggaatga tgataagaat aaagtttgta tatgtttaat 840 

agaaatataa tcgtttaatt tattttttga atatttttta tttgttgttg ttgaatttat 900 

agatgtagag tttttggata cgagagttaa gtgtgttttg agagtagggt gggtgaggtt 960 

gttaatgagt ataggggagt aggtgttgat taggaggatt ttgtattggg gtatttggac 1020 

gttttgtttt aggatttgag attttagttg gatggtatag gtagatttag tttaggttaa 1080 

agtcgttttt ttgaagtttt tttttatttt aagttttttt tggattttgg aattcggtat 1140 

tttttaggtt ttgcgtggaa ggatagatga attaggtttt agataatatg tttagaagag 1200 

tgagttttta ttgtgtgttc ggtatttttt ttataggatt ttttagttag aatatttaag 1260 

ggttatggag agaaatattt agttaaaata ttagaaaaga aaaagtgata ttattagaga 1320 

tattaaaaag attattaggt aatagtatta gtttttgtat tttgagattt aatagtagta 1380 

gttatttttt tttatcgtta tgtgtatttt aggattattt tgggcgggga gggttgcggt 1440 

tagggagtag ttatggatgt tttgatgttg gttttgggtt tcgggggtga tagtgatgag 1500 

gaattgggtg tatatatgag tggggtagtc gggtttggtt agagaagtag tatatacgtg 1560 

tatagacgtg tttatttata tatatatgtg tacgtacgtg tataaatata ttgtaggtag 1620 

gtatgttgac gttttaggta gcggaggatt ttgattttgg gcgttgttga ttcgggtaag 1680 

gttttattgt gattcgtgtt atgattttag aatgttattg gtgtttagta tttgtttgtt 1740 

ttttggtttg ttttagtggt ttatagtagt tatatatagg tagtggtatt tgtgagtagt 1800 

tttgtggatt taaaggtttt ttttttgaga ggtatgattt aggttagttg atttattaga 1860 

attaggtgag cgtgatttgt tttttttttt ttaggcggat ttggggatag tggttacggt 1920 

gcgggcggtg ttggtttttg tggggtagtt atcgaggagg gttatttttg agtatttatt 1980 

aggcgttcgt tttatattgt tcgtgtagac gattggtttt ttcgttttta tggtggtttc 2040 

gtagagtggg tgttgttttt aaatgttttt attcgataga tgagacgttt ggggttagag 2100 

aggtagtaat cggtttggga attcggatat gattttgagt tttgttttta gttttgtcgt 2160 

gtgttgtgtt ggaatttagg tttgaatttt gtgatttttt tgttttagat tttaaatttg 2220 

tttaggtttt ttatttcgat ggggtagagt ttggttttgg tagagttatt ggtatagagt 2280 

tattggtata gatttattga cggtttttag aatatttttg tgttttaagt tgggttttga 2340 

tggtcgttgt gggttttatt gaatatatat ggttttttgt ttaggggagt ttgttgtttt 2400 

tgggtagttg tggaaaatga aggagttttg gagggttggt tgaggggaga ttattttttt 2460 

ttgtgtttaa aggggttcgg gtattagggt tttttttagg tattttttgt tttgcgtggt 2520 

ttttttgagg tttcgttttt tttttgtttc gagtattttt aggagggacg gtttatttag 2580 

ttgtttttta ggattaagga tttattgttt ttttttagtg atttaggaaa atgaagtttt 2640 

tttttgttgg gacggtttag aatggtggat tttatagttt tttcgcgaga gacgtggttt 2700 

ttatgcgtat aatagatttt ttttattttt taaatttaat atttttttgt ttaataggcg 2760 

ttatttttaa agtggtttta ttgtttagat tgaagagtta cggtagttaa agtgatgagc 2820 

ggagtagaat cgagtagtcg ggagagattt tgttttttgt aggaaattgg gtatcgttga 2880 

ggttttgagt attttaggag gtcgattgta tagagatttt tggtcgttga ttttagtttg 2940 

tttttatatt tttggaatag tttattatgg gttttttatt tttggtaggt ggaaattatt 3000 

taatttgttg gggtcggtgt gtttttattt tatggtattg ggggataata ggattttttg 3060 

tttaggtttt attgtattta agtttttggg aagatgttta tttttgtttg ggatttgaga 3120 

ttttagagat tggagtagtt gtgggttatt gggtttggtt tttttttttt tgggggcggc 3180 

ggtggaatgg gggttacgta gttagttagt atttgggagt tcggcgagag cggtttaggt 3240 

gtttttcgaa gtcgtcgcgt atagtgtgat ttttagataa ttttgtttta taggatggac 3300 

gtggtagagg tcgcgggtag ttggtgggta taagagcgag aggatattat tatgaaatac 3360 

gaaaaggtat aagtcggttt gttttttgga gggaggtttt ttttagtgtg ttttggttaa 3420 

agggttttgg gttttttagg agtatagggt agggacgggt ggttaatgtt tttaggtttt 3480 

tg-tatttttt attttggatt ttttattaag gttttttttg ggttataggg atatcgagtt 3540 

gggttgttag aggataaggg gtttaagttt tttcgaagtt ataataataa cgtcgattat 3600 

ttggggattg tatagtgagt tttttatatt ttttttattt ttaaagtatt tgttttagtt 3660 
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tagggatggg tttgttttta gaaaggtttt tttgacgtag gatatgtttt attaggtcgg 3720 

gttaattttt tttttaggga tagaattttt ttttgatttt tttgtaggtt tagttcgagg 3780 

ttgttaggtt agaggtgtgg ggtttattta gggagtcggt gggaatggag attgggttag 3840 

gttaggtttt tgggcgttta gtagttttgt cggtaagtga gtataagagg agcggggtag 3900 

tttgagggtt tggttttgtt tatttggaga taatttcggt gagatgtaag ggttatggtt 3960 

atagggtgag gggacgtttg gtttagtttt agggttgttg tttagtaggt ttttgaggqt 4020 

ttatttgttt ttgttttttt ttattttttt agagttatag tttttattgt ttcgtgaggg 4080 

gaaaaggtat ggtgataatg ggggttgtag ttttaggaga acgggggaga agatgggtag 4140 

ggtttcgttt tgggtatttt acggtgaggt tagggaggta gtagggttcg cggttaaaga 4200 

^JSSSJ** 9 Stgttgggaa gggatttggg gtcgggtaag aggagtttag ttaggagttt 4260 

attttttagg gattatagga tggagagata gaggattttt ggggaggtag ggcgggaggg 4320 

agttgacgag tcgtgttatt tttgaaacgt agggtgtgtg gttcgggtgt agggagaggt 4380 

aggtggatgt tgggaggtta gaatttgtaa gggttttggg gttgttaagt ggggtgggtt 4440 

tttggtgtag ttagagtata tcgggtaggt tttagggtag gtttttttga ttttggcggg 4500 

gggatgtggt tattttttga gggatttttg ttagggttcg gtcgtttatt ttgggcggtt 4560 

tttattttat tttagggtta atttttttta gttttagtag aaagtattat ttcgagttta 4620 

ggacgggtag ttttattggg tagtttgatc gttttttacg ttaggggttt tagtaatttc 4680 

ggttaggttg tttttatatt tttttttttt ttaggttttg ttttttttgg gagttagttt 4740 

tataggaagg tttttgtttt tttttttttg tgtttttttt tgggttgagt tttgagttgg 4800 

aaagggatag agttagtttt ttttgggggt cggtatttag gttggggtcg ttttaggttt 4860 

cgtgtagttt tttagttttg tttgggttgt tttatagtga gacggagttg ttttttttga 4920 

ttgcgcggga ggcgaaggta agagtttgat gcgtggaggg gttggtttag ggacgtaggg 4980 

attgggcggg tggttagtga ggtagaggaa gtagttggtt tgagcggtgg cgggtgaggg 5040 

taatacgttg ttattgggag gggtagtagt ttttgttgga tttgatttta ggttgttgtt 5100 

tattttggta gtttgataaa attttaaaag gagaattata gttttggttt gggggtggtt 5160 

gcgcgtttgt gttaggattt tatttagagg ttgggattta agattggtgt gtttgtggtt 5220 

tgaggatggt atatttcggg gttttaaagt tagtttattg gtgtttattt gtttaaaggt 5280 

ttttagtttt tgaggtttgt ttttttttgg ttttttttag ttggttttta ttagggtttt 5340 

agagtttaag atttagtatt cgcgggcggt tttgggaagt ttggtagttt cgttaatttt 5400 

aatatgtttt atttgatagt aaattcggcg ggagattagt cgaaagagta agtgggtgga 5460 

tatgttggga gattgggaga aatataaaag tagtagaaag gtaacgtgtg gagggaggaa 5520 

gtattttttg tagagatagg ggataggtat ttatggttgt ggtttggtat tattagtttt 5580 

HH 9 S§§ gt:g 9gcggtatat tgttttcgtt tagaggattg taggtttggt cgttagattt 5640 

tttgtttatt cgtgtaagcg ttattttgta gggagggaat ttgaatttag ggttgggatt 5700 

attcggagtt taaggttagg gatgttttgg tgatttgaag gaaggaaaag gtttagatta 5760 

9 fSJ t1:cgat tttgagtgtt tatttatttt tttagttttg ggaagggaga ttttgtttta 5820 

gtt l2fJ tt:t atttttattg aggaattatg gggttaaaat cgataatttt tagaattttc 5880 

gggttttggt ttttattggg gttatttcgt ggtttgtgat attagatcgt tttttgttta 5940 

tagtttatag atcgagcgta taagggaatg tttatgaata ttcggggttc gatgtggtta 6000 

gtttttttga atattgagga aatgaagttg aaaaatttcg gaagatatta ggtacgttta 6060 

gttagagtat aataaatagg ataggtcgtg tcggggttta ggtttttagt tggagggaac 6120 

gttaagatta ttttggggag ttgggggtga aggttagatg aatattttgg gtatagatgg 6180 

tgatatagtt attatagata aatttagttt tggtgatttt ttttggtttt agtaataagt 6240 

taaaatgtag ttttttgtag aaggaaattt tttttttgtt tttttttttc gaagtgttga 6300 

" g 5 ggg HS at i g H a * tg gg 99taggga gttttttatt tgttttgaga ttgttttttt 6360 

tttttggttt tgttttatag attatgaagg agaagggtaa gaggttattt gagtatattt 6420 

a ?«H= a H ga ^gggacgta agcgggatat taaggaagta tatatttttt agggatcgat 6480 

acggaattaa gtaagtttac gggagttata gggttttagt agagatgggg tgaatgagag 6540 

ggatgggggt tttttcggag tagaagttag ggttatttag gagggatgat atagttgtta 6600 

a 9fgtttttt cggtttaggg agtagtcggt attatgaatc gagtattttt ttggttttaa 6660 

gttttgggtt agattggaat atgtggggtt agaatttagg aggattttga ggagatggaa 6720 

ggtagtaaat aaaattatgt ataatggtga agggtgtttt ttttgattta tggggattta 6780 

tggtaggatt tacgggaggg tggtaggata gagggtttat gagttttttt taggtaatag 6840 

tgatagtatt aaatgttggg agaattaggg gttttggaaa tttttattta ggttcgttgg 6900 

gaa i a 5 gata jggtatagtt acgttggtag ttcgttgggt agtggtttat aaagttcgat 6960 

ggatttgaat tatatatttt taaagtgtta tagatattga atttattgat ttgtaaattg 7020 

atatttatat gaaattagta tgttaggttt attgtttgat ttttcgttat ttatatacgg 7080 

agttttcggg gacggttttt aatacgggga tggggagagt aaggttggtt ttttttttaa 7140 

acggaagatt tagtgagaaa agggaacgag tcggtgatgt tcgtacgaac gtgggtggat 7200 

H ta 5?!: 9ta "ttgttgag ggatagaagt tagatttaat aagttattat agtaggattt 7260 

^ a ™««B g 9 tta ttttgg aaaaggttaa attataggga ttgagaagta gtttgggtgg 7320 

^ agggg ! tg acgg fH ggg 9agaggttgg gtgtataggg gttattttgg agatttggig 7380 

gatgaaggag tcgttttagg aggggttgga gcggtggtcg ggagattttg tatattggtt 7440 

tggaatcgtg gaggaattgt atatttatag attgaattgg cgtgtgtgta aattgaaaaa 7500 

?* a f aaaa f a aattatttag agtgaaaagg attaggtaag ttattgtata attgggttat 7560 

ttgtatgtta tagatgtgga ttttattgaa atattttttt aagagtttta ggttttgaag 7620 

agtttattgt ttatttggtg aaatatttga atttgaaatg ggatttgttg ttaggttttg 7680 

tagataaagt gaaattaata atatttgtat aaaataaatt aaagtttttt ttttttgttt 7740 
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tttaggtagc gggaattatt ttatattttt ttggtatatg aggagtataa ttcggtgagt 7800 

attttcggta gtgaggtttt cgggttatat ttttatattg ataggagtgg gtgtttggtg 7860 

ggggtgtcgt tgtttttttt aaagttagta tttgtgattt attaggatat aggaggtagg 7920 

atgttagttt atcgttggta taaattttta aggaaggggg tggttttaag gggttaagtt 7980 

gagatataga ggagttaggg tttggatttt tggtgttatt tgggtttgat tattattttt 8040 

tagaataaga aatgacgttt tttttttggg gttgttttaa agtttaggag tttggtagta 8100 

tcgtatatag gatggtgtta ttagtagata ttttggataa ggtgttgaag tgtttgatgg 8160 

atttggtttt tgttatgaaa tgaatgtgta ttttgaggaa gttttttttt tagaggaagt 8220 

ttttttttta gaggaagttt ttttagttat ttttgttttt tttaatgata tgagtttttt 8280 

taggtgattt tagttttttt aggtgatgtt tttttatggt gattttggtt tttgtaggag 8340 

gtgggttatt gtagggattt gagttatatc gtcgttttgt ttttttttta tttttttgag 8400 

gaggatgtat tttgggtatt ggtgtagttg ttggttagtg agaggtattt tttgtagggt 8460 

aagtgaatag ttgtttcggg gattttttgt agttagattt ggggatggtt attttggtta 8520 

ggtgattata gtttttagtt aaggtatttt ttttgtgtcg ttagtttgtt gggagatttt 8580 

aggatgtttt tgttgagggt tttataggag tttacggttg atttttaaag tttaaattag 8640 

acgtttttta tttttattag tagagggtat tttatttttt tcgtggttat tttttgtgtt 8700 

ttggagttac gtttttcggt tttgattttg tgtagttgat tttttttttt ttgagagttt 8760 

ttttgttttt tagttgttcg ggtttttgtt gttatcggtg tttacgaatg ggtcgattaa 8820 

gtttaggtgg tagtattttt ttattttttg ttttttggtt cgattttatt attaggagat 8880 

gatcgggaag tttagcgttt atttagtttc ggttattttg tcgtggtttg aaagttaggt 8940 

ttgttttttt tgtattttgg tttaggaggt ttttagggga atttttagtt aggttttagg 9000 

gaatgttttc gttttatttt tttagggtaa aggtcgtatg ttggggttat tagatgggag 9060 

ggtgggaggt tttggggttt gggggttttt ttagttgttt agtttttgta gttgatggtt 9120 

ttatattttg ggggaaggtt ttgattttat gatgggttgg gggtttttta ggattttata 9180 

gtttaaatgg cgggatcgtt taggggtttt aagattaata ggagtatgtg gtagttacgt 9240 

tataatttaa gattatgggg tattaggtga gtttatggtt tttttagttt tttttagagg 9300 

ttttgttttt cgtggggttg taggagtagg ggggttggag tttttcgtgg ggttggtgat 9360 

tggttgagtt ttagttaggg tttgatttgg gacgtcgggt tttttatggg ttgggagttg 9420 

gttttttttt ttgttttgga ggagatagag gtatagggat gggggtttag ttttcgtaga 9480 

gtagggtaaa gggtagtgtg tttatcggga gtgtgggaag gtgatagtgt tgtggggagt 9540 

tttggatatc gtttagtgtt ttgtattagg ggaagggttt ttagaggttt tggaagaggg 9600 

aggtttttag ggtagtttag tggtttgagt atttttgttg tttttattag gataagaaag 9660 

atttatgtgg gtagtgtttt tcgttaggtt gttttattcg gatattgatt gacggggtaa 9720 

ggaggtatag ggagattttg gtttagggat tttttttgtt ttgtagtgtt ttgttttttt 9780 

agttcggggg tttggtttat ttttagttta taggaggttt aggcgggttt ttaaaggata 9840 

tataagtaaa attttttgtt taaggggggt tattttaggg ttatggttgg ggtttaggtt 9900 

tagttttatg ggtagattgg gttaggattc gatttgagag ggtttaggga agttttaagt 9960 

tttgggtaag tttttttttt taggagttat atttttattt aaatgagtgt tttttatgag 10020 

gagttttaag attttgtttg atttagcgtt ttggagggtt taggcgattt ttatggggaa 10080 

ggttattgat tttggagatt gaagttttag tgtgcgtagt tcgagttatt agttttagtt 10140 

tggaaggatt aggttttttt atatttgttg tttttataga tttttttcgg gtttattttg 10200 

cgtttgtggg acgtgtattt ggtagaaggc gaataggcgt tgatgtcgat aataagaatc 10260 

gtttttaagg tttagtagag taagtttacg tgtgtttagc ggggtttggg gagttttggg 10320 

gttagatttc gattggttcg agggtagttt ttttatattg tttttatgat tttttgtttt 10380 

ggtttagagg gaggtttggt taggtgggtt gggtaggata ttgtgatatc gagtttattt 10440 

tttatatgat ttagatgaaa gtcgagagtg tggtgagtat ttttttgttt 10490 

<210> 291 
<211> 10490 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna CHomo sapiens) 
<400> 291 

ggatagggaa gtgtttatta tattttcgat ttttatttgg gttatgtggg ggatgggttc 60 

ggtgttatag tgttttgttt agtttatttg gttagatttt tttttgggtt agaatagagg 120 

attatgagga tagtgtgagg aagttgtttt cgggttagtc ggggtttgat tttagggttt 180 

tttaggtttc gttgggtata cgtagattta ttttgttgaa ttttaaaggc gatttttgtt 240 

atcggtatta acgtttgttc gttttttatt agatatacgt tttataggcg tagggtgagt 300 

tcgagagaga tttgtgggga tagtaggtgt gaaagaattt ggttttttta ggttggggtt 360 

ggtggttcga gttgcgtata ttggggtttt agtttttaga gttagtgatt ttttttatga 420 

gggtcgtttg agttttttag gacgttgggt tagataaggt tttgaagttt tttatggggg 480 

gtatttattt gagtggggat gtggtttttg gagagagggg tttgtttagg gtttgaggtt 540 

ttttjrgagtt tttttaagtc gggttttggt ttagtttgtt tatgaggttg ggtttgagtt 600 

ttagttatgg ttttgggatg attttttttg ggtagagggt tttgtttgtg tgttttttgg 660 
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ggattcgttt gagttttttg tgggttggga gtgagttaga ttttcgggtt ggggaagtag 720 

ggtattgtag ggtaaggaag gtttttgagt tagggttttt ttatgttttt ttatttcgtt 780 

aattaatatt cggatgaggt agtttaacgg ggaatattgt ttatatagat tttttttgtt 840 

ttgatggaag taatagaggt gtttaggtta ttgggttgtt ttaaaaattt ttttttttta 900 

gggtttttga agattttttt tttagtgtag aatattgggc ggtgtttaga gttttttata 960 

atattgttat ttttttatat tttcggtgga tatattgttt tttgttttgt tttgcgggag 1020 

ttgggttttt atttttgtgt ttttgttttt tttagggtag gaaaggaaat taatttttag 1080 

tttatggaga attcgacgtt ttaggttagg ttttggttgg gatttagtta gttattagtt 1140 

ttacgagggg ttttagtttt tttgttttta tagttttacg ggaggtaggg tttttgggaa 1200 

gagttgaggg gattataaat ttatttgatg ttttatggtt ttgggttgtg acgtggttat 1260 

tatatgtttt tgttggtttt ggagtttttg gacggtttcg ttatttgggt tgtgaaattt 1320 

tgagaagttt ttagtttatt atgaaattag agtttttttt taagatgtgg agttattagt 1380 

tgtaagagtt gggtagttgg agaggttttt aaattttaag gttttttatt tttttatttg 1440 

gtgattttaa tatgcggttt ttattttggg gaggtggggc gggaatattt tttggagttt 1500 

ggttggaggt ttttttggag gttttttggg ttagggtgta aaaagggtaa gtttgatttt 1560 

taggttacga tagggtggtc ggaattgggt gggcgttggg tttttcggtt attttttggt 1620 

agtggggtcg ggttagggaa taggggatgg ggagatgttg ttatttgggt ttggtcggtt 1680 

tattcgtggg tatcgatggt agtaggagtt cgggtagttg gagggtagga ggatttttag 1740 

ggaggggaga gttagttgta tagaattaga gtcggagggc gtggttttag gatatagagg 1800 

gtggttacgg ggaggatgag atgttttttg ttgatgggga tgagaggcgt ttgatttggg 1860 

ttttgggggt tagtcgtgga tttttgtggg atttttagta gagatatttt aaagtttttt 1920 

aataagttgg cgatataagg agggtgtttt ggttgaaagt tgtgattatt tggttagggt 1980 

ggttattttt aggtttggtt gtaggaggtt ttcggggtag ttgtttattt attttgtagg 2040 

gagtgttttt tattggttag tagttgtatt agtgtttaga atgtattttt tttaggaaga 2100 

tagaggagga ataaggcggc gatgtggttt aggtttttgt agtagtttat tttttgtaag 2160 

agttagagtt attatggaag gatattattt gggagggttg aggttatttg ggaggattta 2220 

tgttattgga gagggtagag gtgattggag aggttttttt tgaaggagag gttttttttg 2280 

aaaaagaggt ttttttagga tgtatattta ttttatgata agagttaagt ttattaggta 2340 

ttttagtatt ttgtttaaaa tgtttgttga tagtattatt ttgtgtgcga tgttgttaag 2400 

tttttgggtt ttggggtagt tttaggagga gggcgttatt ttttgttttg agaagtggtg 2460 

gttaggttta ggtgatatta ggagtttagg ttttgatttt tttgtgtttt agtttgattt 2520 

tttgagatta tttttttttt tggaggttta tgttagcggt gagttgatat tttatttttt 2580 

atattttggt gggttataaa tattaatttt aaaagaagta acgatatttt tattagatat 2640 

ttatttttgt taatatggaa atatggttcg ggaattttat tgtcgggaat atttatcggg 2700 

ttgtattttt tatatgttag gaggatgtgg agtagttttc gttgtttagg aaatagagaa 2760 

agggggtttt ggtttgtttt gtgtagatgt tgttaatttt attttgttta taaagtttaa 2820 

tagtaaattt tattttaggt ttagatgttt tattagataa gtagtgagtt ttttagggtt 2880 

tgagattttt gaagaaatgt tttagtaaaa tttatatttg tgatatgtaa atagtttagt 2940 

tgtatagtga tttgtttgat tttttttatt ttgaatgatt tttttttttt ttttttagtt 3000 

tgtatatacg ttagtttagt ttgtgggtgt atagtttttt tacggtttta aattaatgtg 3060 

tagagttttt cggttatcgt tttagttttt tttggggcga tttttttatt ttttaagttt 3120 

ttagggtggt ttttatgtat ttagtttttt ttcgattcgt tagtttttgg ttatttagat 3180 

tgttttttag tttttgtggt ttggtttttt ttagaatggt ttaggaatgg gaattttatt 3240 

gtggtagttt attgggtttg gtttttgttt tttagtaaaa tgtatttagg atttatttac 3300 

gttcgtgcgg gtattatcgg ttcgtttttt ttttttattg ggtttttcgt ttgaagggag 3360 

gattagtttt gtttttttta ttttcgtgtt gaaggtcgtt ttcgaaggtt tcgtgtgtga 3420 

gtgacgagga gttaagtagt gaatttggta tgttggtttt atgtggatgt tagtttgtaa 3480 

attagtgggt ttaatatttg tgatattttg gggatgtgtg gtttaagttt atcgagtttt 3540 

gtgagttatt gtttaacggg ttgttaacgt ggttgtgtta tgttatgttt ttagcggatt 3600 

tggatgagag tttttaggat ttttaatttt tttagtattt ggtgttgtta ttgttgtttg 3660 

gggggggttt atgggttttt tattttgtta ttttttcgtg ggttttatta tgggttttta 3720 

tgggttaggg agagtatttt ttattattgt gtatgatttt gtttgttgtt ttttattttt 3780 

ttaggatttt tttgggtttt ggttttatat gttttagttt ggtttagggt ttggaattag 3840 

ggaggtgttc ggtttatggt gtcggttgtt ttttgggtcg ggagagtttt tggtagttgt 3900 

gttatttttt ttgggtgatt ttggtttttg tttcggggaa gtttttattt tttttattta 3960 

ttttattttt gttgggattt tgtggttttc gtaggtttat ttggtttcgt atcgattttt 4020 

gaagaatata tgttttttta atgtttcgtt tacgtttcgg tcgatgcgtt ggatgtgttt 4080 

agatgatttt ttgttttttt tttttatgat ttgtagggta gggttaagag gaggaagtag 4140 

ttttagaata gatggaagat tttttgtttt tagtggtagt tagtttatag ttagtatttc 4200 

gggaaggaag gatagaagga aggttttttt ttgtagaaag ttgtattttg gtttgttatt 4260 

gaagttaggg agggttatta gagttgagtt tgtttgtggt gattgtgtta ttatttgtgt 4320 

ttagggtgtt tatttgattt ttatttttag ttttttaggg tggttttgac gtttttttta 4380 

gttggagatt tgggtttcga tacggtttgt tttgtttgtt gtgttttggt tgagcgtatt 4440 

tggtattttt cggggttttt taattttatt tttttaatgt ttaggaggat tgattatatc 4500 

gggtttcgga tgtttatggg tatttttttg tacgttcgat ttatgagttg tgggtagaaa 4560 

acgatttggt gttataggtt acggggtgat tttagtgagg attagagttc ggggattttg 4620 

?S*JISS2 ^"W^t tatgattttt tagtagaggt gagattaagt tgggataggg 4680 

tttttt-t^t^ 4740" 
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tttttttttt ttttaggtta ttagggtatt tttagttttg agtttcgggt agttttagtt 4800 

ttagatttag attttttttt tgtaaggtga cgtttgtacg aataggtagg aaatttggcg 4860 

attaggtttg tagttttttg ggcgaggata gtgtgtcgtt tattttttga gaggttgatg 4920 

gtgttaggtt atagttatgg gtgtttgttt tttgtttttg tagagagtgt tttttttttt 4980 

tatacgttat ttttttgttg tttttgtatt ttttttagtt ttttagtata tttatttatt 5040 

tgttttttcg gttgattttt cgtcgaattt gttgttaaat gaggtatgtt ggagttagcg 5100 

gagttgttag gttttttaga gtcgttcgcg gatgttgggt tttgggtttt ggagttttgg 5160 

tgggagttag ttggaaggag ttagggaagg gtagatttta agggttgaga gtttttgagt 5220 

aaatgagtat tagtgggttg gttttgggat ttcgggatgt attattttta ggttatagat 5280 

atattagttt taggttttag tttttaggtg gggttttgat ataagcgcgt agttattttt 5340 

aagttaggat tgtggttttt tttttggaat tttattaaat tgttaaagtg aatagtaatt 5400 

tggggttagg tttagtaggg attgttgttt tttttagtga tagcgtgttg tttttattcg 5460 

ttatcgttta ggttagttgt tttttttgtt ttattgatta ttcgtttagt ttttacgttt 5520 

ttggattagt ttttttacgt attaggtttt tattttcgtt tttcgcgtag ttagaggagg 5580 

tagtttcgtt ttattgtaag gtaatttagg tagagttgag gaattgtacg gggtttggag 5640 

cggttttagt ttgggtgtcg atttttagaa aggattggtt ttgttttttt ttagtttagg 5700 

gtttagttta ggagaaggta tagggaaggg aggataaggg tttttttgtg gggttgattt 5760 

ttaggagggg taggatttgg gagaagaagg agtgtaggga tagtttggtc ggggttattg 5820 

gggtttttgg cgtggggggc ggttaggttg tttagtgggg ttgttcgttt tggattcgag 5880 

gtggtgtttt ttgttggagt tgagaaaggt tagttttgag atgggatggg ggtcgtttag 5940 

ggtgggcgat cgggttttga taggagtttt ttagggagtg attatatttt ttcgttaggg 6000 

ttaagggagt ttgttttgag atttgttcgg tgtattttgg ttgtattagg ggtttatttt 6060 

atttgatagt tttaaggttt ttgtaggttt tgatttttta gtatttattt gttttttttt 6120 

gtattcgagt tatatatttt gcgttttaga agtggtacgg ttcgttagtt ttttttcgtt 6180 

ttattttttt agggattttt tgttttttta ttttgtgatt tttgagggat gggtttttgg 6240 

ttgggttttt tttattcggt tttagatttt tttttagtat tagatttagg tttttagtcg 6300 

cgagttttgt tgtttttttg gttttatcgt gagatgttta gaacggggtt ttgtttattt 6360 

tttttttcgt ttttttaggg ttatagtttt tattgttatt atgttttttt tttttacggg 6420 

atagtgaggg ttgtagtttt aggggaatgg gggagaatag gggtaggtgg gtttttagag 6480 

atttgttgga taatagtttt gaggttgggt taggcgtttt tttattttgt ggttataatt 6540 

tttgtatttt atcggggttg tttttaagta gatagggtta gatttttagg ttgtttcgtt 6600 

ttttttgtgt ttatttgtcg atagaattgt tgagcgttta ggggtttgat ttagtttagt 6660 

ttttattttt atcggttttt tagatgggtt ttatattttt ggtttaataa tttcgggttg 6720 

gatttgtagg ggagttaggg aggagttttg tttttggaaa ggaggttgat tcgatttggt 6780 

gagatatgtt ttgcgttaga aaggtttttt taaaagtaaa tttatttttg agttgagata 6840 

ggtgttttag gggtgagggg agtgtagagg atttattgta taatttttaa atgatcgacg 6900 

ttgttgttgt agtttcgaaa aggtttaggt tttttgtttt ttggtagttt agttcggtgt 6960 

ttttgtagtt tagagggagt tttggtgagg ggtttaaggt aaagggtgta agggtttggg 7020 

ggtattggtt attcgttttt gttttgtgtt tttagggagt ttaggatttt ttgattaggg 7080 

tatattggaa gaggtttttt tttaagaagt agatcgattt gtattttttc gtattttata 7140 

atgatgtttt ttcgtttttg tgtttattaa ttgttcgcga tttttattac gtttattttg 7200 

tgagatagaa ttgtttaaag gttatattgt acgcggcggt ttcggagaat atttgaatcg 7260 

ttttcgtcgg gtttttagat gttggttggt tgcgtaattt ttattttatc gtcgttttta 7320 

gggaaaaagg ggttagattt agtggtttat agttgtttta gtttttggag ttttaagttt 7380 

taagtagggg tgggtatttt tttaaggatt tgagtatagt gggatttaga tagagaattt 7440 

tgttgttttt taatgttatg aaatggggat atatcggttt tagtaggttg aatggttttt 7500 

atttgttaag ggtgaagggt ttatgatggg ttattttagg gatgtggagg tagattgggg 7560 

ttagcgatta gaggtttttg tgtaatcggt tttttgggat gtttagggtt ttagcgatgt 7620 

ttagtttttt atagggaata agattttttt cgattgttcg gttttatttc gtttattatt 7680 

ttggttatcg tggtttttta gtttgaatag tgaagttatt ttaggaataa cgtttgttga 7740 

gtaggagggt gttgggtttg ggggatgaga aagatttatt gtacgtatgg aaattacgtt 7800 

tttcgcggag ggattgtgga gtttattatt ttgagtcgtt ttaataggag gaggttttat 7860 

ttttttgggt tattgaggaa gaatagtggg tttttggttt tggagaatag ttggatggat 7920 

cgtttttttt gggaatattc gaggtaaaag gagggcgagg ttttaagagg attacgtaga 7980 

gtaagaaata tttggggaga attttagtgt tcggattttt ttgaatataa gggaagatag 8040 

ttttttttta gttagttttt tagggttttt ttatttttta tagttgttta agggtagtag 8100 

gtttttttgg ataagggatt atgtgtgttt agtggggttt atagcgatta ttaggattta 8160 

gtttagggta tagaggtgtt ttgaggatcg ttagtggatt tgtattagtg gttttatatt 8220 

agtggttttg ttaggattag gttttgtttt atcgggatgg gaaatttggg tagatttggg 8280 

atttagggta gggaggttat agggtttagg tttgaatttt agtatagtat acggtagggt 8340 

tgagagtaaa atttagggtt atgttcggat ttttaggtcg gttattgttt ttttgatttt 8400 

agacgtttta tttgtcgaat ggggatattt gggaatagta tttattttac gaagttatta 8460 

tg^agacgaa agagttaatc gtttatacgg gtagtgtaga acgggcgttt ggtgagtgtt 8520 

tagggatgat ttttttcggt agttgtttta tagaggttaa tatcgttcgt atcgtagtta 8580 

ttgtttttaai gttcgtttgg agggaagaga gtaggttacg tttatttgat tttgatgaat 8640 

tagttggttt gggttatgtt ttttagggag aaaatttttg agtttatajga gttgtttata 8700 

gatattattg tttgtgtgta attgttgtag attattgagg taggttagag agtagatagg 8760 

tgttaagtat tagtgatatt ttgaggtt'St g^iracgaatt atagtggggt tttgttcggg 8820 
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ttagtagcgt ttagagttag ggtttttcgt tgtttgaggc gttaatatgt ttgtttgtaa 8880 

tgtgtttgtg tacgtgcgtg tatatgtgta tgtgggtaaa tacgtttgtg tacgtgtgtg 8940 

ttgttttttt ggttaggttc ggttgtttta tttatgtgtg tatttagttt tttattattg 9000 

ttattttcga ggtttagggt tagtattaga gtatttatgg ttgtttttta atcgtagttt 9060 

ttttcgttta gggtggtttt gggatatata tagcggtgga gggaagtgat tgttgttatt 9120 

ggattttaga atataaaagt taatattatt atttaatggt ttttttagtg tttttaatgg 9180 

tattattttt ttttttttga tattttaatt gggtattttt ttttatgatt tttggatatt 9240 

ttagttagag gattttgtgg ggaaagtgtc gggtatatag taggggttta tttttttaga 9300 

tatgttattt aaaatttggt ttatttgttt ttttacgtag ggtttagggg atgtcgaatt 9360 

ttagggttta gaaagagttt gggataaaaa gaaattttaa ggggacggtt ttgatttggg 9420 

ttgagtttat ttgtgttatt taattggagt tttaagtttt gaggtaggac gtttagatgt 9480 

tttagtgtag ggtttttttg attaatattt gtttttttgt atttattagt aattttattt 9540 

attttatttt taaagtatat ttggttttcg tatttaggag ttttgtattt gtagatttag 9600 

taatagtaga tggaaaatat ttagaaaata aattggacgg ttatgttttt attgaatatg 9660 

tgtagatttt gtttttgtta ttatttttta aagaatatag tattacgatt atttatgtag 9720 

tatttgtatt gtattatata ttataaataa tttagtaacg gtttaacgta tacgggagga 9780 

tgtgtatagt ttatacgtaa atattaggtt acgttatatt agagatttga gtatttatgg 9840 

attttggtat tttcggggat tttagaatta attttttatg gatattaagg gatgattgta 9900 

tatatttatt taggaaggtt ttttattgga ggaaaggttc ggtttaggat agatagggat 9960 

attatttttg gattattgtt tatttattta tttatttatt ggtattattt tttaggattg 10020 

ttttaatgat agttttagta agtggagata agaaaaaaga ttggtttaaa tggtatagtt 10080 

ttgaggtttt ggaagatgtt gtattagtat gagaataggg ggttagtttt ttttaggatt 10140 

tagaaagtat attaggtagg ttcggggaga ggaaaggaat acggtttttt tagtagttat 10200 

ataggtttgt agtaggatgg ggttggggtt ggggttggga ttggggttgg tttcgtttat 10260 

tatttgggtt ttatgagggg aaaagaaaga atagggggta gaggaggagt atgggggtag 10320 

cgggttgttt aaggagaagg cgttttaggg aaggggtatt agtttgtttt tatattgtta 10380 

taaagaaata tttgagttta ggtaatttat aaaggaaaga ggtttaatcg atttttagtt 10440 

tagggaattt atagttatgg tagaaggcga aggggaagta ggtatttttt 10490 

<210> 292 
<211> 6416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 292 

ttttcgattt atttgtgatt gagggtaaga atgttttttt tttagagatt cggtgttttt 60 

ttttgtaaaa tgaggtttat tttgtttttt tttaagggtt ttagggaaat tttagttaga 120 

gtagtgaggt gttaggtaaa ttttgagggg tattttaatc gtcgtgtgag gttaggaggt 180 

ttgaattttt ttattttttt tttttaggat aagggtatat aattggtttc gttaagtttt 240 

ttttttgttt agacgttatg gagttggatt tgtttttatt ttattttagt agtttttcgg 300 

aagatttttg tttagttttt gggatttttt ttgggatttt tcggtttttt gatatttttt 360 

tgtttgagga ggtaaagagg ttttagtttt tttttatttt aattatcggt aggtacgatg 420 

gggcgtgggg tttgggggag gttagtgttg gataatatat agaggtttgt aggttattgt 480 

tttttttttt atattttttt tttttttttt ttttgttata ggaaatttcg agaggaggag 540 

aggcgtgtta tttttttttt ttttattttt aatttttttt ttgagttttg tagttttttt 600 

ttatagagtt taattttcgg gggttttttt agtgtaaggg ggttgttttt tcgcgatgtt 660 

agtcgttttt atgtgagttg ttttttagaa gggaagggag ggatgtacgg gttttgggtt 720 

tgtgggagat atatagtcgt ttttatggag gtaggggatt ttggttagga gtttttgagg 780 

gtttagtagg tgcggaaagg gaatgaatta ttttttgttt ttttttaagt cgtgtgtaat 840 

ttttggggcg taggtagtaa aggtgtatag tgaggatggg gtttgtaggt ttgtggaggt 900 

ggtagtaggt gttatagttc gttacgtgtg tgaaatgttg gtgtagcgag tttacgtttt 960 

gagcgacgag atttgggggt tggtggagtg ttatttttat ttagtattgg gtaagttagg 1020 

tgtatggaat tattcgggtt gggagatgat gttttgtatt ttgggttagg tatgtggttt 1080 

atttaggaga tttggttgat ttttaataat ttttgttttt tgtttttgtt tttagagcgg 1140 

ggtttggagg attacgagtt cgtggtggaa gtgtaggttg tttggttcgt gggcggagat 1200 

agtcgtttcg tttttcggaa aaatttcgtt aagtacgaat tgtttaagag ttttttagtg 1260 

agtgtatgag ggttgtttgg gcgttgggat gttttgattt ttaatttgga tgttggagtt 1320 

ttgatttttg atatttgttt atttatagta ttttttgttt ttagaaaaaa tggtttttag 1380 

ttgtttcgat gtatatattg gtatatttta tgaagatttt atttaggtgg ggggattttt 1440 

tattttattg tagatttacg attttttagt attggttagt gtttttttat ttttaagttt 1500 

tgtgtttttt ttttatgttg tagaaagagt taaattatag ttttgtgtga ttggggatag 1560 

atattttttt tgtttgagta ttagtttttt tt attaaatg ggggcgagaa atgtatgtgg 1620 

fH£ a iI£I t:t t g t ^aaaatt tgagggtggg ttgggtacgg tggtttatgt ttataatttt 1680 

agtatLtttgg__gaggtt^ag 1740 
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ttagtttggg taatataatg agatttcgtt tttttaaaaa aaaaaaaaaa aaaaaaaaaa 1800 

aggttaggaa tggtggtatg agtttgtagt tttaggtgtt tgggaggttg aggtgggagg 1860 

attatttgag ttcgggaggt cgaggttgta gtgagttgtg atcgtgttat cgtattagta 1920 

tgagattttg ttttaaaaag aaaaagaaaa agagaaatat tttggtggta gaggggaagg 1980 

gaggaggtta gatttgttat ttttttttgt ttttttttgg tttagaattt tttgaatgtt 2040 

ggtagttttt ttgagattta gggttttttg tagttgcggg gtttaggacg gaagttttgg 2100 

aaacgttttt tttgtttttt gcgtcgattt ggtttttatt attttattaa gggtattttt 2160 

aaggtaaggt tttgagggta ttagttttag tttttttagt ttttggtttt tttagaagtt 2220 

gttttttttt tgttggaatt tttgagtttt tttttttttg ggttttttag gttagttatt 2280 

tttagtttat tatttttttt tatatttttg tttagtttat ttgtttaggg aggtagtagg 2340 

agaaaagatg attttagttt aagttttggt tttattttta tttgttgtgt gattttgggt 2400 

atttttttgt ttttttttgg ttttgaaatt tttttatttg gtttgtgggg ggaggtaata 2460 

gtgggcggga tttatttgta ttaagttttg tttatttatg ttgtttagga ttcgaggtat 2520 

ttgtagtacg tggtagatgt gaacgagttt aacgtgtacg tggtgacgta gggtcgtaag 2580 

ttttacggga tgtttattga tttcggtttt tgtgttaagg tgaagatttg gttaggtttg 2640 

gtttttggtt tggggaagta ggaattgttt aggtttttgg attttgttcg gggtttttga 2700 

gttttaattt agatatttag attttttttt ttgtattttg gtttgttgga aatttttgga 2760 

tttagttttg ttatgtggag taggggtaga tatgtggttt taaaggtaga tatgggatta 2820 

gttaattttt tttttttgta gtttaataag tttcgaaatg gttataaggg gtttcggatt 2880 

ttttgtagtg aagatgagta gagtcgtatt tgttggttgg ttgtttttcg tttttttaag 2940 

gtgagatttt gggagtggta tggggggttg gtttggttag agggattttt agttttgttt 3000 

ttaggaagtt ttaggaatga ggagggtatt atagttttgt tttttgataa tttttagttt 3060 

aagtttgaag ttataggaag tgtttatcga aggtagaaat atagttttgg tttgggtaag 3120 

ttttggtttg aggggggcgt tatagttacg tttttaggat ttttcgattt taagtttttt 3180 

tttttttttt atttttagta cggggtgtag ttgtataaga attattagta ggtatagttt 3240 

cgttatttgt atttattttg tttgggtttt ttatttttgg tgagtgtgtt taaggggatg 3300 

ggagggtggg tatgtaggtt ttgttttacg ggtatttggg ttttgttttg attttttttt 3360 

ttttttttta tttttagaga agtgttttag ataatatttt ggtggttatg gatttttttg 3420 

gttatgttgg gcgtgttatt gagaattttc gggaggtttt gagtgtggtt ttggaggagg 3480 

tttaggtttg gagggtgagg tttgttgtgt gtgtgtgtgt ttgtgttggg gatttatttt 3540 

ttgggtggga tttttgaaat aggagggagg aagagagggc ggggggaggt ttttggttgg 3600 

gaagaagtgt tttatttttt tgaggtgttg ggtaatgttt ttaagtacgt ttcgattttt 3660 

tcgtattttt tattgtttta tagaagaaga taaattatcg ttttagtttg tttatgttag 3720 

ttttcggtac gagttttagt gtaggtgggt gacggtttcg agttttgggg cgggggttgt 3780 

tttaattttt ttttgtattt ttagtgggga gtagatggta taggggtttt ttttttaaag 3840 

tgatcgttta tgtttttttt tattatagtt atttatcgta tttaattttg gttttacggg 3900 

cgtatttttc gtgaggagag ttagcggttt attggatagt agggtttggt agacgggtaa 3960 

ggggtagggt cgggtaatag atttagggat aagagagatt ggggtttagg tggtagttat 4020 

ggtttttggg tagtaatgtt gttttatttt ttgttttttg gtagtttgtt tttggttcgg 4080 

gagagttagc ggaattttta gggttttgtt ttttttttgt gttatttgta gaaagtgaag 4140 

tattatttta ttttgtcggt gagttttttt gcgttttcgg agttttgtaa tgagatatag 4200 

gatttttagt aatttgtttt ttttattagg tttttttaga ggttttttgg tttttagtgc 4260 

gttttttttt tttttgtata agaagtggga ggttgagtgc ggtggtttat atttgtaatt 4320 

ttagtattta ggacggttaa ggtgggagaa tggtttgagt ttaggagttc gagattagtt 4380 

tgggtaatat agggagattt tatttttata aataatttaa aaatgagtta ggtatggtgg 4440 

tgtatatttg tagtttagtt atttaggagg ttgaggtggg agtattgttt gagtttagag 4500 

ggttaaggtt gtagtgagtt atggtggtat tattatattt tagtttggat gatatagtga 4560 

gaaaattgtt ttaaaaaaaa aaaaagaaag aaaaagaaaa agaaaaaaga aaagaataaa 4620 

aggaaatggt ggggttttgg tttaggaggg agttttttaa gtttcgggtt tttttttgaa 4680 

tttttatttt ttttattgta ttttagagcg aggaggaggg tcgtttgtat tttagtatgg 4740 

atgatggtta gattcgtttt attgatttgt tgtagttcgt ggagttttat tagttgaatc 4800 

gcggtatttt gtcgtgtttg ttgcgttatt gttgtacgcg ggtggttttt tgattaggtc 4860 

gtggattggt ttatgtttta gttcgttttt aggttgttcg tcgttttttt atttatttag 4920 

tggattttgg ggcgcggtta taggggacgg gatgaggagc gggagggttt cgttatttta 4980 

gttttttttt ttgttttttt gtttttttta gatagaaaat agtttttatt ttagtttatt 5040 

tttgattttt ttttttaagg gaaggttttg ggtggttttt tttttttttt ttagttttgg 5100 

aggtgttgtt ttagggtagg gaattatggg agaagtgggg gtagtttagg cggttttacg 5160 

tattatattt tgtatagatc gagaggttag ttgatttgtt ttgttttata ttagtgataa 5220 

taaagattat tttttgatat atttatgagt tttgtttggt aaggtttggt tggttgaatt 5280 

aagaagggaa ttagagttgg acgtggtggt ttatgtttgt aattttagta ttttgggagg 5340 

ttaaggtagg agaattgttt gagtttagga gtttgagatt agtttgggta atatggtaag 5400 

attttgtttt tataaaaaat aaaaaaatga gtcgggtatg gtggtgtgta tttgtagttt 5460 

$agtttttta ggaggttgag gtgggaggat tttttgtttt ttgagtttgg gatgttaagg 5520 

ttgtagtgaa ttgagattgt gtcgttgtat ttagtttgag tgatagagtg agattttgtt 5580 

tggaaaaaaa aagaggggag atttggagag gtgggtattt gtggaggttt tggtggaagt 5640 

gttaaatgga tcgaggttat gttttagttt gt|ttggatgt tttttaaggg taggagtggt 5700 

tatttgagag t;ttttagtgt ttattatgta gtttgatata tagtaggcgt ttagttattg 57§p 

.ttaattaagt aagtgaatalg ataagagatt attattttag agagattttt tgatagttt^ 5820 
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agtttagaga ggtaattaat agggtttggg agttggagat gagttcgata gtatgtttgt 5880 

ttttcgttag ttttggatta gttgacggat agttggtttt agtatgtata tagttatgta 5940 

tttagtttta ttattaagat atagaatggt tttattattt ttaataagtt tttgtgtttt 6000 

ttattattag ttatgttttt ttttatttaa attgaaattt tttttttgtt ttagttttgt 6060 

tttttttaga aagttatata aatggagttt gttagtattt agttttttgt atttggtttt 6120 

tatttttaat atatttttta gtatttagag tgattttttt ttaaatataa atttattatt 6180 

ttcggtaggg tattgtggtt tatatttgta attttagtat tttgggaggt cgaggcgggc 6240 

ggattacgag gttaggagtt tgagattagt ttggttaata tggtaaaatt ttatttttat 6300 

taaaaatata aaaattagtt gggtatggtg gcgggtattt gtaattttag ttattaggga 6360 

ggttgaggta ggagaatagt ttgaatcggg gaggtagagg ttgtagtgag ttaaga 6416 

<210> 293 
<211> 6416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 293 

ttttggttta ttgtaatttt tgttttttcg gtttaagtta tttttttgtt ttagtttttt 60 

tagtagttgg gattataggt gttcgttatt atgtttagtt aatttttgta tttttagtag 120 

agatggggtt ttgttatgtt ggttaggttg gttttaaatt tttgatttcg tgattcgttc 180 

gtttcggttt tttaaagtgt tgggattata ggtgtgagtt atagtgtttt gtcgggagtg 240 

gtgaatttat atttgagaaa agattatttt gggtgttgaa ggatgtgttg gagatggaag 300 

ttagatgtag agggttgaat gttagtaggt tttatttata tggttttttg gaaggggtag 360 

aattaggata gaaaggaggt tttagtttgg gtaggaggga gtatggttga tagtaggggg 420 

tatagggatt tgttgggggt gatgaggtta ttttgtgttt tggtggtggg gttggatgta 480 

taattgtgtg tatgttgggg ttaattattc gttagttagt ttaggattag cggaggataa 540 

gtatgttgtc ggatttattt ttaattttta ggttttgtta attatttttt tagatttaga 600 

ttgttagaaa atttttttgg gatgatggtt ttttgtttat ttatttattt aattagtaat 660 

gattgggcgt ttattatgtg ttaagttgta tggtgggtat tggagatttt tagataatta 720 

tttttgtttt tgaggaatat ttaggtaggt tgagatatgg tttcgattta tttgatattt 780 

ttattaaggt ttttataggt gtttattttt ttaggttttt tttttttttt tttttagata 840 

gggttttatt ttgttattta ggttgagtgt agcggtataa ttttagttta ttgtaatttt 900 

gatattttag gtttaaggaa taagggattt ttttatttta gttttttgag aagttgggat 960 

tataggtgta tattattatg ttcggtttat ttttttattt tttgtaggga tagggttttg 1020 

ttatgttgtt taggttggtt ttaaattttt ggatttaagt aattttttta ttttggtttt 1080 

ttaaagtgtt gggattatag gtatgagtta ttacgtttag ttttggtttt ttttttgatt 1140 

tagttagtta ggttttgtta gatagaattt ataggtgtat taaaaaataa tttttattgt 1200 

tattagtata aaatagagta gattaattgg tttttcggtt tgtataaagt gtggggcgtg 1260 

aaatcgtttg ggttgttttt atttttttta taattttttg ttttagagta gtatttttag 1320 

agttaggaga aggagagggg gttatttaag gttttttttt gaggagaggg gttaggagtg 1380 

gattggagtg ggggttgttt tttatttgag ggaggtaaag aagtagagga gaaaattgga 1440 

gtggcggaat tttttcgttt tttatttcgt tttttgtggt cgcgttttag agtttattgg 1500 

atgggtggag gggcggcggg tagtttgaag gcgggttgag gtatgagtta gtttacggtt 1560 

tggttagagg gttattcgcg tgtagtaatg gcgtagtaag tacggtagga tgtcgcggtt 1620 

tagttggtgg aattttacga gttgtagtag gttagtgaag cgggtttggt tattatttat 1680 

gttgaagtat aggcggtttt ttttttcgtt ttggggtata gtaaaggggg tggaagttta 1740 

gggaggggtt cgaggtttgg gaagtttttt tttgggttag ggttttatta ttttttttta 1800 

tttttttttt tttttttttt tttttttttt tttttttttt ttttgagata gtttttttat 1860 

tgtgttattt aggttggagt gtggtggtgt tattatggtt tattgtagtt ttgatttttt 1920 

gggtttaagt aatgttttta ttttagtttt ttgagtagtt ggattatagg tgtgtattat 1980 

tatgtttggt ttatttttaa attatttgta gagatggggt ttttttgtgt tgtttaggtt 2040 

ggtttcgaat ttttgggttt aagttatttt tttattttgg tcgttttaag tgttgggatt 2100 

ataggtgtga gttatcgtat ttagtttttt attttttgtg tagggaaaag ggagacgtat 2160 

tgggggttag ggagtttttg gaggggtttg gtgaggagga taggttgttg ggagttttgt 2220 

gttttattgt aggatttcgg ggacgtaggg aagtttatcg gtaggatgag ataatgtttt 2280 

attttttgta ggtggtataa agagaggata aagttttggg ggtttcgttg attttttcgg 2340 

attaggaata ggttgttaga agataggaga tggggtagta ttattattta aggattatgg 2400 

ttgttatttg gattttagtt ttttttattt ttgggtttgt tgttcggttt tgttttttat 2460 

tcgtttatta agttttgttg tttaataagt cgttggtttt ttttacggga aatgcgttcg 2520 

tggaattaga gttgggtgcg gtggatggtt gtggtgggga aggatatggg cggttatttt 2580 

ggggagggga tttttatatt atttattttt tattgggaat ataaagagaa gttgaggtag 2640 

ttttcgtttt aggattcggg gtcgttattt atttgtattg aggttcgtgt cggaggttgg 2700 

tatgggtagg ttgaggcggt ggtttgtttt tttttgtgga gtagtgggag atacgggaga 2760 

__^tcggggcgt_s_tttggggg^ 2820- 
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ttaggggttt tttttcgttt tttttttttt 
tgggttttta gtataaatat atatatatat 
ttttagggtt atatttagag tttttcgggg 
gaagtttatg gttattaggg tattatttga 
agaggttaga atagggttta ggtattcgta 
tttttgggta tatttattaa gggtggggag 
tgtgtttgtt ggtaattttt gtatagttgt 
agtttgaggt cgagaggttt tgggggcgtg 
tttagattag ggttgtgttt ttgttttcga 
tgggggttat tagagagtaa ggttgtgatg 
agagttgggg atttttttgg ttaggttagt 
aagaggcgga aggtagttag ttagtaggtg 
cgaagttttt tgtggttatt tcgaagtttg 
tttatatttg tttttaggat tatatgtttg 
ggagttttta gtaggttagg gtgtagggag 
aagtttcggg taggatttag gggtttgagt 
tttggttagg tttttatttt gatatagaaa 
cggttttgcg ttattacgta tacgttggat 
ttcggatttt aagtagtatg agtgagtagg 
attttttttt ataagttagg tggggagatt 
agggttatat agtaaataga agtagagtta 
ttattttttt gggtaggtga gttaggtaag 
gttggtttgg ggggtzttagg gggagaaggg 
tttaaaagga ttagggattg gaggggttgg 
gtgtttttgg tggagtaata gaggttagat 
agttttcgtt ttgaatttcg tagttgtaga 
tttaggaagt tttgagttag aggagagtag 
tttttttatt attaggatgt tttttttttt 
agtgcggtgg tacgattata gtttattgta 
ttattttagt tttttaagta tttgggatta 
tttttttttt tttttttttt tggagagacg 
aaatttttaa atttttaggt ttaagtaatt 
ttataggtat gagttatcgt gtttagttta 
atgtattttt cgtttttatt taatagaaga 
ttttagttat ataggattgt gatttggttt 
ttaagggtgg agaagtattg gttaatgttg 
gtttttttat ttggatgagg tttttatggg 
agattatttt ttttgggaat agggagtgtt 
ttagtattta ggttgaggat tagggtattt 
ggggagtttt tgaatagttc gtatttggcg 
tcgtttacgg gttaggtagt ttgtattttt 
tttggggata agggtaaaaa gtaggggtta 
tatatgttta gtttaagatg taaggtatta 
atttatttag tgttaggtgg gggtggtatt 
cgtgagttcg ttgtattagt attttatata 
-ttatagattt gtaggtttta tttttattgt 
tatacgattt gaggggaggt aaagggtgat 
agggattttt aattaagatt ttttgttttt 
tagaattcgt gtattttttt tttttttttt 
tcgcggggga gtagtttttt tgtattggag 
ggattgtaga gtttagggaa ggggttgggg 
ttttttcgaa gttttttgtg gtaagaagaa 
tggtttgtaa gtttttgtgt attatttagt 
gtatttgtcg gtggttggga tgaggagagg 
ggtattaggg ggtcggggag ttttaggagg 
agagttgtta agatgaggtg gagatagatt 
tttaacggaa ttagttgtgt gtttttgttt 
tttgatttta tacggcggtt agggtgtttt 
attggagttt ttttgaagtt tttggggaag 
atatcgagtt tttgagaaga aagtattttt 

■ » ■ • 
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ttttttattt tagggatttt atttagagag 2880 

agtaggtttt attttttagg tttgggtttt 2940 

gtttttaatg atacgtttag tatggttaga 3000 

ggtatttttt tggagatgga aggagaggag 3060 

agatagggtt tgtatattta tttttttatt 3120 

tttaaataag atggatgtag atggcgagat 3180 

atttcgtatt gggggtgggg agagaaagag 3240 

gttgtgacgt tttttttaga ttaggatttg 3300 

tgggtatttt ttataatttt aggtttggat 3360 

ttttttttat ttttgagatt ttttgagggt 3420 

tttttatgtt atttttaggg ttttattttg 3480 

cggttttgtt tatttttatt gtagaagatt 3540 

ttgggttgta gagagaggaa attgattggt 3600 

tttttgtttt atatagtaga gttgaattta 3660 

aggagtttag gtgtttgaat tggggtttag 3720 

agtttttgtt tttttaggtt aggggttagg 3780 

tcgaagttag tgggtatttc gtagagtttg 3840 

tcgtttatat ttgttacgta ttgtaggtgt 3900 

atttggtata ggtagatttc gtttattatt 3960 

ttaggattag agaggggtag gggaatattt 4020 

ggatttaaat taagattatt tttttttttg 4080 

gatgtaggga gagatggtaa attggaggtg 4140 

tttagaggtt ttagtagaga aggggtaatt 4200 

ggttggtatt tttaagattt tattttagag 4260 

cggcgtaaga agtagaaaaa gcgtttttaa 4320 

aagttttgga ttttaggaaa gttgttagta 4380 

aggagagtga taggtttgat tttttttttt 4440 

tttttttttt ttgagatagg gttttatgtt 4500 

atttcgattt ttcgagttta agtgattttt 4560 

taggtttatg ttattatttt tggttttttt 4620 

aggttttatt atgttgttta ggttggtttt 4680 

ttttcgtttt agttttttaa agtgttggga 4740 

tttttaggtt tttataagga aatgttttat 4800 

aattgatgtt taggtaggga aagtgattgt 4860 

tttttataat atagagggga ggtataggat 4920 

gggagtcgtg aatttgtagt gaaatggggg 4980 

atatattagt gtgtgtatcg agatagttgg 5040 

gtgggtagat aagtgttagg gattagggtt 5100 

tagcgtttag atagttttta tgtatttatt 5160 

aagttttttc ggaagacgaa gcggttattt 5220 

attacggatt cgtggttttt taaatttcgt 5280 

ttggagatta attagatttt ttggatgagt 5340 

ttttttagtt cggatagttt tatgtatttg 5400 

ttattagttt ttaggtttcg tcgtttaagg 5460 

cgtggcgagt tgtggtattt gttgttattt 5520 

atatttttat tatttgcgtt ttaggaattg 5580 

ttattttttt ttcgtatttg ttagattttt 5640 

atggaagcgg ttgtgtattt tttatagatt 5700 

gagggataat ttatatgggg gcggttggta 5760 

gggttttcga gaattgggtt ttgtgaggga 5820 

atagagggga gggaggtggt acgttttttt 5880 

aaagggagag aggtgtgggg aagggagtag 5940 

attgattttt tttaagtttt acgttttatc 6000 

ttgggatttt tttatttttt taggtagagg 6060 

ggttttaggg gttgggtaaa ggtttttcgg 6120 

tagttttatg gcgtttgagt aagagagggg 6180 

tggggagagg ggatgagaga atttagattt 6240 

ttagagttta tttagtattt tattgtttta 6300 

gatagaatga gttttatttt ataaaagggg 6360 

gtttttagtt ataggtgagt cgggag 6416 

• - . ■ ■■ 
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<400> 294 

ttttaatatt agaaggaata aattttgaat atattatttt taagaattgt aatatttatc 60 

gtgagggttt acggttttat ttttgaagtt agtgagatta agaatttatt aattttggat 120 

ataaggtgat aggttgaggg cggtggttcg gttttgggtt tttttggggt ttttttaggg 180 

aatgttttgg tatttgtcga ttgagttttg ggaggtagrt ttggtatata gttttttgat 240 

atgatttgtt ttttattttg gggtgttata tatgaaggga ggtgattgtt gtgatggtgt 300 

tggtaggatt gttgtttttg atgtggggtg ggttgagtta ggtttgaaat atgggttttt 360 

aggttgagtt ttgttttttt tattatattt agggttgatt gatattttta gttagtttat 420 

tttggttttt tttttatatg ttaggataat gtagtttttg ttattaattt gggtagttag 480 

agttgggtta gtgggggata tgggattatg ggtaagggta attgatattt gtttagtttt 540 

aacgtattcg ttttaaatgc ggttaggcgg tggggtaagt aggaatgagg taggggtggg 600 

gttgttttga ggaggatgat tttaacgagg gcgtgagtag gggatttaag ttggaattat 660 

tatattgttt tattgtatat tagagttttt ggttagggag taggttgggg attaggtatt 720 

ttattttagc ggggtatagt ataaagttcg tagggggatg gggttattag aaagttgacg 780 

atacgagagt ggttgggtcg gggttgttcg gcggttacgg agaagttgaa gtgttgtagt 840 

agggaggtga agaagaggaa gagttttatg cgggttaggg gtttttcgag gtatgtacgg 900 

cggtttgtgg ggaggggagg ggcgttagtg agtttggttt ttgggtgata tttttgtaag 960 

attttacgga aggggatagg gagtcgggtt ttttataggt atttgttgag aaaggtagga 1020 

aggttttcgg ttttataaag tggttttggg tatttaggaa gtgttcgggg tggaagcgga 1080 

agggtttttt ttagacggtt ttatttttta gtatcgatga taggttggtg atgagtgtcg 1140 

ttttttgggt aggagatgta gggtgagagt ggggattgga ttttaggatg ttgggatttt 1200 

tgttattaaa tatacggggg atatatattg tttggtatat agttggattt tgttaattag 1260 

ttttgcgttc gagaagtttt atagtatttt tttcgatttt atagtagggc gtagttatat 1320 

tttttagagg tatttatatt gttttttttt tttgtaggcg ttgggttttt taatattttg 1380 

gtaggttttg atttgttttt tttattagat tggggttttg gatggatagg ttagttttgt 1440 

ttatattttg gattttttat ttaagcgggg atagttagtg tggtggtatt gaggattagg 1500 

tggttagggt ttttagagtg ggtttatttg gtagtagtta tgttggggtt attattaggg 1560 

gttggtgttg agttggggtg aggagggcgt taggtttatt ttagggatgc ggaagttttg 1620 

tatttcgatg ttacgggatg ttatatgggt tatatttagg gggatgatgt ttttaaagcg 1680 

ttgtatttcg tgaattacgg tagtggtgta gggtatgtga gtttggttat ttatttttgg 1740 

tcgtcgtatt tgttttatta cgtcgtxgat tttttgttgg atacggattg gatagatatg 1800 

cgtttttata atgggttagt atttagggga tatttttttt ttttttgtgt tggaggaagt 1860 

taggtttata ggagtttrggt tacgtttgtg ttggaagttt cgggtgtttt agttaagttt 1920 

aggggttttt agttgtattt tttttttttt agtttttgtt ttgggtttta gttgggttta 1980 

cgttgtatat ttaggtgtag gattatgagt aggaggtttt aggttagcgt ggtcgaggtg 2040 

gttattattt cggtaaggaa taggttattt attattatgc gtaggttttt attattgaag 2100 

ttgtttttag ggtttttttt ggtttgagta gggtcgagag gatatttagg ggatagaacg 2160 

gggtagtttt taaatgattt ttaattttgt atttgttagt ttagatgcgg ttcgtcgggt 2220 

gatgtattgg tttaattttt tgtttagttt ttttttattt tttttgggac gtttaattta 2280 

ttatttttgt tttttatcgt ggtagttatt tttatttttt ttttttttgt taggaaggtt 2340 

ttagttaggt ttcggggtgg ttgggttggg ttttaggtta ttttgtgttt agttagtagt 2400 

ttatttagtt gggttaggaa agttttttgg aagcgtagga ttttgttagt tagcgttggg 2460 

atgtgcggga ggacggggat agtattnagt atttatatta gatagaacgg ggttttaatt 2520 

ttttttgtgt tttgcgttta tttggattag ttttaggttt tagttatttt taggaagatt 2580 

tagggtttgt ttgtttttat tattgatttt attaagtttt tttttaagtg ttagttttta 2640 

tttttttttt ttttgtttag aggagaaatt taaaatcgaa atttttaacg tggacggggg 2700 

tatagagttt ttggtttttt ttggtgtttt ttgattcggg tatatttttt ttacgattat 2760 

gtttgagatg tttttttttt ttttaggttt tttttatagt ggggtttttt ggaatgtttt 2820 

tttttaaatt tatttatgta aattttgttt ttcggaggtt ttagtttagt ttcggtattt 2880 

tttaggagtt cgttttgtag agatttttcg gtttttcgtt tcgtatttcg cgtaggaagt 2940 

tcgatttttt ttttagtttt ttttgagtta ggtttagtag tttgaggaag cgagggtcgt 3000 

cgtattcgaa gcggcgttcg taggtgaggg aggcgattac gttgtttacg gttttgttta 3060 

agaggtcgtt ggggcgaaag gggcgttttg ggggtgggag atgcgggtaa ggggttgttt 3120 

ttttcgtttt tcgttttttt agttttcgtt ttgtgttttt ttgtttatta tttatcggtt 3180 

tggtcggcga aggcggtata aaggtaggcg gttttttcgg ttatttattg ttttagcgat 3240 

tttttgttta ggtttaagtt gcgtaaggtg gatacggaga agcgtttttg ttcgcgttac 3300 

gcgggtttat agcgcgatag gattattttt gggggcggga cgggtacgtg ggcgttgtta 3360 

tgaaggtttt ggttttattt ttcgttattt attttaattt tggcgtttta taaggttttt 3420 

cgtagttttt agttcggttt agttgggtat agggtttatt ttttgtttat ttatattgtt 3480 

tttttgtttg gggcggggtt tggttttatt tcgtttttgt ttattttgat tattttttta 3540 

tttaaggaag atttcgttcg tttcgtttat attgagttcg tagtataggc gcggttttcg 3600 

ttatcgttat ttcgacgtat tagtttcgtt tatcgggttt ttggcgggtt tgggtagtag 3660 

tttcgttttt ttttagttta tagattcgta tttttttcgt gtaggtggtt ttttggttta 3720 

ttgtttttag tttattcgtt ggtttttatt tttgttttac gtttaggatt ttacgttttg 3780 

tcggcgttgt ttgggttacg gttattgttt attcggggtt tacggaaacg cggtttttgt 3840 

.__±£tttatcg±-.-cg:^ 3900 
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aggcgggcgg tcggtcgtgt tttcgtcgcg ggttattatc gtttcgcgta cggtcgttag 3960 

tttattgagt acgattatcg gcgtttaggt tagttgtagg ttgaatacgt tttcgaagcg 4020 

gcgtcgtaat tgtagaggga gggttagggt tttttgttaa gttaggatta ttttagatta 4080 

taggttttag ttttatttga attttggacg attttcgggg ttattaggag tgagtaggtg 4140 

gaaggaggag atttagtttt ttgattttgg ggcgggggtg ggggttatat tttttgtgat 4200 

ggaggaattt agtttggatg cgttatttag gtatgatttt gtaagagtta ttaaaattgt 4260 

cgagaggttt tagttagtat tttattttta gatgatggtt tatgtcggtg agtagtgagg 4320 

ttcgaggatt tatagtgtaa aaggtttgaa tcgggttatt gtattttttt tattttcgat 4380 

ttcgtgattt aaacggtatt taggattaat ttatttttta tttttaaggt tttttttttt 4440 

ggtgttagta gaagggattt tgtattttat aatatatgtt gtttaatggg tttgtatgtt 4500 

tattgttaag tttagtttta tttttaggtt tttgttttat ttttttttgg tttttggaaa 4560 

atttagtttt ttatgttatg tataaatgtt tttttttagg acgtttttta aatttgtttt 4620 

ttttttttag tttggttttt gatttagttt gtggtttaat ttattattta tgtttgttgg 4680 

tggtggggta tttttaggat ttttgtcgtt ttttaggatt ttttttttta tttggtcgaa 4740 

gtagtatggt gtgttttgga agtttatatg tagtaaggtt gtttagttcg ggtagtggta 4800 

ggggatttgg cgggtagcgt gtagtttagc gttggtgtcg gtgtattagg tttattagga 4860 

gtaggaagat ggttattatt atggttaggg gtattagtgt ttttagtttt atggttgttt 4920 

tattattaat tgggtttttt tggatatatt tggtattttt attttattag gtatagagga 4980 

ttaggtagga tattttcggt atatcgagcg cgtgattttt tttttataaa gggagttgat 5040 

gatggttttc gttttttgtt gtgagtgaat ttgttgtgtt gattgtgttg ttagtggtag 5100 

agttaggtta gggtaggtat gggttgtttt agaggttttt gtcgttgttt tttgttttag 5160 

gtttttattt agggtagggt ggtagaaagg tttggtcgga gaagttattt ttttttttta 5220 

ttttaagttt tttaagttta tataggtttt tgggataatt agggttttag tggattcggt 5280 

tatttatttt ttagttaggt ttatatattt taatgtagtt ataatttttt ttttagaata 5340 

tgattttgtt ttttttttat ttttatttgt ttattttaga gtgattttta gtatttttat 5400 

ttgttattgg tatttatttg gggttttaga gtttttgatg atgagtggta ttatgggttt 5460 

ggttttttta ttttattttg tatttttgat atgtatagac gttatgtata tatttgatgg 5520 

tgtatagatt ttttgtttat ttttagatat ttgtttattt gtttatattt gtagggatac 5580 

gattatatat gtagaaaatt atttatataa agataatatt tatatatata tagatttata 5640 

ttgatattta gggtatatat ttttttttat atatattagt tatatatata tatagattcg 5700 

gtattaagta ttttattttt tagttatgtt cgaggttttt tggatgggat tttttttgtt 5760 

tagaggttgt tttcggtgag ttttaaagtt gttatatgga ttttagttta gtttatattt 5820 

tgggttttgg tcgggttatg attttttgtt tgtaataggg ttgtttttag agtttttagt 5880 

tggtagtttg aaggtttttg ttttagtttg tgatagtatt ttttagggtt gtttgagggt 5940 

cgttattttt tattgttttt tggtttttat gtttttgatt agaaatttgg tggaaatatt 6000 

a 6001 

<210> 295 
<211> 6001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 295 

taatgttttt attagatttt taattagaaa tatggaggtt aggaagtagt ggagaatgac 60 

gatttttagg tagttttgga ggatgttgtt ataggttggg gtaagggttt ttaggttatt 120 

aattgggagt tttgggaata gttttgttgt aaataggaag ttatggttcg gttagagttt 180 

agaatgtggg ttgagttggg atttatgtga tagttttgag gtttatcggg agtagttttt 240 

ggataggaga ggttttattt aggaaatttc gggtatggtt gggaagtggg gtatttggtg 300 

tcgggtttgt atgtgtgtgt gattggtgtg tgtgagagag aatgtgtgtt ttgagtgtta 360 

gtgtgagttt gtgtatgtgt gaatattgtt tttgtgtggg tgattttttg tatgtgtaat 420 

cgtgtttttg taagtgtgaa taagtggata agtgtttggg agtggataag agatttgtgt 480 

attattaggt gtgtgtatag cgtttgtgta tgttaagagt gtaaggtgaa gtgaagggat 540 

taggtttatg atgttattta ttattaggag ttttaaggtt ttaggtaagt gttagtgata 600 

gataagggtg ttgaaggtta ttttggagtg ggtaggtggg ggtagggaaa gggtaaggtt 660 

atgttttgga ggaggggttg tgattatatt agggtgtatg agtttagttg ggaggtggat 720 

ggtcgggttt attgagattt tggttatttt agaagtttgt gtgggtttgg ggagtttgga 780 

gtggggagag ggggtgattt tttcgattag gtttttttat tattttattt tgggtaaggg 840 

tttggagtag gaagtagcgg taaggatttt tggagtagtt tatatttgtt ttggtttgat 900 

tttgttattg gtagtatagt taatatagta ggtttattta tagtagaggg cgaaggttat 960 

tattagtttt ttttataagg gaagggttac gcgttcggtg tgtcgagagt gttttgtttg 1020 

gttttttgtg tttggtgggjg tgggggtgtt aqgtgtgttt agaggagttt agttggtagt 1080 

g^ggtagtta tggggttaga agtattggtg ttttt'ggtta tgatagtggt tatttttttg 1140 

tttttggtgg atttgatgta tcggtattaa cgttgggttg tacgttattc gttaggftttt 1200 

ttgttaittgt tcgggttggg taattttgtt gtatgtggat ttttagaata tattatattg 1260 
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tttcgattag gtgagggagg aggttttgga gggcggtaga ggttttgagg atgttttatt 1320 

attagtaaat argggtggtg ggttaaatta taggttggat tagaagttag gttgagaagg 1380 

ggaagtaggt ttgggggacg ttttggggaa ggatatttat atatggtatg aaggattgga 1440 

ttttttaaag gttaaggaag agtagggtaa gggtttggag gtggagttgg atttggtagt 1500 

gggtatgtaa gtttattggg taatatatgt tatggagtat aaagtttttt ttgttgatat 1560 

tagaaggaaa ggttttggga atggaagatg agttagtttt gagtgtcgtt taaattacga 1620 

aatcgaggat gaagggggtg tagtgattcg gtttaaattt tttgtattgt gggttttcgg 1680 

gttttattgt ttatcggtat ggattattat ttgggaatgg gatgttaatt ggggtttttc 1740 

ggtaattttg gtgatttttg taaggttata tttgggtgac gtatttaaat tgagtttttt 1800 

tattatagaa ggtgtgattt ttattttcgt tttaggatta ggaggttggg tttttttttt 1860 

ttatttgttt atttttggta gtttcggggg tcgtttaagg tttaaatagg attaggattt 1920 

gtagtttggg gtgattttgg tttgataaga ggttttgatt ttttttttgt agttgcggcg 1980 

tcgtttcggg gacgtgttta gtttgtagtt ggtttggacg tcggtggtcg tgtttaatgg 2040 

gttggcggtc gtgcgcgagg cgatggtgat tcgcggcgag gatacggtcg atcgttcgtt 2100 

tgcgtttatt tattaggttt tgggtttcgg gtcgcgtttt taaggtaagc ggcggtgggg 2160 

gatagagatc gcgttttcgt gggtttcggg tggatagtga tcgtagttta agtagcgtcg 2220 

atagggcgtg gggttttgga cgtgaaatag agataaaggt tagcgagtgg gttgaggata 2280 

gtgggttagg aaattatttg tacgggggag gtgcgagttt gtgggttggg agggggcggg 2340 

gttattgttt agattcgtta gaagttcggt gggcgaggtt gatgcgtcga agtggcggtg 2400 

gcggggatcg cgtttatgtt gcgggtttag tgtgggcggg acgggcggga ttttttttga 2460 

gtggaaaggt ggttagggtg ggtagagacg aggtggggtt aaatttcgtt ttaggtaggg 2520 

gagtaatgtg ggtgagtaaa gagtgggttt tgtgtttagt tggatcgggt tagggattgc 2580 

gggagatttt gtggagcgtt agggttggag tgggtggcgg agggtggggt taaggttttt 2640 

atggtaacgt ttacgtgttc gtttcgtttt taggggtgat tttgtcgcgt tatgggttcg 2700 

cgtggcgcga gtagaggcgt tttttcgtgt ttattttgcg taatttgggt ttgggtaaga 2760 

agtcgttgga gtagtgggtg atcgaggagg tcgtttgttt ttgtgtxgtt ttcgtcgatt 2820 

aagtcggtgg gtgatgggta gaagggtata aagcgggaat tgggaaggcg ggggacggag 2880 

aaggtaattt tttattcgta ttttttattt ttaggacgtt tttttcgttt taacggtttt 2940 

ttggataaag tcgtgagtaa cgtgatcgtt ttttttattt gcgggcgtcg tttcgagtac 3000 

gacgattttc gtttttttag gttgttggat ttagtttagg agggattgaa ggaggagtcg 3060 

ggttttttgc gcgaggtgcg gagcgagaga tcgaggagtt tttgtagggc gagtttttga 3120 

gaggtgtcgg ggttggattg gggttttcga agggtaggat ttgtatagat gggtttggga 3180 

aaggatattt taggagattt tattgtaaga agggtttgga ggaggagggg atattttaga 3240 

tatggtcgtg ggagaggtgt gttcgggtta gggggtatta ggagaggtta aggattttgt 3300 

attttcgttt acgttggaga tttcgatttt aggttttttt tttgggtaag gagagagagg 3360 

gtggaggttg gtatttgggg agggatttgg tgaggttagt ggtaaggata ggtaggtttt 3420 

y »a u- yy«ya«-yv,«-^ aaya 1 -^ <-y u y dn.yyi.n.ay y Lyaacy uciy ay i_a Lay u ay 3*froU 

ggattgagat ttcgttttgt ttggtgtagg tgttgaatgt tgttttcgtt ttttcgtata 3540 

ttttagcgtt ggttggtaag gttttacgtt tttaaaaggt ttttttgatt tagttggatg 3600 

agttgttaat tgagtatagg atgatttggg atttagttta gttatttcga gatttgattg 3660 

aggttttttt ggtaaagaag gagaaggtga gagtggttgt tacggtgggg ggtaagggtg 3720 

gtgggttgaa cgttttagga ggaatgaggg gaggttgggt aaaaggttgg attagtgtat 3780 

tattcggcga gtcgtatttg ggttgatagg tgtagaattg gaggttattt gggggttatt 3840 

tcgttttatt ttttgagtat tttttcggtt ttgtttaggt taaggggagt tttgagagta 3900 

gttttaatga tgagaatttg cgtatagtgg tgggtaattt gttttttgtc gggatggtga 3960 

ttatttcgat tacgttggtt tggggttttt tgtttatgat tttatatttg gatgtgtagc 4020 

gtgagtttag ttggggttta aggtagggat tgagggagga agggtatagt tgggggtttt 4080 

tgggtttagt tgggatattc ggggttttta gtataggcgt ggttaggttt ttgtaagttt 4140 

aatttttttt aatataggag gaaggagagt gttttttggg tgttgattta ttgtggggac 4200 

gtatgtttgt ttagttcgtg tttaatagga gatcgacgac gtgatagggt aggtgcggcg 4260 

attagagatg ggtgattagg tttatatgtt ttgtattatt gtcgtgattt acgaggtgta 4320 

gcgttttggg gatattattt ttttgagtgt gatttatatg atatttcgtg atatcgaagt 4380 

atagggtttt cgtattttta aggtaggttt ggcgtttttt ttattttagt ttagtattag 4440 

tttttggtga tagttttagt atggttattg ttaggtgggt ttattttagg aattttggtt 4500 

atttagtttt taatgttatt atattgattg ttttcgtttg gatggggggt ttagagtata 4560 

ggtagggttg gtttgtttat ttagagtttt agtttagtgg ggaagataaa ttaggatttg 4620 

ttagaatgtt ggaggattta gcgtttgtag ggagaggggg tagtgtgggt gtttttgaga 4680 

ggtgtgattg cgttttgttg tggggtcgga gagggtattg tggagttttt cgggcgtagg 4740 

attagttgat agagtttagt tgtgtgttag gtagtgtgtg tttttcgtgt gtttggtggt 4800 

aggggtttta gtattttaga gtttagtttt tatttttatt ttgtattttt tgtttaggga 4860 

acgatattta ttattaattt gttatcggtg ttgaaggatg aggtcgtttg gaagaagttt 4920 

tttcgttttt atttcgaata ttttttggat gtttagggtt attttgtgaa gtcggaggtt 4980 

tttttgtttt ttttagtagg tgtttgtggg gagttcggtt ttttgttttt tttcgtggag 5040 

ttttgtaggg gtattattta ggagttaggt ttattgacgt tttttttttt tttataggtc 5100 

gtcgtgtatg tttcggggag tttttggttc gtatggagtt tttttttttt tttatttttt 5160 

tgttgtagta ttttagtttt ttcgtggtcg tcggatagtt tcggtttagt tattttcgtg 5220 

tcgttagttt tttggtgatt ttattttttt acgagttttg tgttgtgttt cgttagaatg 5280 

gg3$atttag_ tttttagt.^ 5340- 
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ggtagtttta atttgggttt tttgtttacg ttttcgttgg gattattttt tttagggtaa 5400 

ttttattttt gttttatttt tgtttatttt atcgtttggt cgtatttgag acgggtacgt 5460 

tgaggttgag tagatgttag ttatttttgt ttataatttt atgtttttta ttgatttaat 5520 

tttgattgtt tagattggtg ataaggatta tattgttttg gtatgtgggg aaggggttag 5580 

aatgggttga ttagaggtgt tagttagttt tggatgtggt ggagagggta ggatttagtt 5640 

tggaggttta tattttaggt ttaatttagt ttattttata ttagggatag tagttttgtt 5700 

agtattatta taatagttat ttttttttat atatgatatt ttaaaatgga agataaatta 5760 

tgttagggag ttatatgtta gggttatttt ttagggttta gtcggtaggt gttagaatat 5820 

tttttgggaa ggttttagga aaatttagga tcgagttatc gtttttagtt tgttattttg 5880 

tgtttaaaat tggtgggttt ttggttttat tgattttaag aatgaagtcg tggattttta 5940 

cggtgagtgt tatagttttt aaagatggtg tgtttagagt ttgttttttt tgatgttaag 6000 

a 6001 

<210> 296 
<211> 6456 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 296 

taagaatttt aagttattat ttttgtgtat gtaggtagta ttagtgttag taataataag 60 

ttagtttttt tggtaaattt attaaaatta aattttaaat ttttgaaaaa attaatatgt 120 

ttattatgtt tttgttagtt tagaggttta attatttgtg ggggagggtt tataattatg 180 

attaattatt tatgatttga aggtatttat aaaagtattt gtttttttaa gaaattatta 240 

aagaattagg tttaagattg taaaattaat tttaggggta atagtaaatt ataggtttgg 300 

ggatttaatt ttgataattt ttttttggaa agaaaagaat tgttgttaat tgtttttatt 360 

ttttgatttt gtttgagtaa atattcgtag tttttaaaat ttagaatttt tagtttggtt 420 

aagtaagtta tggggattaa tggttgtttt aaggtagagg tagaagagaa gaaataggaa 480 

ggaaataaaa agggaatgaa gatttgtagt ttatgagggt gatattatta ataatttttt 540 

taagtattat tgaattttat ggttcgtgat tgagagtttt ttttgatatt ttataggtta 600 

attattttta aggatttttt tggttttaga gtttgtaatg tttgggtttg aatatgattt 660 

gttttataaa aaaggttagg tgaattgatt tttttttgtg aagttgtttt atggtaagag 720 

attattaatt tgttgaaacg taaataattt tttagatatt tgaagtattt ttaaatttat 780 

aaagtgtttt tatatacgtt attttatttt atttttacga ttttaatagg taggtgttat 840 

tattatttta atacgtaaaa ataattttag aaaataatgt ttttgttttt agtttataag 900 

aatatttttt tgaaatttat agtatagaat ttaagttatg ggttagttag aaacgtttag 960 

aaatatatta attttaggag gataaattaa gagaatagga tggattagat aaaatttata 1020 

agtataatgg tagaaagtgt gtttttggta taagaatggc gtaagttttg taaataatag 1080 

tgttatatag gggtgttttt gtagtatttt tacgtttgga aggatattat ttatttatat 1140 

atatgataag tatttttaat tggatatttt ataggtaata gataaaataa aataagaatt 1200 

ttcgttttaa aattgtagtg gtggaattta tttttcggga tgtattaggt gattttttta 1260 

ggacgatagt tgagttatag ggggttaaat atattttttt tttttttttt taatattata 1320 

aagagtttta aaaacggatt ttgttgtttt taataaataa aatatatatg aaaaaaagtt 1380 

atttataggt ttttatatat aattttattt tttaaaatat ttattttttt ttattagtta 1440 

ggtcgaggta taatgcggtt ttaattttat aattatttat tgttttattt atatatttat 1500 

gattttttga tgttttttag tttgagttta aattgcgttg ggaataaagt atcgtataaa 1560 

ttattagttt gggataagag gagataagtt aaaatttttt aagaagattt tgtagatatt 1620 

tggttttttt gagtttgtag agagtttggt attttaaagg ttagttttgg gtttttttat 1680 

tttgatgatt aatgttaaat tgtttgattt tttataaaaa tatttagttg tgtttattta 1740 

tattatttta tttttagcgt atatagtagg tatgtatttg ttgaatagaa gttttaaatg 1800 

gatgagaaaa aagatatttt tgttatatat ttttttaaga tcgagaaagg taaaattata 1860 

atttaatttg tagaagtttt taggtatttt tttttaatgt aaagttatgt atttttgttt 1920 

aaaaaagaaa aggatgatat aattaaataa tgttaattat ataatgggga aattttagga 1980 

ttattaagaa aatattatta ttttaagtta tatattttta tttaatattg gttattatat 2040 

aggagagagt aagaattgtt gtttttttat ttattcgtat aatttatttg ggaaaagtat 2100 

gggtagaata tttatgattt aattatatta tattttggat ttatatttaa ttttttggag 2160 

aattagttta aagagtatgt tcgtttagag atataaagta aaatagatat agggataata 2220 

gttttggaag aaatttttat gacgttgttt gagattgggt attagtattt aaatttttta 2280 

gattattagg agttattttt attacgtaat tttaataaag agaaattgag agtatgattg 2340 

gtaaaaatat tatgatgttt ttgtattttg ttaatttgat ttatatttgg ttttatttat 2400 

aaagttttta ttgattatta tacggtattt ttatttttta gttttattta ttttgttttt 2460 

taaaaatttt ttaacggtat ggttacgaaa aatatgcggg ggttagcggg gtatgtttta 2520 

ggttggttta ttttgttttt aaaacgtatt tatttttttt ag^ttttttt ttagggtggg 2580 

tagggagaaa acgtttttaa gttgtatagt tgtagggaag tttgtttagg ttatgttttt 2640 

ttgatttgtt tttcggttcg ttgtttggta aggttatcgg ttttggttcg cggttttgtc 2700 
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gataaaattt 
ttttgtttcg 
ggttttagat 
gtttttaggt 
tcggtttgta 
gttttgaata 
ttagcgcggt 

tggggcgggc 

tttcggagat 
ttttcgtcgt 
gtcgtcgtcg 
tcgatttttt 
tttgtttaag 
gaagaaagtg 
ttttttttta 
ttaatttttt 
agtcgttttc 
gtagtggaac 
ggaggcgtcg 
tcgttatttt 
ggagggcgtt 

gggtcgtaga 
agaaggggtg 
ttacgaggag 
tttcggttgg 
gaagattagt 
tttgggggag 
gttataattt 
cgtttcgttt 
agggtggggg 
ttcggggttc 
gaggtggttg 
ttcggagttt 
tagtagttat 
atttttatta 
gtaatcgttg 
gagcgggtgg 
ttttttaaaa 
ggttatatag 
ttaatttttt 
ttttattttc 
tttttttatt 
tttttttttt 
atgtgatttg 
ttcgcgatta 
atatttaatg 
tattagtata 
agatgcgttt 
gcgtgtgttt 
tgcgttttcg 
aatacgttcg 
atttatattt 



aatttttatt 
tatttttttg 
ggcgagggcg 
gttttagtat 
ttcgtacgcg 
tttttttata 
ataggtcgtt 

ggggggtgcg 

cggtttagtt 
tgtagaagtt 
cgaaatttcg 
tttttttttt 
cgcgttttag 
taattagata 
cgaagttttt 
ttggtttttt 
gcgcggagag 
gggagggggc 
ggtacgttaa 
tttcgttttt 
taggggtcgc 
atgtgggtgg 
gttagatatt 
atatttaaaa 
aaacggaatt 
tatttggtcg 
gggagatatc 
cgtttgtcgt 
ttaagtttta 
gtggggggtt 

ggattcgcga 
ttttattttt 
tttgtataat 
ttgggattta 
cgacggtcgt 
gttcgttcgt 
agcggatcgc 
gtcgaggaaa 
gtagtcgggg 
ttcgttcgag 
gtagggaatt 
cgttttttag 
ttttttttta 
attttatttt 
taaaggttgt 
aaatttttta 
gcgataagag 
atatgtatac 
aattttaata 



agtatggagt 
ttcgagtaga 
gtgtgcgtac 
tttttttttt 
atttcgtttt 
ttatttttcg 
ttttagattt 
tatttgaggg 
tcgtacgtgt 
atttttttcg 
gttcgcgtcg 



cggggtttta 

taggaagttt 
tagcgagttg 
ttttagaggt 
gttttagttt 
ggatacgtgg 
tattagataa 
agttcgtaag 
ttatttttta 
ttttcgggat 
gtttagtcgc 
tcgcggtttc 
gcgagggggt 
ttcgttgggg 



ttgtaagatt 
ggtttgagga 
tagttttttg 
cgttatttcg 
ttttaacgtt 
ggcgtttttt 
tttttttatg 
tttaattagt 
ttatttgttt 
ggatggagag 
tttgtagagt 
ttcgaagcga 
tttagaatta 
gtttagaagt 
atcgttattt 
cgagaaaagc 
gttgtagggg 
gtgtcgagta 
tgttacggtt 
tgaggttcgg 
gtataagttg 
gggtttttag 
tgtttagaaa 
acgattcgta 
tttttttcgg 
gggagggttg 
gttttttacg 
ttttttaaag 
aattgatttc 
aaattttgcg 
attttttttt 
ttttcggtta 
aaagcgttag 

gggggttttt 

atgttcggaa 
tttttgttag 
ggcgagagta 
gagtttttag 
agtcgtcgtc 
tttagtttcg 
gcgtttgttg 
acggtcggtt 
tttttttttg 
aaaaaaagtt 
agtcgttttt 
tgcgtttttt 
gttatatcgg 
ggacgggcgc 
ttttaaatta 
agtttgggta 
gcgttaatat 
gagagggagt 
agtgtatttt 
cgtttggaga 
aagtaggatt 
ggagaatttt 
gggttaatta 
tttattttat 
tttttatttc 



seq_prot . 
aataggtcgc 
ttttcggttc 
gtttatttga 
cggatggtgg 
acgaaacggt 
tcgattttcg 
tcgtttggac 
tggcggtcgt 
ttcgtcggtt 
attttttcgc 
cgtcgtcgtc 
agtttttaat 
ttgtatttaa 
ttttttcggg 
ttcgcgcgtt 
gaagttattt 
tttttcgggg 
gtttagagtg 
taatatttat 
acgtgtaggg 
gagcggaagg 
gattttgtaa 
ttgttttttt 
tatataaatg 
ttgggaggat 
gggtttatcg 
ggattttaag 
aataatttta 
gatttgagta 
tttttagggg 
atttttagga 
aaggttcggt 
agtagggtat 
tttttttagg 
ttttcggggt 
gaattaaata 
gagtttttgg 
ggattgttcg 
ggtcgtttaa 
ggtttcgtag 
ttttcgtcgg 
tcggcggaga 
ttgttgttgt 
tgatatagtg 
ttttgttttt 
tcgaagtttg 
ggatcgcgat 
gttgcgggga 
taaaagtaaa 
ttggtcggtt 
ttatttcgtg 
tttcgagggg 
aaattttaaa 



txt 
gtaggatttt 
ggtttttttc 
ttacgttggg 
agagcggtat 
tttcgttttt 
ttacgtattc 
gtcgttcgtc 
cgttcgttta 
taggcgttcg 

ggggttggcg 

gtcgtcgtcg 
atagatacga 
aggaggagac 
gtttttttag 
tttgtttttt 
tttttttttt 
atgagcgagc 
tgtcgggagc 
ttttatatta 
agacggggtt 
attgtgggtt 
ttcgatgttg 
tttttttatt 
gtttttttgg 
ttttttggtg 

ggggtgggcg 

ggtgggagaa 
ggaaggggat 
aaatttaatt 
tgagagagaa 
ttttttcgtt 
ttagtttgta 
tttaattttt 
gtcgtcgcgg 
atttcggaat 
aacggcgcga 
tatttattat 
ggggtttgag 
ttcgtttgtt 
cgaaggaagc 
gtttcgtatt 
gaaacgggag 
tgttttttta 
ttttaatatt 



gtcgggtttt 
gcggagtagg 
gggtatagcg 
gcggacgggg 



aattgtatga 
tcggtgtcgc 
ttagtttttt 
aagtcgattt 
ttatagtttt 
gtagtttatt 
tcgagggggt 
taattagatt 
gcggagggcg 
gtcgtt 



tgtttttgtt 
ttaagtatta 
tttcgcgtgt 
ttagggttgt 
tttggaaatt 
gcgtgtattt 
gaggggaagg 
gaaatgtaag 
tgtgaattat 
ggcgtgtttc 
gggtgttttt 
aaaattggtt 
taaggaaata 
gtttcggatt 
tattcgagcg 
tgcgagtgtt 
ttggggaagg 
gagggacggc 



tttagtcgtt 
gcgcgcgcga 
gtgttcgaag 
gtttttttcg 
taagtcgcgg 
gtattgttgg 
gcggcgtcgt 
tttcggggtt 
ggcgttgggg 
taattttggt 
gaggagcgag 
ttatatagtt 
gggaggataa 
attttttttt 
tttaagtcgt 
gtagggttga 
ggcgcgggac 
gcgggggagg 

ttgtgttgcg 
taggggtgtc 
tattcgtgtg 
ggagtgtgcg 
tttagttttt 
ggtaaaggag 
gaaattttgg 
agcgagcggt 
aggggtgttc 

agtataatcg 
ttttttttaa 
ggtttttgtt 
gtaggtagta 
gtcggacgtt 
tttttttatt 
tttcggtagg 
tatttgtttc 
tttttacgat 
atttagaatt 
ttaggtcgag 
ttgtattttt 
gtcggggacg 
gttttggttt 
tagtaatgtt 
cgttggttga 
ttcgtatttt 
ttttgttttt 
taagggtcgt 
gcgtatacgt 
ttattgcggg 
atagtaaacg 
ttttgtgtat 
tggagtttat 
tatatttttt 
tatgtgagtg 
gtgtgggtgt 
aggttgggat 
tttataatta 
taggtttttt 
tcggttaatt 
tatagttttt 
agtcggtttt 
agttattagt 
ggagggcgtc 



2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6456 



<210> 297 

<211> 6456 . , . 

<212> DNA 

<2M> l A rtl fi cl al _ se q u e n ce__ : 
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<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 297 

agcggttttc gttcgttttt agcgagcggc gcgggcggcg tttttcgtcg ttttttcgtt 60 

tttcgtcgtt gtgtttattt tttcgtcgga gagagtgttg gtaatttttt ttttagagtt 120 

tgattatttg tttcgcgagg tcgcggatac gtgcggagag tcgattgata ttcgtagttt 180 

tttcgggagg ttcgacgcga ttgggttttt taggtgagga gttgtgcgtt cgggtgggtg 240 

ggttgcgggg tggggggttt cgggggagat ttgggggatt ggtcggggtt cgaggtagaa 300 

ttgtgggtgg ggtaagtgga gggtggcggg gggtgggggg gatttgtgtt tttttggaat 360 

cggttttggt taatttttta cgaattgaga cgggattggt tatgggggtt agttttgggg 420 

agttagagga ttttttttat ttgatgagag ggggagattt tagtttaggg atattcgcga 480 

tatcgagatt ttgtttttac gtattcgtac gtatatatat ttatacggaa tacgttttat 540 

atagtttttt taaacggaat tggaatttta ttcggatatt tatataataa tttatattta 600 

gggtttaaaa tgtattgttt ttagggattt tatgttggag aatgtgtttg tatttttttt 660 

tcgaaaattt tttttttagt tcgttgaggt gtgggtgtgg gttttatttt tttttttacg 720 

agataaatgt taacgtaggt tttttgcggg cgtgttatat atagaaaagt gtacgtagtc 780 

ggttagtgtt taggtttgga gtttcgcgaa gacgtacgtt tgttgtagtt tttaagttta 840 

tttttataat ttaaaatatt ggagttaaat atacgtttcg tagtgagtag ttttggtttt 900 

cgtagcgcgt tcgttcgtgt gtatatgggc gtatttgcgt gtgcgtatac gcgggagtcg 960 

cggttttcgg tgtggttttt tgtcgttgtg ttggtggcga tttttgtggt gtttggtagg 1020 

tttcgggaga ggcgtatgaa ggattttatt gagtgtgaag atagagagta gagatagaaa 1080 

gtagaaggag gcggttgtag tttttgtagt cgcggaggaa tgcggaaatg ttgaagtatt 1140 

atgttaaatt ttttttgaaa tggggttaag ttatatttag ttagcgtggg aaagtaataa 1200 

taataataaa aaaaagtgga ggagaggggg ggaaaaggta ttgttgtttt cgtttttttt 1260 

cgtcgaagtc gatcgtttgg ggggcgagtg gggggagagt tagggtagtg cggggttcgg 1320 

cgggggtagt aggcgtagtt ttttgcgggg gtgggacgtt ttcgacgttt ttttcgttgc 1380 

ggagttcggg gttagattcg agcgggaggg ggttgggggg atgtaggata ggcgggttgg 1440 

gcggtcggcg gcggtttttc ggttatttgt gtggttttcg atttggttta gattttcggg 1500 

tagtttttgg gggttttttt ttcggttttt aaggaaagtt ttgaatgtag tgagtgttag 1560 

aagttttgtt ttcgtcgcgg ttcgttttat tcgtttatcg tgggggtcgc gtcgtttgtt 1620 

taatttttgg taaaaggcga gcgggttagc ggttgcgagg taggtaattt cgaggtgttt 1680 

cgggaatttc ggatatgcgg tcgtcgtggt gaaagttttg tcgaggtcgc ggcggttttg 1740 

gaggagggag gttttttagg ttttagatag ttgttgaatg gggagggaaa gttagggtat 1800 

tttattttgg cgttttgttg tgtagagggt ttcgggggcg ttcggttgta gattggatcg 1860 

ggtttttagt cggaggggga atggggtagt natttttgtt atttatagcg gggaggtttt 1920 

gggaatggga aaaggttcgc gagttcgggt ttcgaggata gagatttttt tttttatttt 1980 

tggaggcgta ggatttaatt ttttattttt tattttttga aagggaggtt gagttttgtt 2040 

taggtcgaaa ttagtttaaa atttaaggac gaggcgcgat tatgttattt tttttttgaa 2100 

gttatttttt aaggagacga taagcgaaat tgtggcgaat attttttttt tttatttttg 2160 

ggatttcgtg ggggacggtg tttttttttt tttaggatcg ttcgttcgtt tattttcgat 2220 

aagttttaat ttttttcgat tagatggttg gtttttttaa ggtttttatt agggaggttt 2280 

ttttagtcgg agagagaatt tcgtttttaa tcgagatttt tttattttaa aaagattatt 2340 

tgtgtatgcg agtcgttttt aaatgttttt tcgtagaaaa attgaaagta gaaagaaagg 2400 

aagtaatttt tgagtagatg tttgattatt rtttttcgta tatttttagt atcgggttat 2460 

agggttttgg gagtttttat ttatattttg cggttttata cggatggatt tatagttttt 2520 

tcgttttagt ttgtgcgcga tttttgaacg uttttcggta tttttggatt tcgttttttt 2580 

gtacgttcgg gttttaggag gcggggaggg t:agcggcgta atataatgat atagggataa 2640 

atattaagtc gtgatattga cgtgttcggc gttttttttt tttcgcgttt tcggtatatt 2700 

ttgggttgtt cggtacgttt tttttcgttt tattgcgttt cgcgtcgttc gtttattttc 2760 

gaggggtttt tgtaattttt tcgcgcgaag acggttttag ttttgtaggg aaagaaaagt 2820 

aatttcgttt ttttcggagg aattaggaag gattaagcgg tttgggagag ggtaggagcg 2880 

cgcggagggt agcgatggag gtttcgtaaa ggaggaggag gggagtttgg aggaatttcg 2940 

aggaaggttt ttgggttgtt tgattgtatt ttttttttat tttttcgttt ttttttttag 3000 

gtgtaatgat tttggattga gacgcgtttg ggtagaggtt atgtaatcgt gtttgtgttg 3060 

aggatttcgt ttcgaggagg gaagaggagg gatcggttcg ttttttcggc ggcggcggcg 3120 

gcggcgattt tgtaggcgga gtttcgcggc ggcggtatta gggttacgtt agtttcgcgg 3180 

ggaggttttt ttatttagtt tttgtagcgg cgaaagtttt agcgttcgag cgtttgagtc 3240 

ggcggggagt aagtaaagtt agatcgattt tcggggagtt tcggagtagg cgagcggcgg 3300 

tcgttagtta gttgagcgta tttttcgttc gttttagcgg cgtcgcggcg ggcggcgttt 3360 

aggcggtatg gagaaggacg gtttgtgtcg cgttgattag tagtacgaat gcgtggcgga 3420 

gatcggggag ggcgtttatg ggaaggtgtt taaggttcgc gatttgaaga acggaggtcg 3480 

tttcgtggcg ttgaagcgcg tgcgggtgta gatcggcgag gagggtatgt cgttttttat 3540 

tattcgcgag gtggcggtgt tgaggtattt ggagattttc gagtatttta acgtggttag 3600 

gtgagttagg gagttgcgtt ttcgttattt ggggtttcgc gcgcgcgggg agggtcgagt 3660 

cgggaatttt taggtttaga aaggtg<;ggg gtagagggcg attggggagg ttttgcgcga 3720 
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tttgttggtt ttatagagtg agagttaagt tttgtcgata gagtcgcgag ttagagtcgg 3780 

tgattttgtt agatagcgag tcgaaagata agttaaaaaa atatgatttg agtaggtttt 3840 

tttgtagttg tgtaatttag aagcgttttt tttttgttta ttttgaagga gggtttgggg 3900 

gaagtgggtg cgttttaggg ataggataag ttaatttgag atatgtttcg ttagttttcg 3960 

-tatatttttc gtggttatgt cgttggggaa tttttgaaag atagagtagg tgagattgga 4020 

gaatgaggat gtcgtgtgat aattagtggg aattttatgg gtggagttaa gtgtaaatta 4080 

ggttaataag atgtagaaat attataatat ttttgttagt tatattttta gttttttttt 4140 

attaaagttg cgtagtaaga atgatttttg gtggtttaag gagtttgagt gttgatgttt 4200 

aattttaaat agcgttataa aaattttttt taaaattgtt atttttgtgt ttgttttatt 4260 

ttatattttt gggcggatat attttttaga ttaatttttt aaagggttaa atgtgagttt 4320 

aagatgtggt atggttagat tataaatgtt ttgtttatgt tttttttaga tggattgtgc 4380 

gagtgggtgg ggagatragta gtttttattt ttttttgtgt ggtaattagt attaaatgaa 4440 

aatgtgtgat ttaggataat ggtgtttttt tagtgatttt ggaatttttt tattgtgtgg 4500 

ttgatattgt ttggttgtgt tatttttttt ttttttaaat agagatgtat agttttatat 4560 

tgaaaaggaa tgtttagaaa tttttgtaaa ttgggttgtg attttatttt tttcgatttt 4620 

gagaaaatgt gtagtagaaa tatttttttt tttatttatt tagggttttt gtttaataaa 4680 

tatatgttta ttgtgtgcgt tgaggatgaa atgatgtaaa taagtatagt taaatatttt 4740 

tgtaaaaagt taaataattt aatattagtt attaaagtgg agagatttaa gattgatttt 4800 

tgaaatatta aattttttat agatttagag agattaaata tttatagaat ttttttqgga 4860 

ggttttaatt tgtttttttt tattttaggt tggtgatttg tgcggtgttt tgtttttaac 4920 

gtagtttaga tttaagttag aaggtattaa agaattataa atatataaat aaggtagtaa 4980 

atggttataa agttaagatc gtattatatt tcggtttagt tagtgaagga gaatgagtgt 5040 

tttagaagat gagattatat ataaaaattt atagatgatt tttttttatg tgtattttat 5100 

ttgttaaaaa taataaaatt cgtttttggg gttttttgtg atattggagg gaaaagggga 5160 

gggtatattt agttttttgt aatttagttg tcgttttagg ggaattattt aatatatttc 5220 

gaaggatgaa ttttattatt ataattttaa gacggaagtt tttgttttat tttgtttgtt 5280 

atttatggag tatttagtta agggtgttta ttatgtatat gaatgaatag tgttttttta 5340 

aacgtagagg tattatagag gtatttttgt ataatattat tgtttataaa gtttacgtta 5400 

t S2!2£ tt a 9aggtatat tttttgttat tgtgtttgta gattttgttt gatttatttt 5460 

atttttttga tttatttttt tgagattgat gtgtttttag acgtttttaa ttgatttatg 5520 

atttggattt tgtattgtga attttaggag gatattttta tgaattgaag ataaaggtat 5580 

tgttttttaa aattattttt acgtgttaag gtgatgatgg tatttatttg ttaaagtcgt 5640 

gagaataaaa tgagatggcg tatgtgaaag tattttgtaa atttgaaagt attttagata 5700 

tttaaaggat tgtttgcgtt ttagtagatt gatggttttt tgttatggag tagttttata 5760 

gggaggggtt aatttatttg attttttttg tagaataaat tatatttaaa tttagatatt 5820 

*5*§S tt: " 9 9 aattaa 9 aa gatttttgag gataattagt ttgtaagatg ttagaaaaga 5880 

tttttagtta cgagttatga aatttagtga tatttgagga aattattgat gatgttattt 5940 

ttatgagtta tagattttta tttttttttt gttttttttt tgtttttttt tttttgtttt 6000 

tgttttgaaa taattattgg tttttatgat ttgtttggtt agattaagaa ttttggattt 6060 

H2S§?H§S 9 aata tttgt ttagataggg ttaggaaata aggatagttg atagtaattt 6120 

tttttttttt aagggagagt tgttaaagtt gggtttttaa atttgtgatt tattattgtt 6180 

^ISSS 31 ** 9 tt1:ta tagt tttaagtttg gttttttaat ggttttttaa aagggtagat 6240 

gtttttataa gtatttttaa gttataagta attgattata attataggtt tttttttata 6300 

gatagttaag tttttaagtt aataggagta taataaatat attagttttt ttaaaagttt 6360 

aagatttaat tttgatggat ttgttagaaa agttgatttg ttattgttgg tattggtatt 6420 

gtttgtatgt atagggataa tgatttgaaa tttttg 6456 

<210> 298 
<211> 4499 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 298 

tttagaggtt attattcgaa ttttggttaa ttgtgaggaa aggttggaag aggagtttgg 60 

agatggattg tggtaatttt ggaagattta tgttttggaa ttcgagagaa agagaggagg 120 

^I*? 1 ^ ta 99 a g a 9 a 9 gggttgttga ggtggggtga gttggaattt gtttttataa 180 

gttatttttg ttttttggtt tttagggatt tgggggtttt tggttttttt ttttattttq 240 

gtttaggatt aagttttttt ttaatttttt tttttgtacg tggattttta gatttagttt 300 

ggtgttttag ttagagttta ggttatagtg gggagcgcga gtgaggagta tgttttttga 360 

t:gggtttttt agcgagggtc gaaaggtaga tgttgaaaag gtgattcgtt tagatttacg 420 

ggattagtcg gggtagtttt gggattttta gagtttttaa agacgatggt ttaaaaatta 480 

ggagtagcgt gagcggttgt cgagttttat tttattcgta agttttcgtt aagtattggg 540 

fcaggaaatat gatagttaag aggattggag cgttgatttt tttttttggg agtgatattt 600 

_ggg a tagttt jttaaggtt^ 660— 
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ttttttggtt tttcgtcgtc gcgtatacgt attagttggg tagtaggaga ttagttaatg 720 

agattgtaaa cgttgtttgt ttttatttat ttatttttga gattgagttt cgttttgtcg 780 

tttaggttgg agtgtagtgg cgcgatttcg atttattgta agtttcgttt ttagggtata 840 

cgttattttt ttgtrttagt ttttcgagta gttgggatta taggcgttcg ttattacgtt 900 

cggttaattt ttttttattt ttagtagaga cggggtttta tcgtgttagc gaggatggtt 960 

tcgatttttt gatttcgtta ttcgtttttt tcggtttttt aaagtgttga gattataggc 1020 

gtgagttatc gcgttcggtt cgttgtttgt ttttagttat tgattgttaa aagattaagt 1080 

tgttggtaag tcgatattaa taattttttc ggaaaaattt tttaattaag ttaataatag 1140 

aaaaatattg gttgtaattg tagatattag ttatttcgaa gatgtgtgag agggaggtat 1200 

agtgggtttc gggtaggttg taggggtgtg tgcgcgcgaa cgtgttgggt tcgggagatt 1260 

gagagtgtgt gtgtaagatt tcgggaggga gttttttttt cggggtgagg agtgcggatt 1320 

gtagtttttt taggttagtt ttcgtttaat attttttata atttttaatt tttaagtttc 1380 

gtatattcgt ttcgttttag gttggatagt tatttttagg gatttttttt tttgttttat 1440 

ttagaaggag gtagttgcga gtcgggcgtg gtggcgggcg tttgtaattt tagttattag 1500 

ggaggttgag gtaggagaat tgtttgaatt tgggaggcgg aggttgtagt gatttaaaat 1560 

cgcgttaatg tattttattt tgggcgatag atcgagattt cgttttaaaa ataaataaat 1620 

aaataaacga agtcgttgcg agcgtttttt tttttttttt tttttgatta gttgagttgt 1680 

atcgattaag gattattcgg ttcgggcggc gggataggga gggtttgggc gagttttttt 1740 

cggagttttg gagttcgtag gagatttgcg ggggcgggtt gcgtttttaa gagagcgagc 1800 

gtttttaaga gagcgagttt tttgttatgg gcggtgttta taaaatgcgt ttttgtgggc 1860 

gatagagggg tgcggtgggg tttaggaggt tcgtgggcga tttcgagtat cgggtcggtt 1920 

tcgagtgagg ggttagtttt ggggttggga ggaaggcgag aggtagcgtt attgtttttg 1980 

atttttcggg cgattttacg aaggaaggta gagggttgag ttttagcgcg gattagttaa 2040 

attcgacggg tttttgtcgg ttattcgttt ttaattttgg ttgggttggg aggggttttt 2100 

tcgtaagttt tttcgttcgg cggaggtttt ttttttttaa ttttcgcgac gggcgtatgt 2160 

tttaagttgt tttcgttcgt cgggttcgtt tgtcgttttt agggtttggt ggttcgcggt 2220 

ggttcgcgtt tatcgtgtcg gttttatcga tttcgattaa cgtaggttcg agatgcggcg 2280 

gttttcgtcg agtgggcgcg gcgttcgggg tttttcggga gtaggtgcga aggacgtgtt 2340 

tggagtttgt cgtttggatt tttggttttt gggtgcgtag taatttttcg cggtaggatt 2400 

gtaatcgcgg aggtttgtag ttgtgattaa attcgcgttt ttttcgtcgg tttcgcggtt 2460 

cgtagtttat tcgggttaat ggtgttaatt ttggattttt tttaatgtcg ggttttgtgg 2520 

tattcgggaa agtttttacg taggaaggtt tagggttttg cgggtgtata tatgggaagg 2580 

tataaaagtt atagatattg tttgtttaaa ttgaagttcg gtggaaagtt atgttttttt 2640 

attttttaaa aaatttaatt ttgttgaagt atagtttata tgtaataaaa tgtatttatt 2700 

ttaagtggat atttagatat atattttgat atagttatat tttattatta tgattaagat 2760 

atagaatatt tttatgtttt taaaaattat tttgtgttgt tattttagat tattttgtgt 2820 

ttttttattt attttatttt atatatttga gttttaggta atcgtcggat tgtttttttt 2880 

tattatagat tatgtttttg tattttttgg agttttatat taatggaatt atatattatg 2940 

agttttgtgt ttgattttta tttagtttaa tgttgttgag attttttttt attgttggaa 3000 

agattagtag tttatttttt tgtgtgtgag atggagtttc gttttgtcgt ttaggttgga 3060 

gtgtagtggc gcgattttag tttattgtaa ttttcgtttt ttgggtttaa gcgattttcg 3120 

tgttttagtt tttcgagtag ttgagattat aggtattcgt tattatattc ggttaatttt 3180 

tgtatttttt taatagagac ggggttttat tatgttggtt aggttggttt cgaatttttg 3240 

attttaggtg atttatttat ttcggttttt ttaagtgttg ggattatagg tgtgagttat 3300 

cgcgtttagt ttttttattg ttgatattgt attgtttggt tgtatcgtaa tttgtttaat 3360 

tttttttttg tttatagata tttggatttt tttttagttt gggttatttt gaattaagtt 3420 

gttaggaata tttgtttaat tttgtgtgat tttatgattt tatttttttt gggtgtatat 3480 

atagaaatgg aattggtggg ttatattaga aaatatattt ttaatgttgt aagaaattgt 3540 

aaaattattt tttaaagtgg ttgaagtatt ttatattttt attaatagta tatgaaagtt 3600 

ttagttattt ttatatagtt attaattttt agtattgtta gtttttttaa ttttagttat 3660 

gttagtgggt gtttagtggt attttattgt atttttaaat tttatttttt aaataattac 3720 

gttgagtatt ttatgacgtg ttcgttttaa aaatgtgtat attttattta gtgtaatgtt 3780 

tgtttaaata ttttatttat ttttattagg ttatttgttt tatattattg agttgtaaaa 3840 

gttttttata tatttaggat ataatttttt tatttatgtt ttgtatatat ttttttatag 3900 

tttgtggttt gttttattat tttttttttt tttttttttt tttttttttt tttttttttt 3960 

tgagacggag ttttattttt gttgtttagg ttggagtgta atggtgtgat tttagtttat 4020 

tataattttc gttttttagg tttaagcgat ttttttgttt tagtttttta agtagttggg 4080 

attatggtta tatattatta tgttcggttg attttgtatt tttagtagag atggggtttt 4140 

tttatgttgg ttaggttggt tttgaatatt tgattttagg tgattcgttc gttttggttt 4200 

attaaagtgt tgggattata ggtatgaagt attatattta ggtttttttg ttttgttttg 4260 

ttttgttttt gtttttgttt ttttaagata gagttttgtt ttgttattta ggttggaggg 4320 

tagtggtatg attttaggtt attgtaattt ttattttttg ggtttaagcg atttttttgt 4380 

tttggttttt ttagtagttg agattatagg cgtgcgttat tatgtttggt taatttttgt 4440 

gtttttt&gt agaaatgggg ttttgttatg ttggttaggt tggtttcgaa tttttgatt 4499 

<211> 4499 * i* .-.s .-I-.. * <•* ~i - .t 
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seq_prot.txt 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 299 

ggttagaagt tcgagattag tttggttaat atgataaaat tttattttta ttaaaaaata 60 

taaaaattag ttaggtatgg tggcgtacgt ttataatttt agttattggg gaggttaagg 120 

taggagaatc gtttgaattt aggaggtgga ggttgtagtg atttgagatt atgttattgt 180 

tttttagttt gggtgataga gtaagatttt attttgaaaa aataaaaata aaaataaaat 240 

aaaataaaat aaaaaagttt gggtgtggtg ttttatgttt gtaattttag tattttggta 300 

ggttaaggcg ggcggattat ttgaggttaa gtgtttaaga ttagtttgat taatatggag 360 

aaattttatt tttattaaaa atataaaatt agtcgggtat ggtggtgtgt gattataatt 420 

ttagttattt gggaggttga ggtaggagaa tcgtttgaat ttgggaggcg gaggttgtgg 480 

tgagttgaga ttatattatt gtattttagt ttgggtaata agagtgaaat ttcgttttaa 540 

aaaaaaaaaa aaaaaaaaaa aaagagagag aaaaagaaaa tgatgaggta agttatagat 600 

tgtgagaaaa tatatgtaaa atataaataa gggagttata ttttaaatat atagaaaatt 660 

tttatagttt agtaatataa aataaataat ttgataaaaa tgagtgaaat atttgaatag 720 

atattgtatt aaataagata tatatatttt taaaacgaat acgttataag atgtttaacg 780 

tagttatttg agaaatgaaa tttaaaaata tagtaaggta ttattgggta tttattagta 840 

taattaaaat taaaaagatt ggtaatatta aaaattggtg gttatgtgga gatgattgga 900 

gtttttatat attattggtg ggagtgtaaa atgttttagt tattttagaa aatagttttg 960 

tagtttttta taatattaaa aatatatttt ttgatatgat ttattaattt tatttttata 1020 

tatgtattta agagaaatga aattataaag ttatatagaa ttgaataaat atttttaata 1080 

gtttaattta aaatagttta aattgggaaa aagtttaaat gtttatggat aagagaaagg 1140 

ttaaataaat tgcggtataa ttagataatg taatattagt aataaaaaag ttgggcgcgg 1200 

tggtttatat ttgtaatttt agtatttagg gaggtcgagg tgggtggatt atttgaggtt 1260 

agaagttcga gattagtttg gttaatatgg tgaaatttcg tttttattaa agaaatataa 1320 

aaattagtcg ggtgtgatgg cggatgtttg taattttagt tattcgggag gttgaggtac 1380 

gagaatcgtt tgaatttagg aggcggaggt tgtagtgagt tgagatcgcg ttattgtatt 1440 

ttagtttggg cgatagagcg agattttatt ttatatataa aaaggtaaat tattgatttt 1500 

tttaataata gggaggggtt ttaataatat taagttaagt gaaagttaga tatagagttt 1560 

atagtgtatg attttattga tatgaaattt tagaaaatat aaaaatataa tttatagtag 1620 

aagaaagtag ttcggcggtt gtttagaatt taggtgtgtg ggatggggtg agtagaggga 1680 

tatagggtag tttgggatga tagtataagg taatttttga ggatatggaa atgttttgta 1740 

ttttgattat ggtagtggag tgtgattgtg ttaaaatatg tatttaaata tttatttaaa 1800 

gtgggtgtat tttattgtat gtaaattgta ttttaataaa gttgaatttt ttaaaaagta 1860 

agaaggtata gttttttatc gagttttaat ttgaataaat aatatttgta gtttttgtgt 1920 

ttttttatgt gtatattcgt aggattttgg gtttttttgc gtggagattt tttcgggtgt 1980 

tatagggttc ggtattgaga ggaatttaaa gttgatatta ttagttcggg tggattgcgg 2040 

gtcgcgaagt cggcggagga ggcgcgagtt tagttatagt tgtaggtttt cgcggttgta 2100 

attttgtcgc gaggggttgt tgcgtattta gaggttagag gtttagacga tagattttag 2160 

atacgttttt cgtatttgtt ttcggaggat ttcgaacgtc gcgtttattc ggcggaagtc 2220 

gtcgtatttc gaatttgcgt taatcggggt cggtgggatc gatacgatgg acgcgggtta 2280 

tcgcgggtta ttaggttttg ggagcgatag acgagttcgg cgggcgagag tagtttaagg 2340 

tatgcgttcg tcgcgggggt tgggaaggag aagttttcgt cgggcgagaa gatttgcgag 2400 

aagatttttt ttagtttaat tagggttagg gacgggtgat cggtaagggt tcgtcgagtt 2460 

tggttgattc gcgttggggt ttagtttttt gttttttttc gtggagtcgt tcggggagtt 2520 

aggagtagtg gcgttgtttt tcgttttttt tttagtttta gggttggttt tttattcggg 2580 

atcgattcgg tgttcggaat cgtttacgag ttttttaggt tttatcgtat ttttttgtcg 2640 

tttatagagg cgtattttgt ggatatcgtt tatgataggg agttcgtttt tttggagacg 2700 

ttcgtttttt tggagacgta gttcgttttc gtaaattttt tgcggatttt aaggtttcgg 2760 

gagggattcg tttaggtttt ttttatttcg tcgttcgggt cggatggttt ttgatcgatg 2820 

tagtttagtt aattagaggg aaggagagga ggggacgttc gtagcgattt cgtttgtttg 2880 

tttgtttgtt tttgagacgg agtttcggtt tgtcgtttag ggtggagtgt attggcgcga 2940 

ttttgaatta ttgtaatttt cgttttttag gtttaagtag tttttttgtt ttagtttttt 3000 

tagtagttgg ggttataggc gttcgttatt acgttcggtt cgtagttgtt tttttttgag 3060 

taaagtagga ggaagagttt ttgggagtaa ttgtttagtt tggggcgggg cgagtgtgcg 3120 

gggtttgggg gttaggggtt gtggggagtg ttgggcggaa gttggtttgg gagagttgta 3180 

gttcgtattt tttatttcgg agaaaaagtt ttttttcgaa attttgtata tatattttta 3240 

gtttttcgga tttaatacgt tcgcgcgtat atatttttgt agtttattcg ggatttattg 3300 

tatttttttt ttatatattt tcgaaatgat tagtatttat aattgtaatt aatgtttttt 3360 

tattattaat ttagttaaaa gattttttcg gagggattat tgatgtcggt ttattagtag 3420 

tttaattttt taataattag taattaaaag taaataacgg gtcggacgcg gtggtttacg 3480 

tttgtaattt tagtattttg ggaggtcgag gagggcggat aacgaggtta ggagatcgag 3540 

attattttcg ttaatacggt gaaatttcgt ttttattaaa aatagaaaaa aattagtcgg 3600 

gSS^astggc gggcgtttgt agttttagtt attcgqgagg ttgaggtagg agaatggcgt 3660 _ 

Sei^e 234 



gtattttgga 
gatagagcga 
attgattggt 
acgtttttag 
ggtattattt 
ttagtatttg 
ggtttttgga 
gtgagtttaa 
tagaggatat 
ggttgggttt 
agagtgaaag 
tatgagagta 
tttttttttt 
taaatttttt 



ggcggagttt 
gatttagttt 
tttttgttgt 
gtgggagcga 
ttaggagaaa 
gcgagagttt 
ttatcgtttt 
gcgagttatt 
gttttttatt 
gggaatttac 
gaaagattaa 
gattttagtt 
tttttcgaat 
ttttagtttt 



gtagtgagtc 
taaaaataaa 
ttagttagtg 
cggagagaat 
aagttaacgt 
acgagtggaa 
tgggaatttt 
tttttagtat 
cgcgtttttt 
gtgtagaaaa 
aggtttttaa 
tattttattt 
tttaggatat 
tttttatagt 



seq_prot . 
gagatcgcgt 
taaataaaag 
cgtgtgcgcg 
ttgggttttt 
tttagttttt 
tggaattcgg 
gggagtttta 
ttgtttttcg 
attatggttt 
agaagttaaa 
atttttggaa 
tagtagtttt 
gggtttttta 
tgattaagat 



txt 

tattgtattt 
taaataacgt 
gcggcggaga 
tagttttgaa 
ttggttatta 
tagtcgttta 
gagttgtttc 
attttcgttg 
aggttttggt 
gagaaatttg 
gttaaagggt 
ttttttttag 
gaattgttat 
tcgaatggtg 



tagtttgggc 
ttgtagtttt 
gttagaagag 
ggttgtttta 
tgttttttgt 
cgttgttttt 
gattggtttc 
aaaaatttat 
tggagtatta 
attttgggtt 
aggaataatt 
gtggtatggt 
agtttatttt 
gtttttaaa 



<210> 300 
<211> 4500 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 300 

ttatgtgagg 
gaattaattt 
aacgtttgtt 
ttggttttgg 
atagtaatgt 
taggttgagt 
gtcggggata 
attagagttt 
gtttttgtta 
ttgttttttg 
tatggaggtg 
agggatttgg 
tgcggtttta 
taaaggaagg 
tgtatattaa 
tagagatatt 
taagagggta 
tttaaagagt 
ttgatttggg 
atgagagttt 
agatggagat 
ttaatgtttt 
tatatagaga 
atttttgtat 
ttagttgaat 
gtagggatat 
atatatatag 
atattaaatt 
agttgttgag 
aaacggttta 
ggatagtttt 
cggagggtag 
ttgttttttt 
ggattttttt 
gggaggaaaa 
■ tttggagtta 

gggggttatg 

tagtgtggtt 
tggggtttta 
tatgtggggc 
ttttggagta 
atttttaggt 
ttatttattt 



atatagggta 
tgtttatatt 
gtttatgtta 
gtaggttttt 
gtttattttt 
aggggtattg 
ttgggtattt 
ttataaatat 
aggacgttcg 
ggaggagaaa 
gttaggatag 
aaagggattt 
ggatggttgt 
atagtgtgga 
gcgtttttat 
aaggttattt 
tttttgtggt 
tgtgttgggt 
ggtgtttatg 
tgggggtaga 
ggtttagggg 
cggtttagtt 

gggttggatt 

ttttttttta 
ttaggaaatt 
atgtatagga 
agtgacgtgt 
atttgtttat 
gagggtgagg 
gaggttttag 
ttttagtatc 
ttttgtttag 
taggaacgtt 
gagatttgtg 
atttatgtgt 
gggttgacgt 
taggtattaa 
ttattttaag 
attattttgt 
ggttataagt 
ggtttgggtt 
. tagaaaattt 
atttatttat 



acgatagtta 
tggattttaa 
tttggtttgt 
tatgtttttt 
tgggttgtgg 
taaatttgtt 
tgtttgtagt 
agttaaacgg 
attaaggttg 
ttaaaatata 
gagtttagtt 
ggtttagttt 
tgggattttt 
gtgtgttgta 
atagtgaagg 
aggtattaag 
aggtgggtag 
ggttagggag 
tgtttattta 
gttggttcgg 
tatgtttgtt 
taagtcgtat 
ttatacgtat 
tttttaagta 
gattgttatt 
tcgtgtttac 
atttattatt 
aattatttat 
ataggtcgtt 
cgagggtgta 
gttgatatag 
ggtttttttg 
tcgttcgagt 
agtagatgaa 
gatttatgat 
ttaggatttt 
ggtaaggtac 
ttgtagggta 
acggttaagg 
ttgtagttgt 
ttcggggtat 
tttgagaagg 
ttttttttta 



tttatagtta 
atttttagtt 
ggtagattaa 
gtgttttagt 
tgagtcggtt 
tttgttttta 
ttttacgtta 
agtagaggta 
gtaaggaagg 
aatgaaaaag 
gtaggtatag 
acgtggtagt 
agagatggat 
agttttaatc 
cgaggttgta 
gtgatgtagt 
tgggagttag 
gttttgattt 
tgggagatta 
gcggtttttt 
tgcggttttg 
aggaaagtta 
ttgtggtagg 
atattttttt 
tatttatata 
gtttgggttg 
tatatatata 
atatatatta 
atttttattt 
tgcgtgttcg 
tttttattgt 
gttttaggaa 
agtaagtgat 
gttggggatt 
gtttttttag 
aatatttttt 
gagcgtgtga 
gagtacgaga 
tttaagtttt 
tttttttgtt 
ttttatattt 
gatagtagga 
ataaatattt 



aggaagagag 

tttagaattg 

tacggttatt 

tttttttatt 

aagatgtgaa 

tattttttta 

gtttattttt 

ggtagggttt 

tggttagggt 

ggtttcgggg 

ttttaattta 

tattttttta 

ttttacggtt 

gatattgttt 

ggttatgttt 

tatcgaggtt 

tgtgagttgg 

aatgtttagt 

ggaggtagag 

tttatagatt 

gttttttaag 

ttttgtggat 

tttttggaga 

gggaaaatgt 

tttagagttt 

tatgttcgta 

tattggttat 

tttttatatt 

tttttatagg 

tcgagtgttt 

tttttgaagt 

gggttaatag 

tgtttagttt 

agtagatata 

acgtggggcg 

tgtatttaag 

gcgtgtagta 

gatttaacgt 

aagagttggg 

gtatatttat 

tatttgttta 

tttgggattt 

tggGgagcgt 



gtttttaata 
cgcgaggaga 
tattagtcgg 
gtagaatggg 
gagtttggag 
cgatttagag 
aaatatttta 
gttttcgata 
ttagagttta 
ttgagtaggg 
tgtgattttt 
attttttaat 
tttaggggtg 
gagtatttgt 
agtttataga 
atacgagaag 
ggagcggaga 
attaggtgag 
gtaggttggg 
ttatagtcgg 
agttttgaga 
taagtttggg 
tttttggaat 
ttatttggtt 
aattggggtg 
gtrtgggggat 
atatatgttt 
tatattttat 
atgagttgta 
tagcgtttcg 
gtagataaaa 
gtagtaaggg 
cggtttcggg 
gttttaggtc 
atgtaggtgg 
gggtggattt 

gatagtatgg 
ggtttttgga 
tagtttttcg 
ttcgagttta 
tttttaaata 
ttttatttat 
agtggtttat 



3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4499 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 



Sevte 235 



gtttgtaatt 

gagattagtt 

ggtatggtga 

tgaatttagg 

taataagagc 

ggtatagtgt 

atraggaatt 

aataaagttt 

ggaggtggtt 

cggtacgagt 

ggttttatga 

tatggtttta 

ttgttttagg 

tcgggggtgt 

gtttttgttt 

tagggtttta 

tttcgttttt 

ggcgtatatt 

ttggttaggt 

gttgggatta 

gttttttgtt 

ttgggggata 

ttttgttttt 

ggtaatgttt 

ttattttgta 

taatggttta 

gtagttatgg 

attttgtaag 

ttagttttgt 

tgttatatgt 

aatttttttt 



ttagtattta 

tggttaatat 

cgtatatttg 

aggtagaggt 

gagattttat 

taattgttag 

rggaggrrag 

ggttatggtg 

tagttaaagt 

taagtcgtag 

agttcgtttt 

gtatatattt 

aagtgatttt 

gtggaggttc 

tttcggttat 

ttttgttatt 

cgggtttaag 

attatattcg 

tggttttaaa 

taggtatgag 

ataattaatt 

gggatgggag 

agttgttttt 

atgttttttg 

gagttagata 

tagatattgt 

ggtcgtttgt 

gtgagacgtc 

tttttttttg 

tatatggagt 

ttaattatat 



tggattgaga agcggggtta 



gggaggttaa 

ggtgaaattt 

taattttaat 

tgtaatgagt 

tttaaaaaaa 

taatatttag 

aaatatagtt 

gggttattta 

tttgaaggat 

agtttagggg 

ttgttgttaa 

cgattttatt 

attttttaaa 

ggaagttggt 

ggttttttcg 

taggttggag 

cgattttttt 

gttaattttt 

tttttgattt 

ttatcgtgtt 

tataatagtc 

gaaatttagg 

gttgagaatt 

taatttttgg 

taatggggta 

ttaatttgta 

atatatatgt 

ggataaggtc 

tttttagtta 

atatatgtat 

atttagtatt 

taataaaatg 



seq_prot. 
ggtgagtgga 
tgtttttatt 
tatttaggag 
tgagattata 
aaaaaaaaaa 
gtaaggaaga 
tttttgtaga 
gtttgaggta 
gtataagagt 
ataggttagg 
tgaatttttt 
attttcgttt 
tataatgcgt 
ttgtttgttt 
tttttttttt 
tgtagtggcg 
gttttagttt 
gtattttttt 
tttgattcgt 
tggttatatt 
gaaggaggtt 
attagataga 
gggagttaga 
ttttattaaa 
ttagaaggga 
atttgggagt 
ttttatttat 
gtttttagtt 
ggtttattat 
ggagttattt 
tttgagaaaa 
gaatttgtraa 



txt 

ttatttgagg 
agaaatataa 
gttgaggtag 
ttattgtatt 
taaaaaagtt 
tgtttatttg 
taatttgtag 
tttaggggag 
gagtgagtta 
ttggagaagg 
atggaggttt 
tcgtatatta 
ttttttgtaa 
ttgtagttgg 
gtgtgtgtgt 
cgattttgtt 
tttgagtaga 
agagacgggg 
ttgttttagt 
tatattttgt 
atgaggtttt 
ttataaaatc 
aaaaggaggt 
ttagtagtat 
attgaatttt 
taatgtggtt 
tcggacggag 
cggtgagtgg 
atagaggacg 
taataggtag 
tgagatttta 
taatcgcgta 



ttaggagttc 2640 

agattagtta 2700 

gagaattatt 2760 

ttagtttcgg 2820 

tcgtgggttg 2880 

gtttttgttt 2940 

taggatagaa 3000 

gttttttgga 3060 

aggaaggggt 3120 

gatgttgaga 3180 

ggaagtatcg 3240 

gttggggatg 3300 

tttaattttt 3360 

tggttgaagt 3420 

ttttttgaga 3480 

ttattgtaat 3540 

tgggattata 3600 

ttttattatg 3660 

tttttaaaat 3720 

ggggattagg 3780 

agcgggattt 3840 

gtttatagat 3900 

taaattttag 3960 

aattttagtt 4020 

tttttaatta 4080 

tttttttttt 4140 

tatggggttt 4200 

agtgtcgcgt 4260 

ttagtgtggt 4320 

gtaggagtat 4380 

gtaatttaat 4440 

gttaaattag 4500 



<210> 301 
<211> 4500 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 301 



ttaatttaat 
attgaattat 
atgtttttgt 
gttatattga 
gcgcggtatt 
gggttttatg 
aagaggagaa 
taattaaagg 
agttgggatt 
ttgagatttg 
gtttgtgagc 
aagtttcgtt 
tttggttttt 
attttgggag 
tatggtgaaa 
tgtagtttta 
gttgtagtga 



attttaatta 

gggggttaag 

tatttttagt 
cggtgttttt 
ttttagtatt 
gttttttttt 



tgcgcgattg 
taaaatttta 
ttgtttatta 
cgttttttgt 
ttatttatcg 
tttcgttcgg 
aattatatta 
gaaatttaat 
gtgttgttag 
gttttttttt 
ggttttgtaa 
ggggttttat 

atagagtgtg 
gttgaggtag 
tttcgttttt 
tttatttagg 
aktaagatcg 
atatatatat 
ttagttgtaa 
ttgtaggaag 
tggtatgcgg 
aggtttttat 
ttttttttaa 
tggtttattt 
tttt tttqaq 



ttataaattt 

tttttttaga 

ggatggtttt 

gtggtgggtt 

ggttgggggc 

gtgaatgaga 

atttttagat 

ttttttttaa 

tttaatggga 

tttggttttt 

tttgtttagt 

agtttttttc 

ggtgtggtta 

gcggattaag 

aggaaaatat 

aggttgaggt 

cgttattgta 

aaaaaaaaaa 

gagtaggtag 

acgtattgta 

agacggaagt 

gaagggttta 

tttgatttgt 

atttttgtgt 

tgttttaggt . 



tattttatta 

agtgttgaat 

atgtatgtgt 

tggttggagg 

ggttttgttc 

atatgtgtgt 

tataaattaa 

tgttttatta 

ttagaggtta 

aatttttagt 

tttgaatttt 

gattgttgtg 

ggtacggtgg 

aggttaggag 

aaaaattagt 

aggagaatcg 

ttttagtttg 

cggaagagtt 

gttagttttc 

tttaaaaggt 

gatgggatcg 

ttggtagtag 

tttttgggtt 

attttttaag 

tggatgattt- 



tagtttcgtt 
atgtaattag 
gttttatgtg 
taggagagag 
gacgttttat 
gtaggcggtt 
ataatgtttg 
tgtttggttt 
taaggagtat 
aggggtagtt 
tttttatttt 
ggttgattgt 
tttatatttg 
tttgagatta 

cgggtgtgat 

tttgaattcg 
ggtgatagag 
ataatcggaa 
gggtttttat 
gaggttattt 
gggtgtgtgt 
aggacgggtt 
ttgcgatttg 
gttttagttg 
tattataat-t- 



ttttagttta 
aggaaaaatt 
atatgtgata 
gtagggttgg 
tttgtagggt 
ttatgattgt 
tgggttattg 
tgtaaggtaa 
gaatattatt 

gggggtagaa 

tgttttttaa 
ggtaggaggt 
taattttagt 
gtttggttaa 
ggtatgcgtt 
ggaggcggag 
tgagattttg 
gagtaagggt 
atattttcga 
tttggagtag 
tggagttatg 
ttatggggtt 
attcgtgtcg 
ggttattttt 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500- 



ttttgttttg ttatagatta tttgtagagg 
gggtaaaaat tagatagata ttttttttgt 
tagtttacga gatttttttg tttttttttt 
tcgagattgg agtataatgg tgtgatttta 
agtgattttt ttgttttagt tttttaagta 
tggttaattt ttgtattttt agtagagata 
gaatttttga ttttaaatga tttatttatt 
atgagttatt acgttcgtta agatgtttat 
atgaatgggg gaattttaag ttttgttgtt 
tgtttgagag tgggtaggtg ggatatgggg 
tgagttcggg atggatatat agtaggaggg 
cgggggattg tttagttttt agaatttagg 
tttagagatt acgttgggtt tttcgtgttt 
ttatgttgtt tgttgtacgt ttatacgttc 
aggtttattt tttgggtgta gagaaatgtt 
ttatttgtat cgttttacgt ttgggagggt 
ggtttggagt tatgtttgtt aatttttagt 
ttcggggtcg gagttaggta gttatttatt 
tttttgttgt ttgttggttt tttttggggt 
ttttgtttgt attttagagg gtagtggggg 
cgggacgttg gaatattcgg cggatacgta 
tgtagtttat tttatgaggg aggtgggagt 
gtagggtgtg agtgtggagg tagtgtgtgt 
ggatatgtgt gtgattaatg tgtgtgtgta 
gttttttagt tgcgggtatg tagtttaggc 
tattttagtt gaattttgag tgtgtgagtg 
gattaggtga gtattttttt aaaggaatat 
gttttaggag tttttaggag tttgttatag 
tttagatttg gtttatagga tggttttttt 
ttttagagtt tttggagagt tagggtcgta 
tcggttatag aatttgtgag gaggaatcgt 
tttaatttgt ttttgttttt tggtttttta 
tttatttggt gttgggtatt gagttagagt 
ttttcgtttt ttaatttata ttaattttta 
tttttcgtgt ggtttcggta gttgtattat 
tttgtgagtt ggatatggtt tgtagtttcg 
gtaggtgttt aagtagtgtc ggttggggtt 
tatttttgga agtcgtgaga gtttattttt 
gttaggaagt tagagaagtg attgttacgt 
aaaaattata taaattgggg ttgtatttgt 
ttttgtttag tttcggagtt tttttttatt 
tgagttttga gttttagtta ttttttttgt 
tgtcgggagt agattttgtt tgtttttgtt 
tggggtgttt aggagtaagt tggcgtgaga 
ttttgagtcg tggggaagtg taagaatagg 
ttttaggttt tttatatttt aatcggttta 
tttattttat agtagaggaa attgaggtat 
tcgattagta agtagtcgta ttggtttgtt 
ttttttcgcg tagttttgaa ggttgggagt 
tgttggaggt tttttttttt tggttgtgga 

<210> 302 
<211> 6499 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic 

: . . -.1.. 

<400> 302 " : ; 

tattgtatat ttttatttat aattggtagt 

attttaatat attattattt ttatttagat 

taataaataa aagtttttag aaattgatgt 

tgattatagg agttatttta tttatattaa 

ttttttataa gaaaaattta atttcgggag 

aggtagaagt tgtagtgagt cgagattgta 

gaaattttgt tttaaaaaaa taaataaata 



seq_prot . txt 

ggttatgttt ttaattttta aatttttggt 1560 

ttgggtgttg ttagtagtta gtattgtgtt 1620 

tttttttgag atggagtttc gtttttgttg 1680 

gtttattgta atttttgttt tttgggttta 1740 

gttgggatta taggtgtgcg ttattatgtt 1800 

aggttttatt atgttgatta ggttggtttc 1860 

ttggtttttt taagtgttgg aattataggt 1920 

taagaaaaaa atgaatgaat gaatgaatga 1980 

tttttttaga gagttttttg gtttaagagt 2040 

gtgtttcggg aatttaggtt tgttttaggg 2100 

atagttgtag gtttgtgatc gttttatata 2160 

ttttggtcgt ataggataat tgagatttta 2220 

tgttttgtag tttgaggtgg gattatattg 2280 

gtgttttgtt ttagtgtttg tatggttttt 2340 

ggggttttag gcgttagttt tggttttagg 2400 

attatgaatt atatatggat tttttttttc 2460 

tttatttatt tatagatttt aggagaattt 2520 

attcgagcgg agcgtttttg gggaaggtag 2580 

tagggaggtt ttgagtagag ttgtttttcg 2640 

ttgtattaac ggtgttgggg agaattgttt 2700 

tgtattttcg ttgaagtttt tgagtcgttt 2760 

agcgatttgt ttttattttt tttagtagtt 2820 

gtgagtggtt gtgagtaaat gatttagtgt 2880 

agtgatgggt gtacgttatt ttgtgtgtgt 2940 

gtgggtacgg ttttgtgtat gtgtttttat 3000 

ggtggtaatt aattttttgg atttagttga 3060 

tgtttgagaa tgaagggaga atgtaggggt 3120 

atgcgtgtgg agtttagttt tttttgtgtg 3180 

gtgcggtttg ggttaagtcg ggggtattgg 3240 

ggtaggtatg tttttgagtt atttttattt 3300 

tcgggttagt tttattttta ggatttttat 3360 

taagtggata tatgggtatt tttaggttag 3420 

ttttttggtt atttagtata gttttttggg 3480 

ttatttattt gttatagaga tgtttttttg 3540 

tttggtgttt gggtggtttt ggtgtttttg 3600 

tttttattgt atgaaagcgt ttagtatata 3660 

tatagtatat tttatattgt tttttttttg 3720 

gggagtttta gtagttattt tggggtcgta 3780 

gggttgagtt aggttttttt ttaggttttt 3840 

agttgggttt ttgttttggt tatttttatg 3900 

tatgttttaa tttttttttt tagaaaatag 3960 

tagttttggt cgggcgtttt tagtaagggt 4020 

tcgtttggtt gtgtttgtag gggttttggt 4080 

gttgtaagta aggtgtttag tgttttcggt 4140 

agtaggtttg tagtgttttt gtttagtttg 4200 

ttatagttta agaggtgggt atattgttat 4260 

agagggtata ggaagtttgt ttagggttag 4320 

atagattagg tggtataaat aataggcgtt 4380 

ttgaggttta ggtgtgggta gggttggttt 4440 

tggttgtcgt tgttttgtgt ttttatatgg 4500 



j 

dna (Homo sapiens) \ ' s ; 1 ; 



taattattga atatatatag atataaatgt 60 

tittgtgtaat atttaattat ttaattatat 120 

atttagtata attaag'gggt aatttgttta '180 

tttgatgtga ggagataatt aaattttttt 240 

gtttaggtag gagaattttt tgaatttagg 300 

ttattgtatt ttagtttggg taataagagt 360 

akataaaata aa'aagtaaaa tttaattttt 420 
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atattttttt tttaaaaata ttatattatt gaaatagaat agaattttgg attttaatgt 480 

ttttatgttt tttgataatg taaagattat attaagaata atattgattt tttgtgtata 540 

tgtgtgaaat tatgtcgagt tgatgtattt agaaaagaaa ttgtagcgat attgggtata 600 

gaatttttaa ttttaatttt ttttttgttg tggaatagaa atttagtttt attttttttg 660 

tttataatat ataagagtag ataattagat atgaggtgat taaatatata aattgatttt 720 

atagttatta tggaaagatt ataatatgtt tggacgaggt gggggaaaga gttttttttt 780 

gaattgagat ttgaataggt tttagataga ttaaaaattt atggtttgtt ttttttaatt 840 

ttttattatt gattatttgg gagaattaaa atggtttgga ttataaagtg attataatta 900 

gtagggtagg tatattaaat aaaaatgggg tgagtgtgtt tggatgggaa aaattggggt 960 

gtagagaaat attttttgtt ttggaggttt cgtttatagc gagtttgagg atgtagatga 1020 

tatggaatat gagttttatt aggatatata tttttttgta tagttttttt tgattagtgt 1080 

tgtttattgt tttaggtatt atttttagtt attttgtgtt aggtatttag gatagttttg 1140 

tttaaagtat gattgttttt tggttagtga agaaatagta aagaaatgag agtttgtttt 1200 

tgaagttttt gtagtaattt aatatttttg taatgtcgtg gtatatgttt tatatttttt 1260 

aaaagtattt atttgtagat tagaaataaa agagtttttt agtatagata gtttgagaga 1320 

tgttattgta gagtgttttg ggaggatggg atttttttag atattgacgg ttttttggag 1380 

tggtgttgga gtttgttgag atagggacgc ggggattgtt gtggaatagt atgttgtgaa 1440 

aataggtata gatgggttgt tgttacgtat ttagtatttg gtaggtttag gtattttgtt 1500 

tcggggagat agagtaatag ttgttatttt taaaaatgtg atataaatat gattttcgtt 1560 

ttttattgta tgtgtatgta tgtgtgtatg cgtatgtgtg gttggtgtgt gggtaaataa 1620 

tattggtgag ggagttagta gtgtagagac gtagagatga atttaatgaa tttatatttg 1680 

gaggtttttg aaaggagttt tgtgtatgta taattaggaa gtagaagtgt agtcgtaggt 1740 

tagttttatt ttttggtgga aatagtaaat tatagtttta gtatttttta tttgtagtaa 1800 

aagaaatgta tgttgtattt gggaggggta ggagaatttt ttagaatttg gtgggtttat 1860 

tagttagggt tgtgtagaga aatagaatta atgagataaa tatgttagat agatgataga 1920 

tagatagata gatagataga tagatagata gatagataga gatgatagat gatatagata 1980 

gatgatagat gatagataat agatgatata tagatagata tagtaaatag atttatagat 2040 

atgatagaga gagaatgaga gagatttata ttaaagaatt ggtttatatg attgtggggg 2100 

ttgtaaattt tatatttgta gaatttgttg gtaggatgga aatttaggta agagtggata 2160 

ttgtagtttt gagtcgaaag tttatagggt agtaagttgg aaatttaggt aaggttttta 2220 

ggttgtagtt ttaagaagaa ttttatattt tttaggaaat tttagtttgt gtttttaggg 2280 

tttttaattg attgaatgaa gtataaatat tatggaagga aattcggttt atttaaagtt 2340 

tgttggatta agtattaatt ttaattaaaa aacgttttta tagtaatatt tagattattg 2400 

tttagttaag tatttgggta ttatagttta gataagttga cgtataaaat ttattattat 2460 

aagggaggaa tttatatata tattttattg tagtaaaatt tttatagttt aacgttttgt 2520 

atttttttta gaaggaaacg ttttagtgta gagttgaata tcgtattttt tcgtagttta 2580 

tataaatttt atataatttt gatttgttga gttttttttg aaaattggat aatttaagtg 2640 

tttatgaaag gttttaattg tttaagaaaa tagattgttt gtgtgaatta taaagaaaaa 2700 

gggattttag aaggaatatt ggtatttcgg gaagtaggtt ggggtaaggt ttgtataagt 2760 

gaattagaag ttttaggtac gaagttagta tttttgttat ggtttatgtt agttgagttt 2820 

ttatttttgt ttcgtgtttg gtttttagat tttgtgagtt cgtttcggga tagggttacg 2880 

gttaatttag tagagatttt ggtaaagtat ttcgggaatg agagtgagaa aggtttaaag 2940 

tagttaaggt ttaaagataa ttttcgattt tattttttta aatgttagtt tttaagattt 3000 

taggtttttt ggattttaat ttttatgtta ttttaaaggt tttaaatttt agttttagta 3060 

gttttagttt atagttttag gtttttagag attattttag taatttttaa atattatttt 3120 

agggatttag ttttgatagt taagattatt ttttttataa aataatcgta gatattaggt 3180 

gaagattttt aaagttttaa ggaaatttta aattttattt tgaggatatc gattttatta 3240 

aagtttcgag gaaattttaa attttatttc gaggatatta gttttatcgt taggaatcgg 3300 

gattttaatt tatagtattc ggatttcgag aatagaggtt tcggggttaa atgggttgaa 3360 

tttagtattt tatttttacg ttttcgggtg gatagtaatt tttttttatc gcgttttttc 3420 

gtgggtttta ggttttatat ttgagggatg tggttttttt tttttatatt atgttggtta 3480 

agaatgattt atatagttat tatggaatat tgtatggaga taaggagtgg ttgtgttttt 3540 

gtttgaaggg tttattttat ttatgaggta ggatgatttt taaggtagtg ttatttatta 3600 

gagttaaagt ttaggtaagg tgtttagggg ttagttgtaa tatgaagtat attagttaag 3660 

gtagttttaa agagtattgt tagaggaaga atttatattt ggttgtttaa tgaggttgaa 3720 

aggtaaattt aggttaagta ggttaagtaa agaggttagt ttagtatttt gggaggtaga 3780 

ggtaggtgga ttatttgagg ttaggagttc gagattagtt tggttaatat ggtgaaattt 3840 

tgtttttatt aaaaatataa aaattagttg ggtgtggtgg tacgcgtttg taattttagt 3900 

tatttaggag gttgaggtag gagaattgtt tgaattaggg aggcggaggt tgtggtgagt 3960 

taagattagg ttattgtatt ttagtttggg tgatagagtg aaatttcgtt ttaaaaaata 4020 

aataaataaa taaataaaaa taaaaaagaa gttagaaagt ttatgaaaaa tatttggagg 4080 

gaagaagtta gtttaagaat aaaagtatta ggttagggcg tttaagattt ttaaggattg 4140 

gttttggttg tgtagatatt taaagagatt gtgtaaggtt ggataggagg gtgagttggt 4200 

taaaaaggtg atttgttaat tgtggaatta attatcgttt aatgttttgg ataattgagt 4260 

tt^attcgta atggttaggt ttattattgt aaattagttt tgaagtataa ataagtttat 4320 

taaattattt tttttgttta tatttttttt tattttttga tttatgtaat ttttgtagtt 4380 

tatatttaaa taaaaaataa taatatggat ttgttattat gtatgtcgtg gtttattttt 4440 

aattgtaaag -tt-t^ 4500 - 
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ttgttattta ttatgagtgt agaaaattgt ttattagaaa cgattttaaa tagtaaagga 4560 

tgtatatttg aaataatagg gttgtaatga aaaagtatat attttttaat tttatttaat 4620 

atattgtatt cgtagagtaa aagatataga aaagtatttt atgttttaaa aatgtaagtt 4680 

tgagagtgaa aaatagagat aataaaaatt tttttttaaa tattattggt ggaaaagttt 4740 

taaagtaata gaaagtatat ataaaatgtt tgtgaatttt tgtgtaataa aattatatat 4800 

atatatgtgt gtgtgtatgt atatttgttt tttttgtttt gttttgtttt tttttttttt 4860 

tttgagacgg agtttcgttt tgtcgtttag gttggagtgt agtggcggga tttcggttta 4920 

ttgtaagttt cgtttttcgg gtttacgtta tttttttgtt ttagtttttt aagtagttgg 4980 

gattataggc gttcgttatt acgttcggtt aattttttgt atttttagta gagacggggt 5040 

tttatcgttt tagcggggtt ggtttcgatt ttttgatttc gtgatttgtt cgtttcggtt 5100 

ttttaaagtg ttgggattat aggcgtgagt tatcgcgttc ggttaatgta tgtatatttg 5160 

taatttagat ttgtaaatgt agttatattt atatttatat attaaatgta ttaaaaagat 5220 

ttattaagtt ttttatagtt agtttataaa tttttagttt ttttatatat tcgtattttt 5280 

tgttttaata tgatataata gttaaaaaaa tttaaaaatt aataattaat taagtttttt 5340 

ttttttgtag attatattag ttattattta gaaatattaa tttattgttt taaaaataat 5400 

ttttttagag atgtttggta tttagtatat gaaattattt ataattgaat aatataaata 5460 

aaatagaaaa aagaatttat ttttgggata tttgaagaaa gtgatttata aatttagttt 5520 

atttttgagt tttattattt ttaggaaaat ggtttttatg ttatttttat gttatgtttg 5580 

atttgttttg tttgagtttg ttttttagaa agattgtgat ttttggtttg ttgaatatga 5640 

aatttatatg attatttttg ggaaaaattt tttttttagt gattgtaagt taggttttgt 5700 

ggtcgagaag tattagttgt atttggttga ttgattgtat tattttgttt ttatgttgtt 5760 

tataaagata tatttgagat ttggtaattt ataaaagagg tttaatagat ttatagtgtt 5820 

aagtgggtgg gggtgtttta taattatggc ggaaggtgaa aggtatgttt tatatggtgg 5880 

tagataagag aagagagttt gcgtaggaaa attttttttt ataaaattat tagattttat 5940 

gagatttatt tattattata agaatagtac gggaaggatt tatttttatg atttaattat 6000 

ttttaattag gattttttat aatatgtggg aattatggga attataattt aagataagat 6060 

ttgggtgagg ttatagttaa ttatattatt gatttttatt atattataaa gaattatttt 6120 

gtagagagaa tttgtttttt atttttaata atttgtttat tttggatatt ttatataaat 6180 

agagttatat aatatttgtt gttggttttt ttatttagtg taatgttttt aaggtttatt 6240 

tgtgttagag tatgtatttt tgttttattt ttttatatta ttaaataata ttcgattgta 6300 

tggatgtgtt atattttatt tgatttttta ttagttgata gattgttggg tttttttatt 6360 

ttttggttat tacgaatgat ggtattatat gaatatttat atataagttt ttatgggtat 6420 

atgtttttat tttttttggg tatatattta ggtgggggat tgttgttata tggtaatttt 6480 

atgtttaata ttttgagga 6499 

<210> 303 
<211> 6499 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 303 

tttttaggat gttaaatata aagttattat atgatagtaa ttttttattt aggtatatat 60 

ttaagaaaga taaaaatata tatttatgaa aatttgtgtg tgaatgttta tatagtatta 120 

ttattcgtaa tagttaaaaa gtagaaaaat ttaatagttt gttaattgat aagagattaa 180 

ataaaatgtg gtatatttat ataatcgaat attatttagt gatgtaaagg aatgaagtaa 240 

aggtatatat tttaatatag atgaattttg aaaatattat attaaatgaa aaagttagta 300 

ataaatatta tataatttta tttatgtgaa atatttagaa taaataaatt attagagatg 360 

gaaagtagat tttttttgta agatagtttt ttataatatg atgaaagtta gtgatatggt 420 

tggttgtgat tttatttaaa ttttattttg aattgtagtt tttataattt ttatatgttg 480 

tggagggttt tggttggagg taattgaatt atgggggtgg gtttttttcg tgttattttt 540 

gtgatagtga gtaagtttta tgaggtttga tggttttata agaggaaatt tttttgcgta 600 

agtttttttt ttttgtttgt tattatgtga gatatgtttt ttatttttcg ttatgattgt 660 

gaggtatttt tatttatttg atattgtgag tttgttaaat tttttttgta aattgttaag 720 

ttttaggtat gtttttatga gtagtatgaa aatagagtaa tatagttagt taattaagta 780 

tagttgatgt ttttcggtta taggatttgg tttgtagtta ttaaaaaagg aatttttttt 840 

aagagtgatt atgtaaattt tatatttaat aggttaagga ttataatttt tttggaaaat 900 

agatttaagt aagataagtt aaatataata tagagatgat atagagatta tttttttaaa 960 

aataatgaga tttagggata aattgagttt gtaggttatt ttttttagat gttttaaaag 1020 

taagtttttt tttttgtttt gtttgtatta tttaattata aatgatttta tatattaagt 1080 

gttaaatatt tttagagaaa ttatttttaa aataatgaat taatgttttt gaataatggt 1140 

tgatatagtt tataagaaaa aaaaatttgg ttaattatta atttttaaat ttttttaatt 1200 

attatattat gttaagataa aaaatgcgaa tatgtgaaga agttgggaat ttatggattg 1260 

attatgagga atttagtgaa tttttttggt gtatttgatg tgtggatatg aatgtggttg 1320 

tatttataaa tttgagttat aaatatatat atattggtcg ggcgcggtgg tttacgtttg 1380 
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taattttagt attttgggag gtcgaggcgg gtagattacg aggttaggag atcgagatta 1440 

atttcgttaa aacggtgaaa tttcgttttt attaaaaata taaaaaatta gtcgggcgta 1500 

gtggcgggcg tttgtagttt tagttatttg ggaggttgag gtaggagaat ggcgtgaatt 1560 

cgggaggcgg agtttgtagt gagtcgagat ttcgttattg tattttagtt tgggcgatag 1620 

agcgagattt cgttttaaaa aaaaaaaaaa aaataaaata aaataaaaaa aataaatata 1680 

tatatatata tatatatata tatatgattt tattgtataa aaatttatag gtattttata 1740 

tatatttttt attgttttga gattttttta ttaataatat ttagagaaag gtttttattq 1800 

tttttgtttt ttatttttag atttatattt ttaaaatata agatgttttt ttatattttt 1860 

tattttacgg gtataatgta ttgaatgaga ttgagaaata tgtatttttt tattataatt 1920 

ttattatttt aaatatatat tttttgttgt ttagaatcgt ttttagtaag tagttttttg 1980 

tatttatggt gaatagtaaa gcgttttttt ttgatatgaa gattagtatg ataggagaat 2040 

aattagaaat tttatagtta gaagtaaatt acgatatgta taataataag tttatgttat 2100 

tattttttat tztagatatga gttatagaga ttatatgaat tagggaataa aagaaaatgt 2160 

aaataggagg ggtagtttaa tgaatttgtt tatattttaa gattgattta taatggtaag 2220 

tttaattatt gcgagttaga tttagttatt taggatattg ggcggtggtt gattttatag 2280 

ttagtagatt atttttttgg ttagtttatt tttttgttta gttttgtata atttttttaa 2340 

gtgtttatat agttaggatt aatttttggg gattttagac gttttgattt aatgttttta 2400 

tttttggatt aatttttttt ttttaaatgt tttttatggg ttttttgatt ttttttttat 2460 

ttttatttat ttatttattt attttttgag gcggagtttt attttgttat ttaggttgga 2520 

gtgtagtggt ttgattttgg tttattataa ttttcgtttt tttggtttaa gtaatttttt 2580 

tgttttagtt ttttgaatag ttgggattat aggcgcgtgt tattatattt agttaatttt 2640 

tgtattttta gtaaagataa ggttttatta tgttggttag gttggtttcg aatttttgat 2700 

tttaggtgat ttatttgttt ttgtttttta aagtgttggg ttgatttttt tatttggttt 2760 

gtttgattta gatttgtttt ttaattttat taaatagtta ggtgtagatt ttttttttgg 2820 

taatattttt tgagattgtt ttggttaatg tgttttatgt tgtaattggt ttttaaatat 2880 

tttatttgag ttttgatttt agtgagtagt attgttttga gggttatttt gttttatgaa 2940 

tgggatgagt tttttaggta ggggtatagt tattttttat ttttatgtaa tattttatga 3000 

tgattgtgta gattattttt ggttaatatg atataaaagg aggaggttat attttttaaa 3060 

tataggattt aaaatttacg aggggacgcg gtggaggagg gttgttgttt attcgggggc 3120 

gtgggagtga ggtattggat ttagtttatt tggtttcgaa gtttttgttt tcggaattcg 3180 

ggtgttgtgg gttgaggttt cggtttttaa cggtgggatt ggtgttttcg agatgaaatt 3240 

tggggttttt tcggggtttt ggtgggatcg gtgtttttag gatgagattt agggtttttt 3300 

tggggttttg gggattttta tttaatattt gcgattattt tatgagagga gtggttttgg 3360 

ttgttagaat tggatttttg gggtgatatt tgggagttat tggagtgatt tttgaagatt 3420 

tagggttatg agttggagtt gttggggttg aaatttgggg tttttgaagt ggtatggaga 3480 

ttgaggttta gagagtttga gattttgagg gttgatattt ggagagatgg ggtcgagggt 3540 

tgtttttggg ttttgattgt tttgggtttt ttttattttt attttcggga tgttttgtta 3600 

gaatttttgt tggattggtc gtaattttgt ttcggagcgg gtttataggg tttgaaggtt 3660 

aggtacgagg taaaggtaaa gatttaattg atataggtta tgatagaggt gttgatttcg 3720 

tgtttaaaat ttttgattta tttatgtaag ttttgtttta atttgttttt cggagtatta 3780 

atgttttttt taaaattttt ttttttttgt aatttatata aatagtttat ttttttaagt 3840 

aattaaaatt ttttatgaat atttaagttg tttagttttt aggagaggtt tagtaaatta 3900 

aagttgtatg ggatttgtat gagttacgga aagatgcgat atttaatttt gtattgaaac 3960 

gttttttttt gggaaaagta taaaacgtta aattgtaaag attttgttgt aataaagtgt 4020 

atatataaat tttttttttg tgatggtgaa ttttatacgt taatttattt gagttatggt 4080 

gtttagatat ttggttgaat agtagtttaa atgttgttgt gaaggcgttt tttagttggg 4140 

attaatgttt aatttagtag attttgagta aatcggattt ttttttataa tgtttgtgtt 4200 

ttatttaatt agttgaagat tttaagaata tagattgagg ttttttaaag aatatgaaat 4260 

tttttttaag attgtaattt agaaattttg tttgagtttt tagtttgttg ttttgtggat 4320 

tttcgattta aggttataat atttattttt atttgaattt ttattttgtt agtaaatttt 4380 

atagatgtag gatttgtagt ttttataatt atgtgagtta attttttaat gtaaattttt 4440 

tttatttttt ttttgttata tttatagatt tatttgttat atttatttat gtattattta 4500 

ttatttatta tttattattt atttatatta tttattattt ttatttattt atttatttat 4560 

ttatttattt atttatttat ttattattta tttaatatat ttattttatt ggttttgttt 4620 

ttttatataa ttttgattaa taaatttatt aaattttaag aagttttttt atttttttta 4680 

ggtgtagtat atattttttt tgttataggt gagaagtgtt aaggttgtaa tttgttgttt 4740 

ttattagagg atgggattgg tttacgattg tatttttgtt ttttgattat gtatgtatag 4800 

ggtttttttt aagggttttt aaatataaat ttattagatt tatttttgcg tttttgtatt 4860 

gttaattttt ttattaatat tatttattta tatattaatt atatatgcgt atgtatatat 4920 

gtatatatat ataataaaaa acgaaagtta tatttgtgtt atatttttgg aaatgatagt 4980 

tgttgttttg ttttttcggg gtagggtgtt tgggtttgtt aaatattgag tacgtggtaa 5040 

tagtttattt gtgtttgttt ttataatatg ttgttttata gtaattttcg cgtttttatt 5100 

ttagtaagtt ttagtattat tttagggagt cgttaatgtt tagaagaatt ttattttttt 5160 

agggtatttt atagtagtat tttttaaatt atttgtgttg aaggattttt ttatttttag 5220 

tttgtagatg aatattttta aaaaatataa aatatgtatt acgatattgt agaaatgtta 5280 

agttgttata aaagttttaa aagtagattt ttattttttt attatttttt tattgattag 5340 

aaagtaatta tgttttgaat agaattgttt taagtgttta atatagagtg gttgaggatg 5400 

gtgtt_taggg„taatagatag -ta^tga«:ag ggaagattgt-gtaggaggat -gtatattttg 5460- 
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gtggggttta 
ataggaaata 
tttagtatgt 
tagatgatta 
tttatttaaa 
gtttttttat 
tatttttatg 
aggttgagat 
ttcgatatag 
attgttaaaa 
gtttttaaag 
ttttttgaga 
tttattgtaa 
aagttttttt 
gaaatgattt 
taaaggtttt 
aataataatg 
taagtgagaa 



tgttttatat 
tttttttgta 
ttgttttatt 
atgatgggaa 
ttttagttta 
ggtaattgtg 
tattatgaat 
tggaaatttt 
ttttatatat 
agtatagaaa 
aaaaaatgtg 
tagagtttta 
tttttgtttt 
tataggaaaa 
ttgtaattat 
tatttgttga 
tgttaaaata 
tatgtagta 



tatttgtatt 
ttttagtttt 
aattgtaatt 
gttgaagaag 
gagggagatt 
aaattagttt 
agagaaggtg 
atatttaata 
gtgtataagg 
tattgaggtt 
agggttaagt 
tttttgttgt 
ttgggtttaa 
aaggaatttg 
aaataaatta 
tgtaattaaa 
tatttatgtt 



seq_prot . 
tttaggttcg 
ttttatttaa 
attttataat 
ataggttatg 
ttttttttta 
atatatttaa 
gaattaaatt 
tcgttgtaat 
agttaatgtt 
taaaatttta 
tttatttttt 
ttaggttgga 
gagatttttt 
attgtttttt 
ttttttaatt 
tggttaaata 
tatgtgtatt 



txt 

ttatgaacga 
gtatatttat 
ttaggttatt 
ggtttttagt 
tttcgtttaa 
ttattttata 
tttattttat 
tttttttttg 
atttttgata 
ttttatttta 
attttatttt 
gtgtaatggt 
tgtttaagtt 
tatattaaat 
gtgttagata 
ttatatagaa 
taatgattag 



agtttttaaa 
tttattttta 
ttgatttttt 
ttatttaaag 
atatattatg 
tttgattatt 
aataagaagg 
gatatattag 
taatttttat 
atagtgtaat 
atttatttat 
gtaatttcgg 
tttcggaatt 
tagtgtaaat 
tattaatttt 
tttaagtggg 
ttgttagtta 



<210> 304 
<211> 4441 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 304 

ttagtataag 
agtattttgg 
tataatttag 
aataaaatat 
atattttatg 
aatagtttag 
ttgatttttg 
ttgtggttag 
taaatttgtt 
tagttttata 
aaattttatg 
atgtgtgttt 
gaaatttttt 
tgtgtgtgtg 
tttttgtggg 
tatttaggag 
tataatgtag 
aaggaagaga 
aattaaattt 
ggggttcggt 
cgttttatga 
atcggaaatg 
ttaaataagt 
aattttagag 
aagagggggt 
aaatgtttgg 
ttgttttaaa 
tgttagtttt 
gtgttggtat 
tatattattt 
gtttaagtat 

ggCjtttattt 

ttaaaattga 
ttggtgagta 
attagtcgtt 
ataatatttt 
atcgttattt 
ttggtaagag 
tttaggagga 



gtttttgtta 
ttttcggata 
gaggtttgtt 
atttgttggt 
ttagaatgag 
tgattttttg 
gttcgttttt 
ggttttatag 
gtttaaatat 
cgttttattt 
atttgatgtt 
atttttagga 
gtaaagggtt 
tgtgtgtgta 
gttttttttt 
tttttttata 
ataagtttta 
ggtataaatt 
attaaagtat 
ttgttgtagg 
gataatgtgt 
tttattgttt 
aaaaggaaag 
ttataaggaa 
tttgtaaaaa 
tcgatgcgga 
atgtgtttta 
gtttttgttt 
tagtattaat 
gtttgtgttt 
ttggtaagta 
atattttaag 
tgaatggtat 
agcgtgtatt 
atgatgttaa 
gattttttaa 
ttttttgagg 
tagtttatat 
aattgaagag 



agtttatggt 
gtatagaaaa 
agtttgattg 
ttttatgtcg 
aagtgaatta 
gaatttaaat 
ttttcgtttg 
agaattttag 
gtgaagttta 
tttttataaa 
gtcgtggtta 
tttatagttc 
tgaagtatgt 
tgtgtgtgtg 
tttatttgtt 
gttgaagtgt 
aggtatgttt 
gtgtcggtta 
gaagattgtt 
agtaaatgtt 
agtatttgat 
tcgttttatt 
agaattttgc 
ttagaaaaag 
tatatatatt 
ttagtattga 
ataaatataa 
ttatattgtt 
tcgatttaaa 
tggttttttt 
ttttaaaaat 
aaaggatttt 
gttgtcggga 
gttttgatat 
gttttaagtt 
ttacggtttk 
taagtataat 
agtataacga 
atatttttat 



tgtttttttt 
tagaaattga 
attttttaat 
tagagtagtt 
aacgaggtgt 
aggatagggt 
tgattatgtt 
tgttgtttag 
gttgggagtt 
gattaaataa 
gttgtaattt 
cggggggaaa 
gattaaagtt 
tgtgaatgtg 
aaaagtatta 
aatgaatgga 
tgtattgtag 
taaaatgaag 
ttaagttggt 
tttagatttt 
tttttattaa 
tgaaaggtat 
ggttcgtaga 
ggaggagtta 
tgtttggtta 
ttaacgtcga 
atggatttgg 
ggtgtagggg 
atatattttg 
taggaatatt 
attattagta 
ttttttttgt 
tatttttatt 
tttttggata gtaaataatt 
aatagtggtt aatgatcgtt 
agaagttttg ggaatgaggg 
gtgttgtgga aataggttat 
ttgagcgtta tggttgtgt* 
aagagtttgt cgaagaggat 
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ttgaaaattt 
attaaatagt 
agttttatta 
tggtttgttt 
tttggggata 
attaagtgaa 
tttatggttt 
ggttatggtt 
tagtttgggg 
atttaaagta 
tattttttaa 
gtttagtttt 
agtggtttgt 
tgtgtattta 
gtttttacgt 
tttaaatatt 
tattgaatgg 
gttatttaga 
ataagttgtt 
tatttttttt 
aaaaattatt 
atttatgttt 
ggtttaagtc 
gaaaagggaa 
agattttttt 
gattgagtag 
gggggaggaa 
tttgtatagt 
ttttggtttt 
agatgtaaaa 
attagttatt 
tttaatttta 
aaaatttatt 



cggatttgaa 
ataattatag 
cgtaagaagg 
atttaagttt 
gtatttgttt 
tatagggtag 
atattttttt 
ttaagaggta 
aaaggggagg 
aatcggtgtg 
atttatttgt 
gagaatattt 
taggatagag 
tgagtgttta 
tatatggtag 
agtttggtta 
tgatttggga 
ttatgtggaa 
tatttttaaa 
ttttatatgt 
cgacgttttt 
tgtatatttg 
aattttaata 
gatttatatg 
gtttattatt 
atttgggtta 
gaaggttcgt 
ttatatatag 
taattttggg 
aatagaaggt 
tttggagggt 
gtgtcggttt 
ttttttttaa 
gaatttgttg 
ttgggaaaaa 
ttcgttagtt 
ttttgtattg 
t^tttgtgtpt 
tagggggcgt 



5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6499 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 



seq_prot.txt 

taacgttcga tttttatttt agtagtagtt ggattttttg aagggagaag atattgtagt 2400 

gattatttat tttgtattgt tatggttttt ttatttttat ttggggtggg gtggggtggg 2460 

gtgggggagg ggggggtggg gtggggagaa attatataat tttaaaaagg attatattaa 2520 

ttattttttt tgtaattttt ttatagtttt aggtttagtg aaaaattgtt gtaaatatag 2580 

gggatatagt ttaataatgt aatttttaat tattgttttt ttttttttta atttattaat 2640 

agtttgttga tttgataagt aagagtgggc gggtgagaaa aatcgaattg ggtttagtta 2700 

attattgtat tgtatgtaaa taagaaacgt gttatatttg tgacgtcggg tatttatata 2760 

ggaagaacgc ggtgtgtaat attgtgtata ttttaaatat tattttaatt tattttttgt 2820 

agtgaatttt ttgtttagaa tattaaagat aaggattaga tattattttt tttttttcgt 2880 

ataattttgt agatatttat ttgatgattt ttaatttttt atttttaaat gagacgaaat 2940 

gttgatgtat ttttttattt agttaataaa ttagaaaagg ttatgtttat tttttaaaaa 3000 

gggaagtaag taaataaata ttgttaattt ttttatttat ggatattata tatattagta 3060 

ggagtaataa atttatttat agtatttgtt tttaggataa tattttattt ttaggaaatt 3120 

tattttttat agagttaaaa tgttatttag taataaataa tatttgttag ttttagagta 3180 

tttaaggaaa ttagataagt aaaattattt tttttgtaat ttaatgaaaa ggtataatag 3240 

aataatgtat gatgaattta tttaattatg aggtgggagg agcgaaattt aaattttttt 3300 

tgttatagtt atatattaat ttaaaaagta aaaaaaaaaa aggggggggt aatttttttt 3360 

tgtgtttttt tttttttttt tttttttttt tttttttttt tattgtgtat tagtttttat 3420 

gaaagatttg aatattattt attttaaatt aagtatatgt gttattttaa gtaatacgtt 3480 

ttgatataag atggttgatt aaggtgtttt ttttcggttt gagtttatta tttttttatt 3540 

taaattgtat ttttagttag agatgtaata tatttttatt atttaatatt atttttgaat 3600 

gttataacga atttatagtt tagtatttat tatatgttgt tatatataag taatgtaaga 3660 

aaaaaattta ttgggtaggt gattttaatt atttgtagtt ttttttgtat atttaattat 3720 

agttaaagaa gtaatttttt tattgtgttt tagtatgatt atgtattttt ttatgttttt 3780 

ttaattaaaa atttttaaaa tatttgtttt agtttttttg ttagattttt atattaattt 3840 

gaaaattttt taattaagtc gtttttaggt ttttaaggat aatttttttt aattatatta 3900 

tatattatat aagatttgat tgtaatattt aaatattatt ttttaagttt gtattttaaa 3960 

tgaatttttt aaggagatgg attaattgat ttgtaaagat ttatttttag attttaaaaq 4020 

gaatgaattt gttatttgta gtatttattt gtttttttaa tgtttgaaat agtttaaatt 4080 

gtagttaatt ttagttaaaa ttatttttgt aaaagatatt tgatagaaag gaatacgttt 4140 

ttatatattt ttgtaaaata agtaaataat aaataaaata aaagttaatt tttaaagaaa 4200 

tttgaagttt tgtaggtgag atgtaataag ttttgttttt gtataatgta attaaaaata 4260 

tgtgttttta agattagttg aatataagaa aatgtttgat aaatattttt atgtatttta 4320 

tataaatgtg atttttgtaa tatgttttaa ttagatttat tttaaacgtt ttttatgtag 4380 

agtttttatg tttttttttt ttagtgagtg tgttgatttt ttaatatggt attattaatt 4440 

g 4441 

<210> 305 
<2U> 4441 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 305 

tagttgataa tattatgtta aaaagttagt atatttatta ggagagaaag gtataaaaat 60 

tttatataag aagcgtttaa aataaatttg gttgagatat attataaaaa ttatatttgt 120 

gtaaaatata tgaaaatatt tgttaagtat ttttttatat ttaattaatt ttaaaaatat 180 

atatttttga ttgtattatg taaaagtagg gtttgttgta ttttatttat aaagttttaa 240 

gtttttttga aggttggttt ttattttatt tattatttat ttattttgta aaagtatgta 300 

aaaacgtgtt tttttttatt aaatgttttt tataaaaata gttttgatta gggttagttg 360 

tagtttgaat tattttaaat attgaaaaaa taaatgaatg ttgtaagtaa taagtttatt 420 

ttttttgaag tttggaggta ggtttttgta agttaattag tttatttttt taaagaattt 480 

atttgaggta tagatttgga gggtaatatt taaatattat agttaaattt tgtgtgatqt 540 

gtagtgtgat tgaggaaaat tatttttaag aatttaggag cgatttggtt aaaaaatttt 600 

taagttaatg tagaaattta gtagagaagt tgaaataagt attttaaaaa tttttaatta 660 

aaaaaatata gaaaaatata tagttatgtt gaaatatagt aaggagattg tttttttaat 720 

tgtaattaag tgtataaaaa gaattgtaga tgattagaat tatttattta gtaagttttt 780 

tttttgtatt gtttgtgtat agtagtatgt aataagtatt agattgtaaa ttcgttgtaa 840 

tatttagagg tagtattgag tagtggggat atattgtatt tttggttaaa agtgtagttt 900 

gaatgaagag atggtgaatt taagtcgaag aaaagtattt tggttaatta ttttatgtta 960 

aaacgtatta tttgaagtaa tatatatgtt taatttgagg taagtggtat ttaggttttt 1020 

tatggaaatt gatatataat gaaaagagag agggagagga agagagagag agaaagatat 1080 

agagagagat tgtttttttt tttttttttt ttgtttttta aattgatgta taattatagt 1140 

iSSSK*;" tagatttcgt tttttttatt ttataattag gtgagtttat tatgtattat 1200 

tttgttgtat ttttttatta aattataaag aqqataattt tatttgttta gtJit^ttiaa 1260 — 
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atgttttgag gttgataagt gttatttatt 
agattttttg aaaatgaagt gttgttttga 
ttgttgatat gtgtgatatt tataaataga 
ttttttgaaa aatgaatata atttttttta 
tatttcgttt tatttagaaa taaaaagtta 
tacgaaaaag agaaagtagt atttagtttt 
tatagggagt gggttggggt ggtatrtgag 
ttatatgaat gttcgacgtt ataagtgtga 
ttgattaaat ttaattcggt tttttttatt 
tattagtagg ttaagaaaaa agaaaatagt 
tttgtgttta tagtagtttt ttattaaatt 
attaatatag ttttttttaa ggttatgtga 
ttttatttta ttttatttta gatgaaagtg 
tattgtagtg tttttttttt ttaaaagatt 
ggcgtttttt agtttttttc ggtagatttt 
aatatataaa gggtatagtt atgacgttta 
atagtgtaga ggtaatttgt ttttatagta 
tgattggcga gtttttattt ttagggtttt 
ttttttttta gagcggttat tagttattgt 
ttagtaaatt taattgtttg ttgtttagag 
attagaaaaa aagtaggaat ggataaattt 
aaaatcggta tataagaaaa atagagttag 
tattttttag atattggtaa taatagttgg 
tattttttgt taatattttt gttttatatt 
gtttaaaatt atagggtgtg tgggattagg 
tttgtgtgtg atttttatat tattgtgtag 
aacgagtttt tttaaattta tttttttttt 
gtgatttaag ttcggcgtta attgtttaat 
tgatggtggg ttagttaagt agagaaaatt 
ttatataaat ttaatttttt tttttttttt 
ttgttggggt ttttgcgagt cggtttgagt 
ataaatgtgt agtatttttt aggatatggg 
taaaagcgtc gttagtgaga ggatgatttt 
ggtatgtgaa aaagatttgg gggaagaggt 
ttttaagaat ggttagttta aaatagttta 
tttttatatg gttttatttt gtttgggtgg 
tttttaggtt attatagtat attatttagt 
atgattaaat ttttatttat tagtgtttga 
gttattatgt gtgatgtttt tgcgtaaaag 
atgggtattt atatatttat atgaatgtat 
attttatttt gggttttaat tataggttat 
taaatatttt tttttttttc gaagattgag 
tataaatgaa tggattatag tttgaaaaat 
ttatatcggt tttgtttagt ttgttttggg 
attttttttt taatttttaa tttttaaatt 
atattttttg attaggtaat aggttatgat 
aaggaggatg tagtatgatt aaagttatgg 
gttgttttgt gattttgttt ttttatttaa 
taggtagata tatatttcgt ttgtttttag 
tggatttagg tgattatttt gaaatagatt 
ttttttttac ggttaaaaaa ttigatagaat 
gttgtaatta tttaattttt aattgtttga 
ttttaaattc ggaaggaaat agaattttta 

g 

<210> 306 
<211> 4343 

<212> DNA < i 

<213> Artificial sequence 

<220> i ■ 

<223> chemically treated genomic 

<400> 306 

tttaaagaat tattaaatta ttttttgggt 
gttgttaatt gagtgaaggg tgttggggaa 
tagagaatcg aaaagacggt agttgtatga 



seq_prot.txt 

gttaaatggt attttggttt tgtaggaagt 1320 

aaataagtgt tgtgagtaaa tttattattt 1380 

agagttggta atatttgttt gtttattttt 1440 

gtttgttagt tgaatgaaag gatatattag 1500 

aaaattatta agtaagtgtt tataagatta 1560 

tatttttggt gttttaaata gaggatttat 1620 

gtgtatatag tgttatatat cgcgtttttt 1680 

tacgtttttt gtttgtatgt agtgtagtga 1740 

cgtttatttt tgtttattag attaataaat 1800 

aattaaaagt tgtattgtta agttgtgttt 1860 

tgggattgtg aagggattat aaagaaggtg 1920 

ttttttttta ttttattttt ttttttttta 1980 

gaaagattat ggtaatatag aataagtggt 2040 

taattgttgt tgaggtagaa atcgaacgtt 2100 

tgtgaggatg tttttttagt ttttttttgg 2160 

atcgttatat tgtgtgggtt gtttttgtta 2220 

tattatgttt gttttaaaag aggatgacga 2280 

ttaaatcgta attgaggaat taaggtgttg 2340 

tgatttaagg tttgatatta tggcggttgg 2400 

aatgttaggg taatgtacgt ttatttatta 2460 

ttttcgatag tatgttattt attagtttta 2520 

ggggattttt ttttaaggta taaatgaatt 2580 

tatttttaga gtgtttatta gatgtttagg 2640 

tgagaaaatt aaggtataga taagtaatgt 2700 

atttgggtcg ggttggtatt ggtgttaata 2760 

aggtagtgtg agagtagggg tagagttgat 2820 

tttgtattta ttaaagtata ttttgagata 2880 

tttaatatta attcgtatcg attaaatatt 2940 

taatatatat gtttttgtaa ggtttttttt 3000 

tttttttttg attttttatg gttttggaat 3060 

cgtaaagttt tttttttttt tgtttattta 3120 

tggtaaagcg gagataataa atattttcga 3180 

tattaaatgt tgtatattgt tttataagac 3240 

aaatatttat ttttgtagta gatcgagttt 3300 

taatagtttt tatattttaa tagatttgat 3360 

ttaatcgata taatttatat tttttttttt 3420 

aaagtatgtt ttaaagtttg tttgtattat 3480 

aatattttag ttgtgaaggg gtttttaaat 3540 

tagtaaatgg gagaaaaaaa ttttataaga 3600 

atatatatat atatatatat atatatatat 3660 

tgtatatttt aaattttttg taaaaggttt 3720 

cgagttgtaa attttgggaa taaatatata 3780 

ataattacgg tagtattaag ttatgagatt 3840 

ttttgtggga aaagtggaac gtatgaggtt 3900 

gtgggtttta tatatttaag tagtaggttt 3960 

tttgagattt tttgtggagt tttggttata 4020 

gtagacgaaa gagagacgag ttaaagatta 4080 

tatttaaatt ttagaaagtt attaagttgt 4140 

atgatttatt ttttattttg atatgaggtg 4200 

acggtataaa agttagtaag tatattttat 4260 

ttagttaagt tggtaaattt tttggattgt 4320 

tttttttata ttattcggaa attaaggtat 4380 

ggttatgaat ttaataaaga ttttgtatta 4440 
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ttttgaatta gatggagttg aattattttg 60 
aagtgttatt gtgtaagtta aggaaagggt 120 
aagggttttt gtgttgggat ttttttttta 180 
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agttatttaa aggtataaat taagaatcgt aattttggag atatgataaa tgaatataat 240 

tttggttttt tgttttattt taaagtgaaa ttttattgtt ttttaagaat ttgtatatga 300 

agttttatat attttttatg ttagtttttt tttttgttat ttttgttttt tagtttgttt 360 

ttttttaata gttggatata ttatcgtttt atttttttat tagttgtgat tgaagtagtt 420 

tgaggttaat tattttttta gtaattgata aagaattttg atgaagagga aagtgatatt 480 

gttttttgaa taatattgtt tttttttttt tttgtttttt cggttgagag gagagagtag 540 

ggtagatttt ttttttgtaa ggagtttagt agggcggtgg cggtggggtt tgtggaagtg 600 

cgtgggtaat atatggtaag tggagaaaat atttatattt atggtgaaag agtaggagaa 660 

gatttaaaga aggttatttt atagagattt taagtaagag atattgattt gatattgtat 720 

ttttgatttg gatataatag agaatttaaa gaaagagttt ttttttgtgg tttttttqtt 780 

atcgttattt ttaataattc gaattattgg taatttaaaa aagaaaagat ttcgtttatt 840 

?S?Hi2f ta aaa 9tattat attggttttt tttcgggttt tggttaggtt gttacgtttt 900 

tttttttttt tttgcgttag gtttttaaat ttagaattcg gagtcgcggt tattgagttt 960 

gtgttttggg tttgcgggcg gtgtttgttg ttgtgattgg gcggttaggt gagacggagc 1020 

gtacgggtag agtgcgtatt tggggcgatt ggtttttggg ggaagatata aataatgaat 1080 

cggtgttaga ttattttttg atgagatata tgaaattttt agtgttttta gtaggtattt 1140 

tgtattatag ttttttaaga aaggtaaaag gttttttttc ggttttttag tggtaaaatt 1200 

cgtagttgtc gtttagtatt ataattaaat gcgtgcgggg ttgcgggata gaggcgtttt 1260 

cgcgtatttt cggatagatt tatagttgtt tggcggatat atacgtgcgc gtttggtcga 1320 

tcggattagt agttgtttgc gttttttttt tcgcgtacgg tggcggtttc gtattaagat 1380 

tcgttgagga aacgaagagt agggttattt tttattttgt cgtcgaaagt ttattttatt 1440 

tttcgggtgt gtaggaagag atggcgcgag ggtaggaagg tgtgaaattg gggttttgcg 1500 

ggagatttat cgtggtttta tatttatttt cgtttgaaaa ttgttagcgt tttatttttt 1560 

ttaaatttta gttttcgagg gggaggtagt gagatgggat ataggtgcgt ttgggttttc 1620 

gtttgggagg ggttttttta ttttgggatt tttcgtggag tagtttataa tcgatcggtt 1680 

SHiS 1 ?* 38 a 99"atgtt taatggattt tcgtagataa aagggtttgg ttattttttt 1740 

tttttataat ggtttttgag gttaatgttt taaagaggaa ataaagggat tgttttatag 1800 

B!; at ^ g99 ^ggtggcggg aggaataggg agaggaggag gaaggggtat tggatacgcg 1860 

gagcgggagg aggtttttcg ttttttaatit ttttattttt tgcgatcgtt taggtttttc 1920 

gag i tt: S2H t J tttattta ggattcgaaa tagcggggag gaggcgatag tgcgtggagg 1980 

gtttcgtttt gtgattcgat ttttagtttt ttttattttt tttttcgttt ttttaaaqaa 2040 

tttttgaaag ggagaacgga aaagatgagg ggatttatat ttcgtgagtt gtagttttga 2100 

aaagttagtt ttagaggggt cgtttttcgc gaggttcggg tgtttatata ggatcgacgc 2160 

gtggtagtgt tatttgggtt cggattttgg gattgcggga ggcgggagcg tttggggtta 2220 

2?SJn?22« 2^ttgag ggtattcggt gaaggtcggg gaaagtgggg tagtttcgaa 2280 

atgcggtttg tacgtcgttt tgggagtcgt tttcgggttt ttttgtagat atagttttta 2340 

ggtgtgagcg gtgtgtgcgt cggtcggggg cgttttttat aggaagttta tatttgtaaa 2400 

H 9gg S ggtg 99gggggggg ggggcggggc gcggcgggag agagagagtt ttagtggttt 2460 

ttgttttttg gtttttttgt tttggtttta ggaaaagatt aaaataatag ttaaatagta 2520 

attgaagata agtgatgtaa gaaataaana attcgataat aaaaatgttt atagggacgg 2580 

ttttttaaaa ttttttatta tggaaattag agtatatgta taaaaggaga gagaaaggta 2640 

^Wtagtg ggttttaaat ttgattaagt attcgaatcg cggagcgaaa ttttttggag 2700 

ggtttgttaa aatatagatt gttgggtttit agttttagag ttttttgttg tgtattttta 2760 

ag PH a H5 g OSgttagttt ttttttgaga atttgtaatt tgttagagtt tttaggtggt 2820 

SHEF^* 8 9tttggaaag gatattgtta tttattggtt taataaattt tcgcgtattt 2880 

ttttttattt ttaatatgta taatattttt ttattttgtt tattttattt ttttttatat 2940 

tttgtttggt ttgttggaga attataaagt aatttttagt tattttatta ttttatttat 3000 

SHKI^i i tt: 9 a ttcgg tggttattta agtcgagagg attgtaggtt tttttgtggt 3060 

tattttatat tggggtagaa tttggtattt ttttttggga aatttatgtt tatattttgg 3120 

ttttgagatt atttttcggg agtttttttt tagtagattg ttatatgttt aaatattatt 3180 

^HSSP 1 ! 8 "SQtaagtt ggtttggttt ggttagggta ttatttggtt tttatttttt 3240 

attttttgtt tattgttttt aatttgttat tatgtatttt tataattatc gttttttttg 3300 

^HPPHi "aataaggc gttttttttt tagggtggag ttttgagtgt ttagggatgg 3360 

ggtggtattt ataatagtgt aagtttatag gattatttgt attagttagg tttaggattt 3420 

atattttaag gggagttttt ttgggttatt tgatttttcg tataattttt gattggaaaa 3480 

ggggatattt taggtatagt ttttgggaat ggttattatt gatatttgtc gtaatttttt 3540 

ttgattgtcg ggttttagtt tagtttggtt atttgtgttg tttttgtttt agagaggtgt 3600 

" ta 9 a tttt atttggttag ttttaagtgt gtagttttta tttatttttt 3660 

t 2^ ttttt:1: tt ggg tc 9 tt tcgaaataag tttttttttt tttttttttt tttttttttt 3720 

SHHSEF a 99atattat ttgttttttt tatagttttt ttattttagt tttttttagt 3780 

ttt Hfi ttt tagaggtttt ttgggttgtt ataaagttta gatatttttt 3840 

tttatttttt aaatttttag agatggggtg ttatttattg tgttatttag gttggagtgt 3900 

?2S!2HE8 a at H g 2 ttta "Otagtttt aattttttag acgtaagcgi tttttttatt 3960 

?2H a i tggg attata 99ta cgtattatat aggtttgtta attgttgtat 4020 

tttttttttt ttttgtagag atagggtttt gttttgttgt ttaggttggt tttgaatttt 4080 

tgagtttagg taattcgttt gttttggttt tttaaagtgt tgggattata ggtatgagtt 4140 

attgtgttcg atttagattt ttttattgtt tttttttttt ttttgatttt ttatglagta 4200 

ag.tatattat atgaattttt tatt tttt ta tttgttgtta aagaaagttt cggggatgga 4260 
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gtaggttttt ttattttaga gttttgtttt tatttttata ttgttttatt tttttatgtt 4320 

tttttttttt tattagtttg gaa 4343 

<210> 307 
<211> 4343 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 307 

ttttaggttg gtgaaggaag aaaaatatag gaagatgagg taatatgaaa ataggaatag 60 

agttttgggg tggaggaatt tattttattt tcggggtttt ttttaatagt agatggaaaa 120 

atgaagaatt tatatagtat gtttgtttta tgaagagtta ggaaaagaaa gagaataata 180 

aaagagttta ggtcgggtat agtggtttat gtttgtaatt ttagtatttt gagaggttaa 240 

ggtaggcgga ttgtttgagt ttaggagttt aagattagtt tgggtaatag ggtaaaattt 300 

tgtttttata aaaaaaaaaa aaaatataat aattaatagg tttatgtggt gcgtgtttgt 360 

agttttagtg atttaggggt tgaggtggga ggatcgtttg cgtttgggag gttgaggttg 420 

tagtgagtcg aattcgtatt attgtatttt agtttaggtg atatagtaag tgatatttta 480 

tttttaaaag tttaaaaaat aaaaaaaaat gtttaaattt tgtaatagtt taagagattt 540 

ttgagggttg aggtggggat aggattggga ggggttggga tgagaaagtt gtggaaggag 600 

tagatggtat tttaaattag gatggggagg ggaggggagg gaaagaaggg aagtttgttt 660 

cggagcggtt taaaggagag gtaaggagat aggtagaggt tgtatattta ggattagtta 720 

ggtggagttt aagagagtat ttggtatttt tttggaatag aagtaatata gatagttaga 780 

ttgagttagg gttcgataat taagggaggt tacgatagat gttagtgatg gttattttta 840 

gaagttatgt ttagaatgtt ttttttttta attaggaatt atgcggaggg ttaagtagtt 900 

tagagaagtt ttttttgggg tgtgagtttt ggatttaatt gatgtaaata attttatggg 960 

tttgtattgt tgtgggtatt attttatttt taggtattta gaattttatt ttggagggga 1020 

gacgttttgt taggaaaata ttttagggga aacgatggtt gtgggagtat atgatgatag 1080 

attggggata atgagtagaa aatgaaagat ggggattagg tggtgttttg attaggttag 1140 

gttagtttgt tagtagattt agtggtgatg tttgggtatg tgataatttg ttggaaagga 1200 

attttcgaag ggtggtttta gaattaagat atgggtatgg attttttaga gagaggtatt 1260 

agattttgtt ttaatgtgaa atgattataa ggaagtttgt agttttttcg gtttaagtaa 1320 

ttatcgaatt aaaaattatt tatataggtg aaatgatagg atggttggga attgttttat 1380 

aattttttag taaattaaat aaaatgtggg aggggataga atgaatagag tggggaggtg 1440 

ttgtatatat tgaaggtggg agagggtacg cggagattta ttggattagt ggatagtagt 1500 

gtttttttta aattagaagt agtattattt gagggtttta ataagttgta aatttttagg 1560 

agggagttga ttttagattt gtttaaaaat atataataaa aaattttagg gttagggttt 1620 

agtaatttgt gttttaataa gttttttaga ggatttcgtt tcgcggttcg gatgtttgat 1680 

taaatttgag atttattgta taatattttt tttttttttt ttgtgtatgt gttttgattt 1740 

ttataataaa gggttttaaa aaatcgtttt tatgggtatt tttgttgtcg agttgtttat 1800 

tttttatatt atttgttttt aattgttatt taattattgt tttgattttt ttttggagtt 1860 

agaatagaaa agttagaaag tagaagttat tgagattttt tttttttcgt cgcgtttcgt 1920 

tttttttttt ttttatcgtt tagtttgtaa atataaattt tttgtaagag acgttttcgg 1980 

tcggcgtata tatcgtttat atttgaaggt tgtgtttgta gaggggttcg ggaacgattt 2040 

ttagggcgac gtgtaggtcg tatttcggag ttgttttatt tttttcggtt tttatcgggt 2100 

gtttttagcg tacgtttagg ttttggtttt aggcgttttc gtttttcgta gttttagagt 2160 

tcgggtttag gtggtattgt tacgcgtcga ttttgtgtgg gtattcgggt ttcgcgagag 2220 

acgatttttt tagaattgat tttttagaat tgtagtttac gagatatggg tttttttatt 2280 

tttttcgttt ttttttttag aagtttttta aaaagacggg gaaggaggtg aggagagttg 2340 

gaggtcgagt tatagggcgg ggttttttac gtattgtcgt ttttttttcg ttgtttcggg 2400 

ttttggatga gggtgcggag ttcgggagat ttgggcggtc gtaggaggtg aagggttaag 2460 

aagcgggagg ttttttttcg tttcgcgtat ttaatgtttt tttttttttt tttttttgtt 2520 

tttttcgtta ttttttttgt gtgttgtaaa ataatttttt tgtttttttt ttaagatatt 2580 

agttttaggg gttattatag gaaagaaagg tagttaggtt tttttatttg cgagaattta 2640 

ttaagtatgg tttttgtata gtaggtcggt cggttatggg ttgttttacg aggggtttta 2700 

aagtgggaga atttttttta agcgaggatt tagacgtatt tgtattttat tttattattt 2760 

ttttttcggg aattggaatt tagggggagt ggaacgttgg tagtttttaa acgagagtga 2820 

atatagagtt acggtgggtt tttcgtagaa ttttagtttt atattttttt gttttcgcgt 2880 

tatttttttt tgtatattcg ggaaatggga taaattttcg acggtaagat gggaaatgat 2940 

tttatttttc gtttttttag cgggttttgg tgcggggtcg ttatcgtgcg cgggagaggg 3000 

gacgtaagta gttgttggtt cggtcgatta gacgcgtacg tgtgtattcg ttagatagtt 3060 

gtgggtttgt tcgagggtac gcgaggacgt ttttgtttcg tagtttcgta cgtatttgat 3120 

tgtggtgttg ggcgatagtt gcgaattttg ttattggggg atcgaaagag ggttttttat 3180 

tttttttagg aagttgtgat gtaaaatgtt tgttgaaagt attggaggtt ttatgtgttt 3240 

tattagaaaa taatttaata tcgatttatt atttatattt ttttttaaga attagtcgtt 3300 
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ttaaatgcgt attttgttcg tgcgtttcgt tttatttagt cgtttagtta taataataaa 3360 

tatcgttcgt aggtttaagg tataggttta gtgatcgcgg tttcgaattt tggatttggg 3420 

gatttggcgt agggggaggg gagaaggcgt ggtagtttgg ttagggttcg gggaagggtt 3480 

agtgtgatat ttttattaag tttagtgggc ggaatttttt tttttttaaa ttgttaataa 3540 

ttcggattgt tggaagtaac ggtggtagga aagttataaa agaaaatttt ttttttaaat 3600 

tttttgttgt atttaggtta aagatataat attaagttag tgttttttgt ttgaaatttt 3660 

tgtggaatga ttttttttgg attttttttt gtttttttat tatgaatata aatatttttt 3720 

ttatttarta tatgttattt acgtattttt atagatttta tcgttatcgt tttgttgggt 3780 

tttttgtaaa aggaaagttt gttttgtttt ttttttttag tcgggagggt agggggagga 3840 

gggagtagtg ttgtttaagg gataatgtta tttttttttt tattaagatt ttttgttaat 3900 

tgttggggaa ataattgatt ttaagttatt ttagttataa ttaatagaaa aatggaacgg 3960 

tgatgtgttt agttgttaag gggaggtagg ttggaaaata aaggtggtaa ggaagggggt 4020 

tgatatgaaa aatatgtaaa gttttatgtg taaatttttg ggaaatagtg gaattttatt 4080 

ttaaaataaa ataaaaaatt agaattgtgt ttatttgtta tatttttaag gttgcggttt 4140 

ttgatttgtg tttttggata gtttaaaagg agaattttag tatagaagtt tttttatata 4200 

gttgtcgttt tttcggtttt ttgatttttt ttttaattta tataatggta ttttttttta 4260 

atatttttta tttagttagt agttagaatg atttagtttt atttgattta ggggtttagg 4320 

gaataatttg gtagtttttt gag 4343 

<210> 308 
<211> 4476 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 308 

tttttaattt ttattttaag aagatttaga gtaataacgt agaaaataag cggtttgagg 60 

aggatcggga gttagggcgg aaagttagta gtatagttaa ggtgtttgtt agttttttgt 120 

tcggtttgga gaggaagaag attattttaa ttttttttac ggtgagtcgt atttttcgtt 180 

tttttttttt tttttgcggt ggggtttgtt ttttttaggt agtttttttt ttaatttaga 240 

ttttgttttt tttggttatt atttttgttt atagtaggaa gtttcgtttt tagtagtaaa 300 

tgtagaattt tttttttaat ttattattgt ttgttttagg tggaagggat aggaagtttg 360 

ttttatgaat ttggggggag aatttggttg tagattattt tggttttttg atagaacgtt 420 

tgttttttat tttttataga atagcgtttt ttttattagt ataaatcgaa gtaggaattt 480 

tttatttttg gagcgggtta gtttcggtta ggtttttatt tagaatggta aagataggtg 540 

agagattcgg gttttgtttg ttttattttt taggagttat gttttatagg gtgatgtttg 600 

ttagtagtat cgttttttgt ttttgttagc gtattgtttt ttgttttttg gagttttatt 660 

ttggttgtgt ttagtttagt tttttttttt tttattttac gttattgttt tttttttatt 720 

tttttttgat tgttatttgt agtttgttaa gtgtggggtt gatcgtggtt attttagtta 780 

tatgttcgtt ttttgattac ggttagggta tggtagttgt ttttttttag atatgagtag 840 

ttaaggtttt gtgttggggg ttttagttta gggtagaatt aagagatgtt tattttgagg 900 

ggtgtatata tagagggcga ttttagttat ttttatgaga ttagagtttt ttagttttta 960 

tcggtcgtat tttttggtgt tgtatttttg gttttatttt ttttgagttt atgaaagttt 1020 

ttttttagta atattttatt ttttttgtag aagaaatttt tttgttttta aaatttttag 1080 

gaggttagtg tagtttggag gtagcggttt tttgtttgtt tttttttatt tttgattttt 1140 

ttttttaggt attgatttat ttcgttgttt ttcgatttta tttattttta ttttttagtt 1200 

tcgtattttt tagttttgat ttgtatttcg ttgttgttta ggtttgattt ttattttgtt 1260 

agagtttttt agttttggtt ttttttttgt ttttgttttt taatttaggt ttttcgtttt 1320 

tatttatttt taatacgggt tttttcgttg tttttgtttt ttagtttaat tatgttaggg 1380 

tttcgggttt ttacggtttt tgtttttgtc gtagtttttg cggttcggtt tcgttagtat 1440 

tagaaattta tgtcggtttt cgtgtatttt aataaggttt ttgggttgtt ttttacggag 1500 

agtaattgtg aggtgtcgcg gtttaggtaa gtgtgttggg gtagttggtg tatttgttgt 1560 

ttttagttta ttttattttt ttgttttaat aatttttttt ttttatttgg gggttttgtt 1620 

gtgtttttgt tattttagtt ataagaaatg ggtttgtttt ttgcggttag gaagtggagg 1680 

gaataaaaaa gagtattaat gttttttttt tttagttttt ttttttagaa taggtatgta 1740 

ggaagttgtt ttaaggtttt aaagggaaat ttttttgttt tgaatttttt agggtttttt 1800 

tagggatttc ggggatagtc ggtattatag ggatttaatt tttaagggtt ggtttttatt 1860 

gtcgttttga gggtttagtt tgttcggttt ttagggagtt cgttgttttt agtttaaatt 1920 

atattttata taggggtttt tttttgtttt tttttttttt tttttaaatt attttttttt 1980 

atttttacga gatttttttt ttattattgt ttttagtagt tatatttttt tttttgtgtt 2040 

ttcgtgatgg ttgtttttgt tttagtattt ttttttttgt ttttattata gggtgtttag 2100 

gtgtttagtg atggttgttt tgtattttaa ttcgtttttt ttaattttat tttttttttt 2160 

atagtatagt tttttagcgt gtttttgttg tttttttatt cgtttataat attagtagta 2220 

gtggtggagt tttagatcga attaattttt ttcggggtgt gtttagtcga agtatttttt 2280 

_ atgttgggta- gtttcgatag gtgcgggatl^-agtagaaitt^ 2340 
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tttttttttt tggttatagt tagggtcggc ggggggtttt tgggagtatt tttagtaagt 2400 

ttatttttaa gtttgtacgt aggtaagtaa ggagttttgg gtggtagaga ggtttaggtt 2460 

aggttttttt gttttattcg gggtgggttg ggggttgggg gttggggttt gggatatttt 2520 

gtatcggtat tgggttttgg ggttagaaga ggttttagga agtataagaa attaggtttt 2580 

tgttaatatt ttatgtgttt aggtttattt tttttaggtt tttttttaat tttttatagg 2640 

tattttttat tttttggttt taagtagatg gtcgatgtcg tttttttttt aggagagtgt 2700 

gaatttagat gttaaaataa aagttttttt tttttttttg ggtttttatg gaaatttata 2760 

tttggtgacg tagttgtaaa gttatgaggt atgagttagg ttggggttag taaggaaaat 2820 

tttgttttgg ttttttgttt ttttgattgt tttttttatt agttggtttt gtttttggtt 2880 

gtaggcgtag ttatgttttt ttgttcgggg gttttagggt gaaatttata aatgaaatta 2940 

ttggcgaggg tttatagtgg tttttttttt aatttaattt cgatgtgtta aaggtttttt 3000 

gtgttggatt tagggtgggg atttttttac ggggtttttt atatttgagt ttttagttat 3060 

tatagaggtg tagtttgaag tgtatttagt taattggttg gttttttggg atgtttcgta 3120 

tttttatttt gttatttttt tttttgtttt ggagtagtag tttaggaagt agtaggggtt 3180 

ttgagagaat aggttcgttt gttttttttt tacgttttat tttattttgt tggaggagtt 3240 

aaagttattg ttttattcga gttttagaat gtaagtgtga ggtttgtaga gagtgtgggt 3300 

aggtttgaaa gtttgggatt ttagtttcgt tgagcggttt tttcgaaaat gggatgattt 3360 

ttgaatttgt aaagttattt tttttatttg tttatttata tatcgttttg ttggtttttt 3420 

ttttattttt ttttttattt tgtttttttt ttgtttgttt ttttagaaat ttgtttttta 3480 

ggtttgttag aaggtaggcg ttgagttcgt tgtgtgtgtg tgtgtgtgtg tgtgtttgtg 3540 

ttttgtgttt tgtttttatt attaattttt tttttttggt ttttattttt ttttatttgt 3600 

ttaattaagt ttgtgtggtt tttttttttt tgttatttta aagggatgaa gattgttttt 3660 

gattgggtat tagtataagg tttgtttttc gtgtttttag tataaatagg tagggttaag 3720 

aggttatatt ggtttattta gggtaaatgg ttttaaaaat gagggttttt tttgggttta 3780 

tagttaacgt ttgtttttaa gtaaggggag attgttttag ggaagttttt ttttaagatt 3840 

gttttttttt atttatttta ttttattttt attttttttt attttaggtt ttggttataa 3900 

gtgttgggat ttttttttgt tttttttttt gttatgtgta tgcgttatga ggaagtttta 3960 

gggttataag tgtatttggg atggtatttt gtttgttgtt ttttgggttt tttgaatttt 4020 

agagttatgt gatttttttt tcgtggttta tgggatcgta ggattttgga gatattacgg 4080 

ggattttggg gttttaaggt tttagtttgg tttttttaga ttttaggagt ttttgtttta 4140 

taaatggagt atagtatttt tttttggtag tttttgtaga attagtttta tttattcgta 4200 

tttttgtttt agtatttttg ttattagtag tatttggata aattaagttt tttttttttt 4260 

aggttgtttt ttttagatat ggtttgtttt ttttaaagtg cggggagttg ggatatttta 4320 

gggtaacggt tattttattt taagtcgtag taaaataata ggagattttt gtatttttat 4380 

ttaggggttt ttttttatag tttttttttt ggttttttta tttttggttt tatgtttatt 4440 

ttttttgtag gtttcgggga ttaaatatta agttga 4476 

<210> 309 
<211> 4476 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 309 

ttagtttggt atttggtttt cgaggtttgt agagaggatg agtataaggt taggggtgaa 60 

agagttagga aggggattgt gaagggagat ttttgagtaa gggtgtagag attttttgtt 120 

gttttgttac ggtttggggt agaatggtcg ttgttttaga atgttttagt ttttcgtatt 180 

ttggaagaga taggttatat ttggagaaaa tagtttagag gagaagaggt ttgatttatt 240 

tagatgttgt tggtggtagg ggtgttgggg taggggtgcg ggtgggtggg gttagttttg 300 

taggagttgt taggaggggg tgttgtgttt tatttgtgag ataaaagttt ttaaggtttg 360 

ggggagttag attgaaattt tggggtttta gggttttcgt agtgttttta aagttttgcg 420 

attttatagg ttacggggaa gaggttatat agttttgaag tttaggaaat ttaggagata 480 

ataaataaga tattatttta gatgtatttg taattttgga gtttttttat agcgtatgta 540 

tatgataagg gaaagggtag gaaaggattt taatatttgt gattaaaatt tggggtgaga 600 

gggagtgggg gtggggtggg gtgggtgaga ggagataatt ttaaagggag gttttt,ttga 660 

gatagttttt ttttatttga ggataggcgt taattgtggg tttaaggaag gtttttattt 720 

ttaaagttat ttgttttgag tgggttaatg tggtttttta gttttgtttg tttgtattgg 780 

gggtacggag ggtaggtttt gtgttgatgt ttaatfagag gtagttttta tttttttaag 840 

atggtagaga gaggagggtt atatagattt ggttaagtag atggaggaga gtaagagtta 900 

gaaaagggga gttagtgatg gaggtaggat ataggatata ga^atatata tatatatata 960 

tatatagcgg gtttaacgtt tattttttgg taaatttjgaa agatagattt ttaaaaaaat 1020 

aaataaaaaa aagataaaat aaaaaaagaa ataaaaaaaa aattaataag acggtatgtg 1080 

gataagtagg tgggggaggt ggttttatag gtttaaqggt tattttattt tcggaagagt 1140 

cgtttagcga gati;agagtt ttaggttttt agatttgttt atattttttg taggttttat 1200 

atttgtattt tggggttcgg atggggtagt ggtt£tggtt tttttagjiag agtgg^igtga 12)50 

Sgite 247 



seq_prot.txt 

aacgtagagg aagggtaggc gagtttgttt ttttaaagtt tttgttgttt tttgagttgt 1320 
tattttaagg tagggagagg aatggtagga tggggatgcg gagtatttta ggaggttagt 1380 
tagttggttg gatgtatttt aagttgtatt tttgtggtgg ttgggggttt aggtgtgaga 1440 
aatttcgtga agagattttt attttgggtt taatatagga aatttttggt atatcggagt 1500 
taggttaggg aagaggttat tgtaggtttt cgttagtgat tttatttata ggttttattt 1560 
taaaattttc gggtagaagg gtatggttgc gtttgtagtt agaaatagaa ttaattagtg 1620 
ggagaggtag ttaaaggggt aagagattag gataaaattt tttttgttgg ttttaqtttq 1680 
gtttatgttt tatgattttg tagttgcgtt attaaatata agtttttatg ggaatttaqq 1740 
agagaagggg gggtttttat tttagtattt gagtttatat ttttttagag aggaggcggt 1800 
atcggttatt tgtttggggt tagagaatga ggggtgtttg tgaggagttg gggagaggtt 1860 
taagaggggt gggtttgggt atataaggtg ttgataaaga tttaattttt tgtgtttttt 1920 
gaagtttttt ttaattttag gatttaatat cggtatagag tgttttaaat tttaattttt 1980 
5552:225*1 H atttcgag taaa 9t:agga aggtttggtt tgagtttttt tgttatttaa 2040 
i g ttattt 9cgt ataaatttgg aggtgaattt gttgaagatg tttttagagg 2100 
555 t 3S 9t S g Qttttggttg tggttagagg gagaggttgg ggttatatcg tagggtaaat 2160 
tttgttggtt tcgtatttgt cggagttgtt tagtatggaa ggtgtttcgg ttggatatat 2220 
ttcgggggaa gttagttcgg tttggggttt tattattgtt gttgatgttg tggqcqqatq 2280 
gggaggtaat agggatacgt tggggggttg tgttgtggga agagaagtgg ggttgagggg 2340 
gacgaattag ggtataggat agttattatt gggtatttgg atattttgtg gtggggatag 2400 

ggagggggat gttagggtag aggtagttat tacgaggata tagaaggaag ggtgtgatta 2460 

ttgaggatag tggtgagaag gaagtttcgt ggaagtggaa ggagatggtt tgggaaggqa 2520 

gggagggagg taaggaagga tttttgtgtg gagtgtggtt taggttggag atagcgggtt 2580 

55552225 9 5 c 999 ta 9 a tt ggatttttaa ggcggtaatg gggattaatt tttgaggatt 2640 

gagtttttgt gatgtcgatt attttcgggg tttttaagga aattttggaa ggtttagaat 2700 

aaaaaggttt ttttttggag ttttaggata gttttttgta tatttgtttt gagagggaga 2760 

attggaaaag aggggtatta atgttttttt ttgttttttt tattttttgg tcgtagggga 2820 

tagatttatt ttttgtggtt aagatgataa gaatatagta ggatttttaa gtgggaagaa 2880 

9 2 a ?H2H§ 9g9 taa 9999 gtagggtggg ttgagggtag taggtgtatt agttgtttta 2940 

SHSH*** "Sggtcgcg gtattttata gttatttttc gtggggggta gtttagaggt 3000 

tttgttgggg tgtacggagg tcgatatgga tttttggtgt tggcggggtc gggtcgtaga 3060 

2 a Hi C 5 gta 9aagtagaag tcgtggaggt tcgggatttt ggtatggtta ggttaggaia 3120 

taaaagtagc ggagaggttc gtgttagggg tgagtgaggg cgggaggttt ggattaggaa 3180 

gtaagggtag gggaagggtt agagttgggg agttttggta gggtagaagt taggtttggg 3240 

tagtagcggg atgtaagtta gaattgagga atgcggggtt gagaagtgga ggtgagtgag 3300 

gtcgggaagt agcgaggtgg gttagtgttt gggaagagga attagaaatg aaggagagta 3360 

ga J aagggg ^ cgttgttttt aggttgtatt ggttttttaa gaattttaag aataggagag 3420 

HHHi tat a 9 aaa 9 a 9tg gggtgttgtt gaggggaaat ttttataaat ttagaagaga 3480 

tagagttagg aatgtaatat taagaaatgc ggtcggtgaa ggttggggag ttttggtttt 3540 

a i gggg " gg " ggag H g I ««tatgtg tatatttttt aaggtgggta ttttttggtt 3600 

" g "H gag «gggatttt taatataagg ttttagttgt ttatgtttag aggagggtag 3660 

H g HH2 ttggtcgtgg ttaagaagcg agtatgtagt tgagatggtt acggttagtt 3720 

99 taaa1:t g taa gtagtagtta gaggaagatg gggaggaggt aatggcgtgg 3780 

gggaaagttg gattggatat agttaggatg gaattttagg gagtigggig 3840 

taatgcgttg gtagggatag gagacggtgt tgttgataga tattattttg tgagatatgg 3900 

tttttaggga gtgaggtagg tagggttcgg gttttttatt tgtttttgtt attttggatq 3960 

gaggtttggt cgaggttggt tcgttttaaa agtggggaat ttttgtttcg atttgtgttq 4020 

g 5 g9agagga cgttgttttg tggggaataa agggtaagcg ttttattagl aagttaaagt 4080 

ggtttatagt taggtttttt ttttaggttt atggaatagg ttttttgttt tttttatttg 4140 

? a §HS? 5 t 99 ta 9gtta aggaaaggat tttgtattta ttgttgggag cgaggttttt 4200 

tgttataagt agaagtagta gttaaaggga atagagtttg aattaagaga agagttgttt 4260 

ggagagggta ggttttatcg taggaaggaa ggagagagcg gggggtgcgg tttatcgtqq 4320 

a g99ggttgg ggtggttttt ttttttttta gatcgggtag ggggttggta ggtattttgg 4380 

" g £ g " g " ggtttttcgt tttgattttc ggtttttttt aggtcgttta ttttttgcgt 4440 

tgttattttg agttttttta gagtaagaat tagagg y y 4475 

<210> 310 
<211> 6435 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 

* * 

<400> 310 



60 



?SS£££ " att H t1: 3 a H gtttttg aa tttgtggg atagattgtg gtagtaatgg ow 
taaatcgatt tgtaattagt tttaggtgta tatatatata tatatagttt gttatatttt 120 

tttagttata gttattcgtt ttttttagtt tttgttqtag ttttttacgt ttgttttegt -180 
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atatataatt gtattttttt gtatttttgg tttcggttat tggtttggtt ttttagttat 240 

tttttttcgt ataattaagt tcgtatttat atagttttat agttttagtt gatattatag 300 

ttttaagtaa atagttttgt atattattta ttttttaaac gttgagtcgt ttatagtgag 360 

gtttgttatt tatataattt ttgttataat ttttgataat ttttattgta gttttatgtt 420 

tacgtagttt tttatatgtg gttttggtta tttagtttgt tttgttcgtt attaaatagt 480 

gatagggttg tagttttcgt ttttttttcg tagatattat tttgtgtata ttattatagg 540 

tttttgtata gtattttgat tatttattta ggtttagtta ttttttgaaa tatagttata 600 

attttttata tagatatatt attaaaagtt tgtggttata atttttgtat atttttatat 660 

ataattatag ttttttacgt ttttgatata tatatggtta tcgatatgta gtttttatta 720 

ttgtttttta tatacgttat tatatcgttt tttttataat cgatttttgt tttatatttt 780 

ttatatatag ttatgcgtgt tacgtttagt tagttttgat aatattagag aaaaagtgtt 840 

agtagttttt agtattattt tgaaaaatat atatttatag gaataatttt ttattttttg 900 

ggatttttta gaaaaaaaaa aggtttattt tggggaagta aaataaatag tggagacgag 960 

tgtagtattg ttttttaaat tattaatttt taggttttaa ggtttgggtt gggttagcgt 1020 

taataggtga gtttaggagt tttttgaatt tatttttttt tgtttagaat agagatagga 1080 

taggttttat gttttttatt tttttttttt aattttaggg atattgaaag ggttttttgt 1140 

attcgttcgt aggaaattgg ggggttggga gggagggaat tgaaaattta tgtttggtat 1200 

taaaaataaa tatttggagt gggggaggcg gtaggaagat ttttttttaa attttttttt 1260 

ttttatttat tttattaaat ataggaagag atgatttttt tttttttatt gaaaagtttt 1320 

atttaaaaat agattatatt attaaaatgg tagcggggga gagataggga gatttggagt 1380 

attggttgga ggggtttttt agacggggat tattttttaa aaaaattttt tattgggagg 1440 

aaataggtag gattttaggg aggttggtag ataaaggaat ggttttttag ggggaagaat 1500 

aaaagggata tttttttttg gttaaaaagt tggttgaaaa ggataagttg tttgagagaa 1560 

aggttgggga ggtggaaatt tttattttaa gggtgtgatt tttttttggt taagattttt 1620 

aatttattaa atggnataaa ttttttttgg atttttgagt atggttagga ataataaaat 1680 

aaaataaatt aattttttta tttatattta aaaacgtaga tagggttttt ttattttttt 1740 

aagggtaggt tgagaaataa agaaagaaag gtcgttttta agttagattt cgattgtttt 1800 

gttttgggga aaaaagtggg gaggtggggg agatgtttta attattttaa aaggttaggt 1860 

taggtttttt ttagtggttt ttaaaaaata aataaataaa tcgtttttat tattgaagta 1920 

ggagacgtgt aagggcggtt attgggggat tgatagaaaa tagagaagtt gtggaagttg 1980 

tgtgtttttt gtggaggaga taatgagagt tatttttggg agtaattttt tttaaggatt 2040 

tttattttat tataantaga gtatgagttt tttagttgaa ttatgttttt ttttgagatg 2100 

gttggttgga gagagtatgg atggagtgat tttagaaggg gtggtggtgg tggtggtagt 2160 

agttatggtt tttggggttt tggggagagt attacggggg tcgagaggtt atttacgttg 2220 

atagaaggga tgtgtatttg ggcgttgtta ttggatggaa attgggggta aaaagaggga 2280 

gaggttatag atgggggtta tttttgggtt ttttattttg tttaaaacgg ggaaagagga 2340 

ttttatttgg aaggagagtt tggtcgggtt acggggcgta atgggattta gggtgtttta 2400 

gaagtgaatg ttaggaggta gcgagttggg tgttagtagt atcggggttt gtggggagag 2460 

ggtggtcgga gtttaagtga gtagaaagta gatgtgggat gttggaaggg atttggtttc 2520 

ggttttatta tagatttttt atttatattt ttagggtta^ tttttaaagt gtatatattt 2580 

agatggtaat tatggaggtt atgtttatga ttttttaaat tttgtagata ttttttattt 2640 

tttattttcg agagtttatt tttagttgtt gagattttat tggttaagat tatttataag 2700 

attttattta aatgttgttt ttaaattttt taagttatta ttttaatagt ttagggttaa 2760 

tttttaattt atgttttatt tatgaaattt tatatgtata ttttagatta aaaagttttg 2820 

tttaggtttt ttaagttata ttttaattta gaaatagttt tatagaaaga ttttcggttt 2880 

tatttataaa agtttttagt tttgtttttt attaattaaa ttttattttt ttagttagta 2940 

ttagtttagt taattatatt taaaggtttt atttttttta ggaataatta tatttatttt 3000 

tttttgattt ttgttagtta ttcgttatat gttaatttta ttggtgttgg cgtttatata 3060 

ttttattttt tcgtaggtat ttattaatgt atgcgtagga agtagggatg gggttagcgt 3120 

cggtttcgga gtttggaggt cggagttatt tttcgatttt ggggttcgtt cgggtttcgg 3180 

tttatttagt aggttcgggt tgagtggaag ttttaggttt cggggttttc ggtttttttg 3240 

cggttgggag attttagggt tggaagtcgt atggtcgttc gtttgttttg cggtgtttat 3300 

aataatcgcg ggtagtcggg ttggtacgtt tttttgtgga gggatttttg gttcggtttt 3360 

ggtggttttt ggtataaatt tttttttttt cgtaattttt aatttttttt tgggtttttt 3420 

tatttttatt tttgggggta aagttcggtt taaattcggt tttatattaa gttttttcga 3480 

ttttaggttt ggggtttcgg gcggggttag gatgatttgg tagaggagta ggtataaaga 3540 

gggtgtgtaa ggattgttgg gattatagga tttttttgtg ggttatttag tgattttttc 3600 

gggaaggtgg tatttggtat ttagggtatg attgtatatt agagtaagtt ttttaatagg 3660 

atataggatt tcgttaagga agtagggttt tttttggggt aggttttttt atgggatagg 3720 

ggatttgggg tgagataatt tgaagcgggt aggaatttga gggatattat ttttatttat 3780 

taaaatagtg gtttttaaaa atttattttt ttttggatta gtaatatttg ttttatttgg 38^0 

taataagtta aaaatgtaaa ttattgggtt tttttttagt tttataaatt taga^atttt 3900 

gggggtgtgt ttggtaattt gtggtttaat aagatttata ggtgattttg atgagtatta 39£0 

gagtgcgaga at^tagtgtat taaatatatt ttaggtattt atatgttcgg tagttattaa 4020 

ttgtagtata gtattttttt ttttttgttt taggatttgt gtagtttttt gttttatttt 4Qj30 

tttttttaag aggatgtttt ttttttatta ttattttttt taacgggata gttai^gtttt 41£0 

ttittattagt ttttttcgttj attcggggat tatttatttt ttttttttat tgtttttttt 4200 

tattgaattt ttatatttta gtttttattt gtagaggtaa gtcggttttt ttagttttta 4260 
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gataggtttt 

ttgtaggagt 

gtagagattg 

tgcgtgttaa 

agatagtgtt 

taatagatat 

aggatataaa 

ggaggagagg 

ggaggagggg 

tttcggtttt 

gtttttattt 

tttttttttt 

ttaagttttt 

tgtagttttt 

tttggtttta 

gttttggaga 

aagaaagtaa 

aagttaatga 

tttattatta 

tagtaattag 

aggttggttt 

agatgattta 

agggttgagt 

gtgtagtata 

ttttagtgtt 

agggatattt 

ttggtataaa 

gatgggggaa 

tattggggta 

taaatatgtt 

aatgttggga 

tttgatttta 

gtttagtatt 

tttttaagat 

agtattattt 

tggttttcgt 

ttggttttag 



taaggggttt 
tgtattgggg 
gatggtgaag 
arcgtttggg 
ttttggggcg 
ttttggttgg 
tttgtagacg 
aatagagggt 
ggtggagtgg 
ttcgtttttt 
tttttttttt 
ttttttttta 
ttttgttatt 
ttttataatg 
agaattagag 
ttagtaaatg 
attaatgtag 
tatagaagtg 
atattattgt 
ataagaatat 
tgttgggtta 
gtgtttttag 
ttaaggaata 
tgattagggt 
ggttagatat 
atgaatattg 
tattgggggg 
attttaaatt 
gattagtgtt 
gttttagagg 
gttttagagg 
ggaatatgaa 
ggtttagggg 
gttggttata 
tggggatttt 
taaagtagta 
ggtat 



ggattggtgt 

agtattatag 

gtggaataga 

tttgttattt 

gtatgtatag 

ggcgggtagt 

aatttttggt 

tttagaggaa 

tatttttttt 

ttttgtaagt 

tcgttagatt 

taataatttt 

ttttaggttc 

gggtttatga 

gtttataggg 

taataagata 

atggatgggg 

gaatagggta 

ggaggttgaa 

taaaaatatt 

tagagtatgt 

agggtagagt 

aaatatgaaa 

agttttggga 

gttatagggt 

tttgggaaat 

aggagaaatt 

ttagttgtag 

tttttgggaa 

ttttagttta 

acgaagtaag 

atattttgga 

ttttagagga 

ggaatacggt 

agagtgggag 

agataagaaa 



seq_prot. 
ttttgttttt 
taggtcgagg 
ggttcgagcg 
ttgtattttt 
tagtaggggt 
ttttagtgga 
cgtttatttt 
aggggtaggt 
tttttagtgg 
ttttttttta 
ttttttattt 
ttttttaagt 
gtttatttga 
agggtagtag 
ttaggttttg 
agaaaatgaa 
tgatgaatta 
aggttattgt 
gttaaaatgg 
ggtataaata 
ttggggtttt 
gagatagttt 
atggaataaa 
gtgtggaggt 

gggtttgtgt 

tttagtagaa 
tagtagtggt 
gaataagata 
tttaagaaga 
aggaataaga 
taaaagttgg 
ataagagaag 
tagaatgagg 
aggaaaatga 
tagaaagagt 
gtaggtgaga 



txt 
cgaggggggc 
atgagggggt 
tatgtattcg 
gggtgtgttt 
tatatttggt 
gtattttgcg 
qcgqataatg 
tggaggtagg 
tttttttttt 
gtttattttt 
atattttttt 
ttttttttat 
tttttttatt 
gtgtgggagt 
tgggtgattg 
gtaattgata 
ttaatataga 
agaggagagg 
taagaaaatg 
ttgggagggg 
ttaggtttgg 
tattattaat 
tgttataaat 
attagggtat 
ttttcggaag 
ggattaagaa 
ttgggggttt 
atgaaaataa 
tagtgtgtgg 
gaagaaaata 
tataatattg 
attatgaaat 
attagttatt 
aacgttgtgt 
tagtattaat 
tagtgtttag 



gttgttgttt 
ggttgcggtt 
tttcggttgt 
ggttttttag 
atggtggagg 
attttagggt 
ttttaggaag 
gtgttagggg 
atttgttttt 
ttttttattg 
ttagtttttt 
ttattttttt 
ttttttatat 
aggttttttt 
ttaggatata 
taaatgttgg 
ggattatttg 
gtgaggaggt 
aagtaattaa 
gtttatatta 
tgggttttag 
attgttttgg 
tatgaagggt 
tttgtggaag 
aggaataaaa 
aatgaaataa 
gagagggtag 
ttgatataaa 
aagttggtat 
attagttaaa 
ttagggaggt 
aattagggaa 
attaatcgtg 
atggtagggt 
aatgtagtgg 
gggtttaata 



tattatttta tttatttttt 



<210> 311 
<211> 6435 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 311 

gtattttaag 
ttttggcgaa 
ttttaaagtg 
ttaatatttt 
ggattagtgt 
atttttagaa 
gggattttta 
gaaataatat 
tggtttattt 
aaaatttttt 
taatatttat 
tttatgaata 
tgattagtat 
ggttatatgt 
tttgagttta 
aatattaggt 
tagtaaagtt 
tttgtttaat 
ggtattaata 
ttgtattatt 



attagtattg 
aattattatt 
atgttgtttt 
gaaaatacgg 
tgggtttttt 
ttaaaatttt 
atatttttaa 
gtttaatgtt 
taatatttat 
ttattttgtt 
attagttatt 
ttttttttta 
taagattttt 
tgtatatttt 
attttttaga 
tatttttgag 
agttttgata 
tgttgttagt 
atgaggtttt 
agttttaaat 



ggtttttaaa 
gtattattaa 
gttatgtata 
ttaataatgg 
tagttatttt 
tttgatagta 
ttaattattt 
aatttttata 
attaattgtt 
tttttagatt 
ttattttttt 
tttttttttc 
atagaatgtt 
ttatagttta 
ataatattag 
atttattaga 
taggtttttt 
tattttattt 
tttatttttt 
aattttttgt 



tattgatttt 
acgttttatt 
ttggttttta 
atagtttttt 
ttatattaat 
ttttttttta 
tattgttttt 
tttattattt 
tttaggttat 
aattttttta 
gagaaatata 
ttagtgtttt 
tggtatttat 
taatgaggtt 
tttgaaaaat 
tttaatattt 
ttttattatt 
tttttgtagt 
attaataatt 



ttttattttt 
tttttgtcgt 
ttttgttttt 
tttattttag 
ttttgtttat 
ttttttaagt 
ttaaattttt 
tatttttata 
tgttgggttt 
ttgaaatttt 
ggtttatttt 
tatattttta 
tttattttta 
attttatttt 
tttaaatatg 
atattagtat 
ttagttttaa 
gattttgttt 
tattatttta 



tgttttattg 
attttgagat 
atttttatag 
tgaaattttt 
aatattttat 
ttcgtttttt 
taaaattttt 
agaggagtat 
gttaaaattt 
tttttttttt 
ttaaataata 
gtgatatatt 
gaattgtttt 
tattttattt 
gttttttggg 
ttttgtgatt 
ttttaatatt 
tttttatagt 
tgttttattt 
tt_tattLt:ai:a_ 



4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6435 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
-1200- 
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ttaatttatt tttttttaat atttatatta gttattttat ttttttgttt tattgtattt 1260 

gttagttttt aaaattgtgt tttgatagtt atttataagg tttggttttg tgaatttttg 1320 

gtttttggag ttaggaagaa gatttgtttt tatatttatt gttttttatg ggttttattg 1380 

tgaagaggag ttatagtgtg gagggagtgg gaaaattaga tagacggatt tggagagtga 1440 

taaagggagg tttggaaagg ggtggatggg aaggggttta aagaaaaaat tattgtggag 1500 

agaagagaga aaggggagga attggggagg gatatagata aggggaattt ggcgggagag 1560 

gaaagaggta gaaattaatg ggagaggaaa tgggttgaga gagaaattta tagaggaggg 1620 

acgggaaagt cggagaggag taagtgggaa gaaagttatt aggagaagag aaatgttatt 1680 

ttattttttt tttttttttt gatattttgt ttttagtttg tttttttttt ttaaggtttt 1740 

ttgttttttt tttttttttt tagaatatta ttcgtaaggt gagcggttag aagttcgttt 1800 

ataagtttgt gttttatttt gaggtcgtag ggtgttttat tgaggattgt tcgttttagt 1860 

tagaggtgtt tgttattttt attatgttaa atgtggtttt tgttgttata tatgtcgttt 1920 

taggggatat tgtttttgga aagttaggta tatttaaggg tgtaggaatg gtaggtttag 1980 

gcggtttggt acgtagtagt cggaacgagt atatgcgttc gggtttttat tttattttta 2040 

ttatttagtt tttgtagtcg tagttatttt tttattttcg gtttgttgtg gtgtttttta 2100 

gtgtagtttt tgtaggggta gtagcgtttt tttcggggag taggagtatt agtttaagtt 2160 

ttttggaggt ttgtttggag gttgaagagg tcggtttgtt tttgtaggta aggattggaa 2220 

tatagaaatt tagtaagagg aggtagtaaa gaggagggat gtgtaatttt cggatgacga 2280 

gggaggttgg tagaagagat atagttgttt cgttaaagga gataatggtg gaaggaagat 2340 

atttttttag aaggaggggt ggggtagaga gttatataaa ttttgagata gaaaggggag 2400 

aatattgtgt tatagttagt agttgtcgaa tatgtaggtg tttggaatat gtttaatgta 2460 

ttggttttcg tattttagtg tttattagaa ttatttgtag gttttgttaa attatagatt 2520 

gttaggtata tttttagagt ttttgaattt gtggggttgg ggaggggttt aataatttgt 2580 

atttttaatt tgttattaaa tgaggtagat attgttggtt tagaggaagg taaatttttg 2640 

agaattattg ttttaatgaa tgaggatggt attttttaga tttttattcg ttttaggttg 2700 

ttttatttta agttttttgt tttataggag gatttgtttt agaggaagtt ttattttttt 2760 

gacgaagttt tatgttttgt tggaggattt gttttgatgt gtagttatgt tttgagtatt 2820 

aagtgttatt ttttcgggga aattattggg tggtttatag gagaattttg tggttttaat 2880 

aatttttata tatttttttt gtgtttgttt ttttattagg ttattttgat ttcgttcgag 2940 

gttttaaatt tgaaatcgga agagtttaat gtggagtcgg gtttgggtcg ggttttgttt 3000 

ttagaagtga aagtagaagg gtttaaggaa gagttggaag ttgcggggga gagagggttt 3060 

gtgttagaaa ttattaaggt cgagttagaa gtttttttat aggagggcgt gttagttcgg 3120 

ttgttcgcgg ttgttatgga tatcgtaggg taggcgggcg gttatgcggt ttttagtttt 3180 

gagatttttt agtcgtagaa gggtcggaag tttcgggatt tagagttttt atttagttcg 3240 

agtttgttag gtgggtcggg attcgaacgg attttaggat cgggaagtgg tttcggtttt 3300 

taggtttcgg ggtcggcgtt gattttattt ttgtttttta cgtatatatt ggtgagtgtt 3360 

tgcggagggg tggggtgtgt gggcgttagt attaatggga ttggtatgtg gcgggtgatt 3420 

gataggaatt agaggagggt gagtgtggtt gtttttgaag agggtggagt ttttgagtgt 3480 

gattgattgg gttgatgttg gttggaggga tggggtttga ttagtgaagg gtagggttgg 3540 

ggatttttgt gggtgggatc gagggttttt ttgtgggatt gtttttaggt tggggtgtgg 3600 

tttaaggaat ttgagtaggg ttttttggtt tgggatgtgt atatggggtt ttatgggtgg 3660 

gatatgagtt ggaagttgat tttgagttgt tgggatggtg atttgagagg tttgagagta 3720 

gtatttggat agggttttgt gggtggtttt gattaatgag gttttagtaa ttgagggtgg 3780 

gttttcggga gtggggagtg gggggtgttt gtagagtttg agagattata ggtatggttt 3840 

ttatggttat tatttaggtg tgtgtatttt ggggagtgat tttgggagtg tgagtgggaa 3900 

atttgtggtg gggtcgggat taagtttttt ttagtatttt atatttattt tttatttatt 3960 

tgagtttcga ttattttttt tttatagatt tcggtgttgt tgatatttag ttcgttgttt 4020 

tttagtattt atttttggag tattttgagt tttattgcgt ttcgtagttc ggttaagttt 4080 

tttttttagg taggattttt ttttttcgtt ttggataggg tgagagattt aagaatggtt 4140 

tttatttgtg attttttttt ttttttgttt ttagttttta tttagtggta gcgtttaggt 4200 

gtatattttt tttattagcg tggatggttt ttcgattttc gtggtgtttt ttttagggtt 4260 

ttagaagtta tgattattat tattattatt attatttttt ttggggttat tttatttatg 4320 

ttttttttag ttagttattt taaggagaaa tatagtttaa ttgaaagatt tatgttttga 4380 

ttgtggtggg gtggggattt ttgggaagaa ttatttttaa gagtaatttt tattattttt 4440 

tttatagaaa atatatagtt tttataattt ttttgttttt tgttagtttt ttagtggtcg 4500 

tttttatacg ttttttattt taatggtagg ggcggtttat ttatttattt tttgaaggtt 4560 

attgggagga gtttgattta attttttagg gtggttagga tatttttttt atttttttat 4620 

tttttttttt aagataagat aatcgaggtt tggtttgaga acgatttttt tttttttatt 4680 

ttttagtttg tttttgggga gatgagggag ttttgtttgc gtttttggat gtgagtagaa 4740 

gagttagttt gttttgtttt attatttttg gttatattta ggggtttagg aagaatttgt 4800 

attatttaat gggttgggag ttttggttaa ggaagaatta tatttttgga atagaaattt 4860 

ttattttttt aatttttttt ttagatagtt tatttttttt aattaatttt ttggttaggg 4920 

aggaatgttt tttttgtttt tttttttgag aagttatttt tttgtttgtt aatttttttg 4980 

gggttttgtt tgtttttttt taatggaggg tttttttggg gggtggtttt cgtttggggg 5040 

gtttttttag ttagtatttt aggttttttt gttttttttt cgttgttatt ttgatagtat 5100 

aatttatttt taaatggggt tttttaatag gggagaggga gttatttttt tttatatttg 5160 

gtggggtggg tgggaaggaa gggatttggg ggggaatttt tttgtcgttt tttttatttt 5220 

aagtgtttat ttttgatatt aaatatgaat ttttagtttt ttttttttta gttttttaat 5280 
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tttttgcggg cgggtataaa ggattttttt aatgtttttg gagttgggag ggaggaatgg 5340 

gggatataaa gtttgttttg tttttatttt aggtaagaga gagtgggttt aaaagatttt 5400 

tgggtttatt tgttagcgtt ggtttagttt aggttttggg atttgggggt tggtgatttg 5460 

ggggatagtg ttatattcgt ttttattgtt tgttttattt ttttaaaatg gatttttttt 5520 

ttttttaaag agttttagag aatggggaat tgtttttgta aatatatatt ttttaaagtg 5580 

atgttggagg ttgttggtat tttttttttg atgttgttag gattagttga gcgtgatacg 5640 

tatgattgtg tgtggagggt gtgaggtaga ggtcgattgt gagagaaacg gtgtgatggc 5700 

gtgtgtgaga aataatagta gggattgtat atcggtaatt atgtgtgtgt tagagacgtg 5760 

gaaagttgtg attgtgtgtg agagtgtgta gaagttgtga ttatagattt ttgatggtgt 5820 

gtttatgtgg gaaattgtgg ttgtgtttta gaaaataatt gagtttggat gaatagttaa 5880 

ggtattgtat aagaatttgt gatggtatgt ataaaataat gtttacgaga agggaacggg 5940 

gattgtagtt ttgttattgt ttgatggcga gtaagataga ttgggtgatt aaaattatat 6000 

gtgagagatt acgtgggtat gaaattatag tagaggttgt tagggattgt ggtaaagatt 6060 

gtgtaggtga taaattttat tgtaggcgat ttaacgtttg agaaatagat gatgtgtaaa 6120 

attgtttatt tgaggttgta gtgttagtta aagttgtaag attgtatggg tgcgaatttg 6180 

attgtacggg agagaatggt tgggagatta aattagtgat cgagattaaa gatataagaa 6240 

ggtgtaattg tgtgtacgaa agtaggcgtg agagattgta gtaaagattg agaaaaacga 6300 

atggttgtga ttgggagagt gtgataggtt gtgtgtgtgt gtgtgtatat ttgaaattag 6360 

ttgtaaatcg atttattatt attattatag tttattttat aggtttaggg attgtgagaa 6420 

agtaggagag aaaat ~ " 6435 

<210> 312 
<211> 4406 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 312 

ttagtttgga ttttttacgt ttggggaaaa aggttgtatt tggagttcga ggtatttatt 60 

atttgtttat tttaggtaag ggtcgtttat atttaggggt tgaggatatt ttcgatgtag 120 

ttattttagt ttacgggagt tggaggtttt ggttttggtt ttttagatat ggtatcgtag 180 

ggtttcgtgt tttgtaagtt tttaataaat gttggtttta tgggtatttt aagatagaag 240 

agagatcgga atttattttt tttatattta tgtaggttat tttttttagt tgttttttgt 300 

ttttagggta gcggttatta ttttatagtt tggtttgtgg tgttgttgtg gacggttgtg 360 

gtatttgttt ttatgtttat tatcggttgg ggttttattt taggttatat agttattgta 420 

tagtagggcg ggttgtttag gtggtagagt tcgggatttt taagtttttt tacggttaag 480 

ggttttatat ttattagttt tacgtagatg acgaagggtg gtcgggcggt tcggaagtgt 540 

aggatgggga ttcggggttt gggttttttt tgagagtgat agtggttatt agttttgttc 600 

gagtttggag tttgttttcg gagagacgtt tttagggtgt acgtagttat taagttggtt 660 

tatataggag ttagggtttg ggtagtagat aacgtgttag ggttatagat tgcggtaggt 720 

gtttagagta ggaattcggt tttcgttatt ttcgtttgat attcgattgt agaattgatt 780 

agggtgggtt attttaaggt ttagatttac gagggatagc ggggtttttt ttttaagaat 840 

gggggttggg ttttttggag gtttcgagga gttagtacgg tttaggtgga ttttgttaag 900 

ggggaaggtt cggagtagtt gttttttagt ttcggagtag gtggttgatt ttaagggtga 960 

tattatgcgg ttttgtatat ttgagagttt tcgtggtaat agaagttttt tttgattttg 1020 

ttttcgttta cgttatagaa tgttattatt tggggataag tatagaaggg tagttttgga 1080 

taggtcgggg ttagggtttg tgttgtggga tagtacgggg gtagggtagg tagtgttgga 1140 

gggtggtagg atgtttattt tgcgttggtt gtcggtagtt aagttgcggt atagtcggcg 1200 

tttttgtttg ttagcgtttt agatggtgaa ggtcgtttag cggggtaggg tttgtttttt 1260 

taggaagcgg acgcggtggg tttagatggt cgttttgcgg tggagaagag ggtgataggt 1320 

aggtcggagg ttttatagat gtatatagat gttgatatat agggatatga tgtgtgttta 1380 

gtttacgttg ttagtggagt ggatatttgg tttaatgttt aagagagatt ggttgtttta 1440 

tagttaggtt atttgtttta gttggacgta gttatatgtt taggtgggga tttatttttt 1500 

ttattagatt ttaataggtg gggatagatg agaagttgtt atttaagaat ttagaaggtt 1560 

ttggaaggta gaatttttgt aggacgtaga ggattggatt tgatttaagt ttagtttttt 1620 

agagaaattt ttgtttataa ggatatatgt aattatttta taaaatttta gcgtttatat 1680 

ttggatacga ttttttgttt tagggtcgtt gtttcgaatt tttttatttt tttgggaatt 1740 

tggtttttgt tttttttttg ttgtgataag tcgattttag gttgttgttt cggggggagt 1800 

aggttttttt ttatgatttg ttgttttgtt tagcgtttag ttttatcgtt gttcgggttt 1860 

tttttagttt cgtttttttt ttttttatcg tagttttttt ttacgtgttt tatcgttgtt 1920 

tgtttgtgtg tgtttttttt ttaattttgt ttttggagtt gttcgggttg agttcgtttt 1980 

aaagttttag gagttggacg ggttagaagt tagaggggta ggggatgtag aggttaacgg 2040 

ggttagaacg gcgtgggatg tgttttagat gttgtttttt ttaatggtcg cgatgtagtt 2100 

acgttgaagt tcggtatttt atagattttg ttttggtcga gttttgggat gttttttgtt 2160 

gtgttaaggg agttgtgttc gtgggagttt ttttcggggtgggstttggg jttagatataa 2220 
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ttaatattgg 
tcggggtagg 
tgtgattttt 
ggatgggggg 
aggtattgtt 
attttaatag 
ggggatattg 
gtgtatattg 
gtgtatattg 
tgtgtgtgta 
ggtgtgtgtg 
tgtgtatgtg 
gtatagtgtg 
tgttcggagt 
tgtgtagtga 
gtatgtatcg 
tgagcgtgtg 
tatgtggttt 
ttttgtagac 
ttttgttttt 
tttttattta 
agggttttag 
gtttttgata 
gtttgaaaat 
tttggtttat 
tttttagggc 
agtatcggga 
tgttgtaaag 
cggtgggtat 
gatgttttta 
tttggatttt 
taggtttggg 
ttcgggggtt 
ttaatttaac 
ttaaggaggg 
atttgtttta 
acgtggtcgt 



ttggtgacgg 
atttcgttgt 
ttttaatttt 
gatgggggga 
agggaggttt 
ttcggatttt 
aagtatgtgt 

ggggtgtatg 

tgtgggggtg 
tattgtgggg 
tatattgtgg 

tgggggtgta 
tatgtgtggt 
atatgtttgt 
gggtttgtgt 
tgtagatgtg 
tagtgagtac 
gtgtgtacgt 
ggttgcgggg 
agtagttcgg 
ttgatattag 
aagaaagcgg 
tcggtttttt 
gttggtttat 
ttttattttt 
gttttatgtt 
aaggtggaaa 
gagttgaagt 
aagggagagg 
atggggaagt 
agattttttg 
attatcgcgg 
ttgagtattc 
gttgcggtac 
aattgtttgt 
tttttgttac 
tttttatagt 



gttttttttg 
tttacgattt 
gtggtatttt 
tacggggttt 
tgtttggagt 
atttgggata 
gtgtgtgtgt 
gtgtgtgtgt 
tatggtgtgt 
gtgtatggtg 
gggtgtatgg 
tggtgtgtgt 
gagtgtatgt 
atgtattgtg 
atgtagtatg 
tgtatatgta 
gtgtgtatat 
gtagtgtgtg 
atttttttat 
gtgtgttggg 
tttagggatt 
ggggtttttg 
cgacggtttt 
cgtaggaagg 
tatattttaa 
ttttttttta 
tttagttttt 
gttagggata 
agcgcggacg 
tcgtggtttt 
gggagttggg 
tttttatatt 
gtaggggtcg 
gcggtaggta 
tcggatttaa 
ggttttaacg 
gatacg 



seq_prot.txt 
agtagggttt tcgtgggttt 
tgttgttgtt ttcgtttttc 
ttgtttgtag tagaattttt 
tttttgttag tttagggagt 
ttattttttt ttttttaatt 



tagggagatg 
atattgtgtg 
gtgtgtatat 
gtgtgtgtat 
tgtgtgtgtg 
tgtgtgtgtg 
gtatattgtg 
gtatatgagt 
tggatgtgtg 
agtatatgta 
gtgtgtgtgt 
tttcgtgtac 
tgtgtgtgtg 
taggtgacgt 
tgtttcgggg 
ttagggggtt 
tggtcgtggg 
ttgttttttg 
taaaatgtag 
aaagttttta 
tatttttaga 
tttaaaagta 
cgttatcgta 
cgggaggttt 
tttgttaatt 
attttgtcga 
tatatatgat 
agacgtaagt 
ggtggtcgag 
ggaaggaatt 
tagttacgcg 



tttattttag 
ggggtgtatg 
tgggggtgta 
attgtgtggg 
tatattgtgt 
tgtatattgg 

tgggggtgta 

acgtgtatag 
tatacgtatg 
tgtagtgagt 
acgtgagtgc 
gtatgtgtgt 
agaatttggg 
agtttcgtcg 
cggggggacg 

tacgtgatag 
tggtacgcga 
ggttttttta 
aagtagttat 
tttaattagg 
ggaagtcgtg 
taagatagta 
tgaggaggcg 
tagtatgtag 
ttttagtttt 
gagttttaat 
tttagttggg 
ttcggacggc 
atttttgtgt 
ttatatgtag 
gtatttttag 



agtttcggga 
gggataaatt 
ataggtaatg 
tgtagggttt 
ttttgtagtt 
ttagtgattt 
gtgtgtgtgt 
tggtgtgtgt 
ggtgtatgtg 

gggggtgtat 

gggtgtatgg 
tgttgtgtgt 
tgtgtgtagg 
tgtatacgtg 
gtatatgttt 
gtgtgtgtag 
gtatgttgta 
tgaggatagt 
ggttttggta 
agtttgattt 
gaaagtagtt 
ggtttggggg 
ggggtttttg 
gatgtttgta 
gagaataggt 
gtttcgtttt 
tttttggaaa 
gttaaggtgt 
gagtgaggat 
atttcgtgat 
tttattacgg 
gttttgagat 
ggtttgtttt 
ttttatagtt 
ttttatttcg 
attcgggttt 



<210> 313 
<211> 4406 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 313 

cgtgttattg 
ggagtcgtgg 
attcgggtag 
tgtcgcgtgt 
ttttacgggt 
tgggagtcgc 
gttttttagg 
ttacgagttt 
cgcgtttttt 
tttgatattt 
at;tgggtttt 
aagagaggta 
aatgtgaaaa 
tttacgatgg 
tcgtcgagga 
ggtttttcgt 
tttaagttga 
atatattcijg 
ggaagttttc 
tattgtacgt 



tgaggagcgg 
taggggtgag 
atagtttttt 
cgtagcgtta 
atttagggtt 
gatggtttta 
gggtttaagg 
ttttattggg 
ttttttatgt 
tagttttttt 
tatttttttc 
tagggcgttt 
atgaagatga 
gttaatattt 
ggtcggtgtt 
ttttttttgg 
tgttagtgga 
gtt^ttaagg 
gtagtxgttt 

gtatatagat 



ttacgtgggt 
gtaaatcggg 
ttttagaatt 
agttggaaag 
ttcggagttt 
ggtttgtcgt 
tttaaggtta 
ggtattattt 
ttatcggtat 
gtaatatttt 
ggtgttaagg 
tggggagttt 
attaaatata 
ttagattaga 
agaggttttt 

agttttgatt 
tgagga^gat 

ataagatgtt 
cjtaggagttg 
tatatgtatla 



tcgggtttgg gggtgtcgcg 
gtgaaattgt atgtagaatt 
gtgaggatat agaggtttcg 
taggtcgtcg ttcgaggttt 
taagatttta attagggtta 
gatggggttg agattttcgg 
cggggtgggg ttggggggtt 
ttatttttgt atgttggggt 
tttggtcgtt tttttatgcg 
tagaaatgtt gttttgtgtt 
cggggttacg gttttttttg 
gttttttttg attaaatgag 
gatattatgg ttgtttttat 
gatttttagg aaagtttagg 
taaatttcgc gtgttattta 
gtttttttgt tacgtgaatt 
taggttcgtt ttttcgtttc 
aaaattcggc gaggttgcgt 
tttttattta ggtttttata 
gtatgtatat atatacgtgt 
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tggttgcgtt 
ttttttttag 
gttatttgtt 
gcgtttcggt 
tgtgtgggtg 
taagatttta 
ggtaggaggg 
ttttcgcgtt 
gtgacgtgtt 
tttgggaaaa 
aggatgtaaa 
aattttttaa 
attttatttt 
agataagggg 
cggttataga 
ttttggggtt 
gagatattta 
tatttggtgg 
t^atatatata 
a:cgaaagtat 



2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4406 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
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atatacgtat ttattgtata cgtttattgt atatacgtat ttacgtgtat atatattgta 1260 

tgtatatata tttgtacggt atatgtaggt atgtgtattt attgtatata tgtatttata 1320 

ttgtatgtat aggtttttat tgtatatacg tgtatatatg cgtgtatata tatttatata 1380 

gtatatatag gtatgtattt cgaatatttg tatatattgt atacgtgttt atgtatatat 1440 

gtatttatta tatatgtata ttgtgtatat ataatatata tttttatata gtgtatatat 1500 

atattatata tttttatata tatatattat atatttttag tgtatatata tatatattat 1560 

atatttttat agtatatata tatattatat atttttatat agtatatata tatatatatt 1620 

atatattttt atagtgtata tatatatata tatattttta tatagtatat atatatatat 1680 

attatatatt tttatatagt atatatatat atattatata tttttagtgt atatatatat 1740 

atatattata tatttttagt gtatatatat atatattata tatttttata tagtatatat 1800 

atatatatat atgttttagt gtttttaagt tattgattgg gataagtatt tttttgtgtt 1860 

ttaggtgggg ttcggattat tgaggtggtt gtagagggtt ggggaggaaa aggtagattt 1920 

taggtaggat ttttttgata gtgtttaagt tttgtagttt tttgagttga tagaggggat 1980 

ttcgtgtttt tttatttttt ttattttatt gtttgtgggg gttttgttat aaataggggg 2040 

tgttataggg ttggggggga gttataagtt tatttcggga ggcggaggta gtagtagaat 2100 

cgtagaatag cgaaattttg tttcgatttc gaggttgaat ttacgagggt tttgtttagg 2160 

ggaggttcgt tattagttag tgttggttgt gtttggttta gattttattt cggaggagat 2220 

ttttacggat atagtttttt tgatatagta gagggtattt taggattcgg ttagggtagg 2280 

gtttgtgggg tatcgggttt tagcgtggtt atatcgcggt tattggggag gatagtattt 2340 

gggatatatt ttacgtcgtt ttggtttcgt tggtttttgt attttttgtt tttttagttt 2400 

ttggttcgtt tagtttttgg ggttttgaga cgggtttagt tcgggtagtt ttaggagtag 2460 

ggttgaggga gaggtatata taggtagata gcgatggggt acgtggggag gggttgcggt 2520 

gggaggagag agggcggggt tgggaagggt tcgggtagcg atggggttgg gcgttgggta 2580 

gagtagtagg ttatgggaag gagtttgttt ttttcgggat agtagtttgg ggtcgatttg 2640 

ttatagtaga gggagaatag agattaggtt tttaggggag tggaagggtt cggggtagcg 2700 

attttgaggt agaaggtcgt gtttaagtgt gaacgttggg attttatggg atggttgtat 2760 

gtgtttttat ggataaaggt ttttttggga aattggattt gggttaagtt tagttttttg 2820 

cgttttatag aaattttgtt ttttagaatt ttttagattt ttaagtgata gttttttatt 2880 

tgtttttatt tgttggggtt taatggaagg ggtggatttt tatttaggta tgtggttacg 2940 

tttagttggg gtaggtggtt tgattgtagg atagttagtt ttttttaggt attgggttag 3000 

gtgtttattt tattggtaac gtgagttgag tatatattat gtttttgtat gttagtattt 3060 

gtgtgtattt gtggggtttt cggtttgttt attatttttt tttttatcgt aggacgatta 3120 

tttggattta tcgcgttcgt tttttggaag agtaggtttt gtttcgttgg gcggttttta 3180 

ttatttggaa cgttggtaag taggggcgtc ggttgtatcg tagtttgatt gtcggtagtt 3240 

agcgtaaggt gagtattttg ttatttttta atattgtttg ttttgttttc gtattgtttt 3300 

atagtatagg ttttggtttc ggtttgttta gagttgtttt tttgtgtttg tttttaggtg 3360 

gtggtatttt gtgacgtgga cgagaataag attaggaaag gtttttattg ttacgaggat 3420 

ttttaggtgt gtagggtcgt atggtgttat ttttggggtt aattatttgt ttcggggttg 3480 

agggatagtt gtttcgagtt tttttttttg gtagagttta tttgggtcgt gttggttttt 3540 

cggggttttt agggagttta gtttttattt ttggggggag aatttcgttg tttttcgtgg 3600 

gtttaggttt tggggtagtt tattttggtt agttttgtag tcgggtgtta ggcgggggtg 3660 

gcgggagtcg ggtttttgtt ttgaatattt gtcgtagttt gtgattttgg tacgttattt 3720 

gttgtttaag ttttggtttt tgtgtgagtt aatttggtgg ttgcgtgtat tttgagggcg 3780 

ttttttcggg gataagtttt aggttcgggt aggattgatg gttattgtta tttttaggaa 3840 

agatttaagt ttcgaatttt tattttgtat tttcgagtcg ttcggttatt tttcgttatt 3900 

tgcgtgaagt tggtgagtgt ggggtttttg gtcgtggaag ggtttgggga tttcgggttt 3960 

tgttatttgg ataattcgtt ttgttgtgta atgattgtgt ggtttgggat gagattttag 4020 

tcgatggtgg gtatggggat aggtgttata gtcgtttata atagtattat agattaggtt 4080 

gtggggtggt ggtcgttgtt ttgggggtag gaggtagttg gagagggtgg tttgtatggg 4140 

tgtggggggg atggatttcg gttttttttt tgttttaaaa tatttataaa gttaatattt 4200 

gttagaaatt tgtaaagtac gaggttttgc ggtgttatgt ttgaggggtt aaggttagga 4260 

tttttagttt tcgtggattg aggtggttgt atcggggatg tttttagttt ttgagtgtga 4320 

acgatttttg tttggggtgg gtaggtgatg ggtgtttcgg gttttaaatg tagttttttt 4380 

ttttaagcgt ggggagttta ggttgg 4406 

<210> 314 
<211> 4417 
<212> DNA 

<213> Artificial sequence 
<220> 

<2 2 3> chemically treated genomic DNA (Homo sapiens) 

» *■ • 

<400> 314 S * 1 ' * ' 

1 1 - I « • I : . . . • . 

gaaatttcgt ttttattaaa aatacgaaaa gttattcggg tttggtggcg ggcgtttgta 60 

ggagaatgta gtgaatttgg gagcgggagg ttgtagtgag tagagatcgc gttattgtat 120 

£*Ltagtttgt--acgatagagc-gagatt 180— 
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agaaaaagaa aagagggtgg agatggggga tgatatttag tttaggaggt gtttatggtt 240 

tggtttttcg tggggagaag gaaggttata cgattggtgt ggtttagggg gtagggtttt 300 

agtattttaa gtcgagtttt tttttttatt ttttgaggtt ggtattggta gtttaggtgg 360 

gggtttgggg atttgaatgg gatgaatggg ttaaggggga tgttattttt tttatttttt 420 

tttatttatt gttttttttt tttttttttt tttttttttt tattttcggt ttatttattt 480 

tttattttag ttaggagtcg ttatttaagt agaaaagggg tttttggaaa gggggcgggg 540 

ttttgatttt tgggtatttt gcgtttgaag aggaattttg ggaaggggtt gtttagggat 600 

ttggtttttt ttttatttgg ttgtataatt tttttgtttt tttttttttt tagtcggttg 660 

tttttttttg ttttgagatg ttaggaaaga gggggttatt tgcgtttttt atagtggttt 720 

tcgaagtttg gggtttttag ttttagagtt tagaggtgaa ggaggtgtta tagttttggt 780 

gattattggt tgttttttag ttttttttat gttatttttt tttaaaataa taaaataaat 840 

aaataaaatt gggtttggcg tagtggttta tgtttgtaat tttagtattt tgggaagtcg 900 

agacgggcgg attataaagt taggagatta agattatttt ggttaatatg gggaaatttc 960 

gtttttatta aaaaatataa taaattagtt aggcgttttg gtgggcgttt gtagttttag 1020 

ttatttggga ggtcgaggta ggagaatggt aggaattcgg gaggcggatt ttgtagtgag 1080 

tcgagatcgc gttattgtag tttagtttta ataatagagc gagatttcgt tttaaaaaaa 1140 

aaaaaaaaat tgattggaat attttttaag atgtaagatt tttagttttt tagagtttta 1200 

taggaaggat ggtagagtgt agttgtttag agttgaagtt ttatttttgt tatttgttgg 1260 

ttgtgtgatt aggtataaat tattaatttt tttgagtttg tattttatta tttgttgtta 1320 

ttgagtaata gtagtggatt attttttatt tttattttta ttttatttat tttatttttt 1380 

tttgagacgg agtttcgttt tgttatttag gttggagtgt agtggcgtga tttcggttta 1440 

ttgtatttat gtttttcggg tttaagcgat ttttttgttt tagttttttg agtagttgga 1500 

attataggcg cgtattatta tatttagtta attttttttt ttttttttag tagagatagg 1560 

gttttattat gttggttagg ttggtttcga agttttgaat ttaggtgatg tatttgtttc 1620 

ggttttttaa agtgttggga ttgtaggtat gagttattgc gtttggacga acgtttttaa 1680 

ttatattatt attattatta ttattattat tattattaat ttttgagatg gagttttgtt 1740 

tttgttgttt aggttgaagt gtaatggcgg gattttggtt tattgtaatt ttcgtttttt 1800 

aggattaaga gatttttttg tttaagtttt ttaagtattt gggagtatag gtatgtatta 1860 

ttacgttcga cgatattttg taattttagt agagatgggg tttttttatg ttggttatgt 1920 

tggtttggaa ttttttattt tagttgattt atttatcgta gttttcgata attataggcg 1980 

tgagttatta cgtttagtcg tttattttgt tttttattag agttttgtat tgtgattttg 2040 

tataaaatag tttggaagtt ggatttattt tgtgtgtgtg attgttttga gtttatagaa 2100 

agatattttt agagtgcgga ttgagaagtt tttattgtgg gaggatcggg gtgttttagg 2160 

gtttcgggag acgggatgga tttggaaggt tggggggagg ggtttttgag gaagaggagt 2220 

tttggaagcg ggggttatta taggttaagg ggtggttttt gggattttcg tagttagtgg 2280 

tgttgcggcg gtagagtgta tattgatagt tgagagttac ggcgtaggag attacggggt 2340 

ttacgtcgcg cgggtagtta gggagtcgga tggattcgaa gcgtatatcg cggtagttgt 2400 

atattatttg aggtagggtc ggtaggattt tttgtagtac gcgggtttgg aagtcgtgtg 2460 

agtgggggaa tgagtatgtg tttggggtta gcgttttagt tgagtttttt agttgttttt 2520 

ataggtttta gatttagggg tttaggaagt tttttgtttt tgttttttta tattttattt 2580 

cgtagttttt gattagagag gtagattatt tttttttttt cgttgttttt gtgggtttgg 2640 

ttttgaggtg gtagtatttg ttttggtttc gggtagttta ttatggtggg gtagtagtcg 2700 

gtatagatgg tggtgttgac ggtgatgtat acggggtagt tttttttttt tatagttagg 2760 

gtggtattga tggggcggta tcgtggtcga agcggttttt tggatgttta tgtttcgttt 2820 

atgtttagta gtagtaatag tagtagtttt tggggtaagg atattgtttt attcgggttt 2880 

gagattgtag tttttagttt tggttttttt atttcgtatg gtatattatt tataaagatt 2940 

tagagatttt tttcggtatt ttttatttag gatttattat tcggatattt gttttttaga 3000 

gtttatttta tagtttagag gatttgagat attttaatat tttagattcg tatttttagg 3060 

aattgattta tttgaagttt attgggggtt acgttttttt agaaagaggt ttttttttat 3120 

agtttatacg ggtttgtttt tttttatgtt agtgatggtt tggaaggagg tggaaggtgt 3180 

ttaggggttt tgtagttttt tttggaatat ttttattttt ggtgttcgga aatgtggatt 3240 

tattttattt ttgatatgtt ttttttttag cgggatattt ttcgtaagta ttgggaatgt 3300 

ggatatggaa agtaaattga gttttcgtgg gggagtgaga tagggagtga ggggtgttgg 3360 

acgcggtacg ggaatttggt tagagttagc ggatttaatt ggttgttttt ttttagatgt 3420 

agtttttttt ttttttttta gggggcgtta cggaacgtag ggtttttatt ggttttgggg 3480 

attgggtgac gttagggatg agttttttgt gattggtttt attattttgc gtaagattaa 3540 

agggaagaaa ggatgggttc gataatcgga gttattgtgg tttaggtcga ttgagtttgt 3600 

tttcgggttt ttaaggttag gcgagggtgt gagggattgg agttcgaggt taatttgggt 3660 

tttattggta ggaaaaaaaa aaaaaatttt tttatcgttt tttatataat aataaaatat 3720 

aaagggagga cgttttgata ggaagaaatg atattttttt aagtgttttt aaattatttt 3780 

aatgtatttt tttttttttt tttgggatat cgagttttgt tcggttttcg ggggattttt 3840 

ttgtttgtta gttatggcgg tcgttatttg tggtaaatag aatggcggiga gtaagtggga 3900 

tgttttatta ttcgggttga ggtaggagaa ttataggaat ttaggaggtg gaattttttc 3960 

gtttaggagg ggcgtggttt cggatttagt ttttgtttaa tgagagaggg ttttttcgtg 4020 

attgtgttgt taggttagtt atttgatttt ggtgtcgtta acgagggatt tagttcgagt 4080 

tttatttttt ttttagggat ttttgtttat tttattttta agttaggatg ttcggagcgg 4140 

ttttcggaaa tgc^tgtgtt tcgggtgatt t^attgatta ttgaataggt tcgtaggagg 4200 

tgtgttttgt tttcgaggtc gtagtttcgg aggatattat ttggatttag tttttttttc 4260 
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gcgatgttat taagttggat aattttaagg tttgtttttt ttttaatgta gggtatagga 4320 

tggtataggg aagagggtta gaaatttgat ttgagttttt cgtttagggt tgaatttttt 4380 

agtattttga ttatttttta ttgaattttg tttatat 4417 

<210> 315 
<211> 4417 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 315 

gtataagtaa ggrttaatga gaagtgatta ggatgttgga gagtttagtt ttgggcgggg 60 

agtttaagtt aggtttttag tttttttttt tgtgttattt tatattttat attgggaaag 120 

aaatagattt taaaattgtt tagtttgatg gtatcgcggg gaagggatta agtttagata 180 

atgtttttcg aggttgcggt ttcgggggta ggatatattt tttgcgggtt tatttaataa 240 

ttagttaaat tattcgaagt atacgtattt tcggggatcg tttcgggtat tttggtttga 300 

gggtagagtg ggtagaggtt tttaagggag aggtggggtt cgggttgaat ttttcgttgg 360 

cggtattagg gttaagtggt taatttggta gtatagttac ggggaggttt ttttttattg 420 

ggtagaaatt aagttcgaag tcgcgttttt tttgggcgag gaggttttat tttttaggtt 480 

tttgtgattt ttttgtttta gttcgagtag tgggatattt tatttgtttt cgttattttg 540 

tttattatag gtgacgatcg ttatggttga taggtaggga ggtttttcga ggatcgagta 600 

aggttcggta ttttaaaaaa aaaaaaaaag atatattgaa gtaatttaaa aatatttagg 660 

aagatgttat ttttttttat taaggcgttt tttttttatg ttttgttgtt atatagggaa 720 

cgataaaaaa attttttttt ttttttttat taatggggtt taggttgatt tcgaatttta 780 

gttttttata ttttcgttta gttttgagag ttcgagggta ggtttaatxg gtcggagtta 840 

taatggtttc ggttgtcggg tttatttttt tttttttttg attttacgta gggtgatgga 900 

gttaattata agaggtttat ttttgacgtt atttagtttt tagggttagt gagggttttg 960 

cgtttcgtgg cgttttttgg agggaggaag gggaattgta tttgagagag agtagttaat 1020 

tgggttcgtt gattttggtt aggttttcgt gtcgcgttta atatttttta ttttttgttt 1080 

tattttttta cggagattta atttattttt tatgtttata tttttagtgt ttgcggaaga 1140 

tatttcgtta agagagagat atgttaaagg tagggtagat ttatattttc gggtattaaa 1200 

gatggagatg ttttaggaaa gattgtaggg tttttgggta ttttttattt ttttttaggt 1260 

tattattggt atgagaaggg gtagattcgt gtgagttgtg gaaggaggtt tttttttgga 1320 

ggagcgtgat ttttagtaag ttttaggtgg gttagttttt gagggtgcgg atttgaaatg 1380 

ttggggtatt ttaggttttt tgggttgtgg ggtgggtttt gaaaggtagg tgttcgggtg 1440 

gtgggttttg aatagaagat gtcgggaagg gtttttgggt ttttgtgggt ggtgtattat 1500 

gcgggatggg aaggttagga ttgggggttg tagttttaga ttcgggtgaa gtagtgtttt 1560 

tgttttagag gttgttgttg ttgttgttgt tgagtatggg cgggatatgg gtatttaagg 1620 

agtcgtttcg gttacggtgt cgttttatta atgttatttt ggttgtggag aaggagggtt 1680 

gtttcgtgtg tattatcgtt aatattatta tttgtgtcgg ttattgtttt attatggtga 1740 

gttgttcggg gttagggtag gtgttgttat tttagggtta gatttataga ggtagcgggg 1800 

gaggaagggt ggtttgtttt tttggttagg ggttgcggaa tggggtgtgg gagggtagga 1860 

atagagggtt ttttggattt ttgagtttga gatttgtggg ggtagttggg gagtttagtt 1920 

gaggcgttgg ttttaggtat atgtttattt ttttatttat acggttttta gattcgcgtg 1980 

ttgtaggggg ttttgtcggt tttgttttag gtggtgtgta attatcgcga tgtgcgtttc 2040 

gagtttattc ggttttttgg ttgttcgcgc ggcgtgaatt tcgtggtttt ttacgtcgtg 2100 

gtttttagtt gttaatgtgt attttgtcgt cgtagtatta ttgattgcgg gggttttaag 2160 

gattattttt tgatttgtga tgattttcgt ttttaggatt tttttttttt aaaggttttt 2220 

ttttttagtt ttttaagttt atttcgtttt tcggggtttt aggatatttc gattttttta 2280 

taataaaggt tttttaattc gtattttgga ggtgtttttt tgtgggttta gggtaattat 2340 

atatataggg tgggtttagt ttttaaatta ttttatatag agttatagta tagaattttg 2400 

gtagaaaata gggtggacgg ttgggcgtgg tggtttacgt ttgtaattgt cggaggttgc 2460 

ggtgggtgga ttagttgagg tgaggggttt tagattaata tgattaatat ggagaaattt 2520 

tatttttatt aaaattataa aatatcgtcg ggcgtggtgg tgtatgttta tatttrtagg 2580 

tatttgggag gtttaggtag gagaattttt tgattttagg aggcggaggt tgtagtgagt 2640 

taagatttcg ttattgtatt ttagtttggg taataaaaat aaagttttat tttaaaaatt 2700 

agtaataata ataataataa taataataat aatataatta aaaacgttcg tttaggcgta 2760 

gtggtttatg tttgtaattt tagtattttg ggaggtcgag gtaggtgtat tatttgagtt 2820 

taggatttcg agattagttt ggttaatatg gtgaaatttt gtttttatta aaaaaaaaaa 2880 

aaaaaattag ttgggtgtgg tggtgcgcgt ttgtagtttt agttatttag gaagttgagg 2940 

taggagaatc gtttgaattc gggaagtata ggtgtagtg<at gtcgagatta cgttattgta 3000 

ttttagtttg ggtgataggg cgagatttcg ttttaaaaaa aaataaaata aataaaataa 3060 

aaataaaa^t agaaaatagt ttattattat tatttaatag taatagatgg taaaatatag 3120 

gtttag^gaa attaatgatt tgtgtttggt tatatagtta ataaatggta gagatgggat 3180 

tttazyttttg ggtaattgta ttttgttatt ttttttgtaq aattttgagg aattgagagt _3£ML 
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tttgtatttt ggaggatgtt ttaattaatt tttttttttt ttttgagacg gagtttcgtt 3300 

ttgttgttga gattggattg tagtggcgcg atttcggttt attgtaagat tcgtttttcg 3360 

ggtttttgtt atttttttgt ttcggttttt taagtagttg ggattatagg cgtttattag 3420 

gacgtttggt taatttgttg tattttttag tagagacgga gtttttttat gttagttagg 3480 

atagttttga ttttttgatt ttgtgattcg ttcgtttcgg ttttttaaag tgttgggatt 3540 

ataggtatga attattgcgt taggtttaat tttgtttgtt tgttttgttg ttttgggaaa 3600 

gggtggtatg gagaagattg ggaaataatt agtggttatt agagttgtag tatttttttt 3660 

atttttgagt tttggggttg ggaattttag gtttcggagg ttattgtgga ggacgtaggt 3720 

ggtttttttt tttttgatat tttaggatag ggaggggtaa tcggttaggg gaagaaaaga 3780 

ataggaaggt tatatagtta ggtggggaag gggttaaatt tttgaataat ttttttttag 3840 

agtttttttt taagcgtagg gtatttaaga gttaaggttt cgtttttttt ttagaggttt 3900 

tttttttatt taggtgacgg tttttggttg ggatgggagg tagatggatc gggggtggga 3960 

ggggggggag gaggggaagg gagaggtagt ggatgaagga gaatggaaga gatgatattt 4020 

tttttggttt atttatttta tttaggtttt raagttttta tttggattgt tagtgttaat 4080 

tttagagggt ggaagaggga gttcggttta gaatgttgag gttttgtttt ttgggttata 4140 

ttaatcgtgt ggtttttttt ttttttacgg aaggttagat tatggatatt ttttgagttg 4200 

gatgttattt tttattttta tttttttttt tttttttttt tttttttttt tttttttttt 4260 

ttttgagatg gagtttcgtt ttgtcgtgta ggttggagtg tagtggcgcg atttttgttt 4320 

attgtaattt ttcgttttta ggtttattgt atttttttgt aagcgttcgt tattaagttc 4380 

ggataatttt tcgtattttt agtagagacg gggtttt 4417 

<210> 316 
<211> 4420 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA CHomo sapiens) 
<400> 316 

tttttggtag ttttgggggt ttttttttgt atttcgaagg aaggggaggc gtgtggttgg 60 

aggtgagagg ttttgaggtt ttgtttggtt tttagttttt taaattcgtt tttttttggg 120 

tcgtcgtttt tagagattgt tgaggatacg agttttttat ttacgggttg ttttaacgga 180 

gtcgggttag gggtttttgg atagaggtcg tttgtttttt tgggttttgg ggtttgtttt 240 

ttttagtagt ttattcgggt ttttggggtt agggtatcgg gtagaagtta gaggacggtg 300 

cgtttgggat ttgagttttg ggtagttttc ggggtttagg atttttggtt tttttgttta 360 

gttttatgtg ttttgtgttt tgaaattacg tgtggtttta tagttgtgtt tttagaagtc 420 

gtcgtggttg tcgtgaggta gtagtggacg ggacgtttac ggtttttttt atgttatgtt 480 

gtttttagtc gggttttatt atttttttat aagttgaaag gaagtggttt ttttattagt 540 

tttggttggg agttttgtgg gcgaggtttt ttttaggagt tgttttttta agtgttttag 600 

gtattaataa gattttacgc gtgttttttt tatagtcggt ttcgtttagg cgtcgttgtt 660 

ttagtagagg tgggtggggg gtagtgtgga gttgtattgc gaggtcgtgg gtagttcggt 720 

gttcgagatt tagtggtggt ttgaagggta gggttttaac gatatttgtt tttagttttg 780 

ggacggcgtt cggttggatc gcgtttatat ttacgttatt tattattagt acgcggttag 840 

tattattttt atcgatacgt tcgtggagga ggatacgggt atttacgagt gtcgggttag 900 

taacgattcg gatcgtaatt atttgattcg ggcgtttagg gttaagtggg ttcgcgttta 960 

ggtagtcgtg ttagttttgg aacgtgagtg gcgggtattt tttttttcgt tttttttagt 1020 

tttttttttg tgtcgttcgt ttttcggttt ttgtttagta gaatttagac gttttttttt 1080 

tttttcgttt cgttgtgttt cgttgggttt atcgtttgga cggaggggtt cggatttgaa 1140 

gggggtgggt tcgtcgtttg attattagcg ttgggcggtt tggttcgggg taggtagggt 1200 

tttgtttttt agtaggtggg ttttcgtttt tttttcgatt ttcgtgttat tcgtttcgtc 1260 

gtttttgttt tttcgggcgt ttgttcggtt ttttttttga gcgtttatat tgtaagtttg 1320 

aggggttttt taagtttagt ttaggtttgt aggttttaga attttttgtg tttttcgtag 1380 

tatttcgttt taagggattt ttaaatttta atttttagcg ggaatattgt aaattaatta 1440 

agaattaaga gatttgtttg gttggggatt ggtagaagaa tcggggtgtt tttagtattg 1500 

ttggagtgtc gttagtttgg ggttgggggg atgtggggtt atttgagtgt tttttagaga 1560 

agggtacggg agttttggtc gggggtgttg tggttatttt tttttttttt tttggagatg 1620 

gagttttgtt gtttttaggt ggagtgtggt ggtataattt cggtttattg taattttcgt 1680 

ttttcgggtt taagcgattt ttttgtttta gttttttgag tagttgggat tataggtatg 1740 

agttaatatg tttagttaat tttttgtatt tttagaagag acggggtttt atcgtgttag 1800 

tta:ggatggt ttcgattttt cgatttcgag atttattcgt tttggttttt taaagtgttg 1860 

ggattatagg cgtgagtttt cgcgttcggt tattatcgcg tttagttggt tacgttttta 1920 

gaaaiagaatt tttagtttgt gttggggttg ttataagagt tcgatgtttt ggtttttgtt 1980 

tagggacgtg ggtgggtgtt ttgtttgttt ttatattttg gttttagttt cgtgtgtttt 2040 

tgggtaggaa tatcgttagg cggggttttt gggtttttag cgttttagtt tagggttcgt 2100 

tagttgttag cgaagggtgt tttttcgaag ggggtgtttt ttcgaagggg tttttttagt 2160 

aggcgttggg tttttttttt ttcggaggcg tggttagaag ttttttttgg gttttcggtt 2220 
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ttcggggcgt aagggttacg gtgtattttt gggatgaaaa ttagtttttt tcggggagga 2280 

gtcgtaggtt tttgggggtt aggtaggtag cgcgggtcgt gtttttttat tttgttgatc 2340 

gcgtttcgtc gggttttgta gtcggtatag tttttattat cgtagaagat tttggtttta 2400 

agatattttt tatttgtttt ttgaatgata gcgttataga ggttataggg tatcgttggt 2460 

tgaagggggg cgtggtgttg aaggaggacg cgttgttcgg ttagaaaacg gagtttaagt 2520 

gagtgtttga ttacgttatg tcgttatttg ttttttttta cggttttttt gtcgttagcg 2580 

gtttgtggtt cgtgagaata aaagatcggt cggtaggttt gatttaggcg gaagttaggg 2640 

attagtttcg cgtagatttt tagagggaaa ttttagggag gggtttgagg ggcgtttttg 2700 

tgaggtaggg ggttttagta gttttggtag gagtttttat tgtttgtttt ttattaggtt 2760 

tcgggtacga aagggcgtta tattgttagg tttcgggtac gtgtgcgggt tttttaggtt 2820 

ttgtgttttt ggtataagag ggtattatat tgaggtatgt tttgttcgga ttttagacgt 2880 

tgttatggtc ggagttttgt tataattggg gttgggacgg ttttgggagt atagagtttt 2940 

ttgtttttag gttgattggg aagacggtta tagggtgtgg ggtgtagagt agggattaga 3000 

ggtgtggggt ttattagtcg gggagaaggg ggtttgttgg tttttttatt tgtggggttt 3060 

tgttagtttg gttttgtttt tggggtcgta cggggatttt aggagttttt ttgaggttta 3120 

gatttgattg ggatagtttt gtttttttat tgtgtcgtgg ttgggttttt ggagaatttt 3180 

gggtttttgg aggcgttttg tagagttggt acgcggttta tttgtttgtt gtggttgtag 3240 

ggtggatttc gacgattagt ggggagagta tttttgcgtt ttttttttcg agtttatggg 3300 

tacggttaat atttagtttt acggtgagtt ttgtagttag gggtatcggg tattatcgat 3360 

tgtcgggaga agttgttggt ttgaggtatt cggtatattt tagagttttg gttttttgtt 3420 

*"£ ggaggg gaatagtttt tttgcgggag gtcggggatg ggggcgggtt tgcggtttta 3480 

ggtttttttt ttttattttt ttgttagggt tttttagagt gaaggttgtg aagtcgttag 3540 

aatatattaa cgagggggag acggttatgt tggtttgtaa gttagagttc gtgttatttg 3600 

ttattgattg ggtttggtat aagattattg attttgagga taaggtgaga agttaaggag 3660 

gttgggggtt ttggatttag tttttaggat tgggtgagag gtttagattg ggggtttcgg 3720 

atttagtttt taggattggg tgaggggttt agattggggg tttcggattt agtttttagg 3780 

attgggtgag gggtttagat tgggggtttc ggatttagtt tttcggattg ggtgaggggt 3840 

ttagattggg ggtttcggat ttagtttttc ggattgggtg aggggtttag attgggggtt 3900 

tcggatttag ttttttggat tgtagttttt tagattgggt gaggggttta gattgggggt 3960 

ttcggattta gtttttagga ttgggtgagg ggtttagatt gggggtttcg gatttagttt 4020 

ttcggattgg gtgaggggtt tagattgggg gtttcggatt tagtttttcg gattgggtga 4080 

ggggtttaga ttgggggttt cggatttagt ttttaggatt gggtgagggg tttagattgg 4140 

ggg "" g9a tttagttttt gtttttggtt ttttaggttt ttatgaacgg tttcgagagt 4200 

aggtttttcg tgagtttttc gtagggtcgg ttagagttat atattgagaa tttgaatatg 4260 

gaggtcgatt tcggttagta tcggtgtaac ggtattagtt ttaagggttt cgattaggtt 4320 

attattacgt ttcgcgtgcg tagttatttg gtcgtttttt ggtttttttt gggtatcgtq 4380 

gttgaggtgt tggtgttggt tattattatt tttatttacg 4420 

<210> 317 
<211> 4420 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 317 

cgtagatgaa gatgatggtg attagtatta gtattttagt tacgatgttt aggaagggtt 60 

agagggcggt taggtggttg cgtacgcgga gcgtgatgat ggtttggtcg gagtttttgg 120 

agttggtgtc gttgtatcgg tattggtcgg ggtcggtttt tatgtttagg tttttaatgt 180 

HESSE? 9 * tc gg tt: ttgc gaggaattta cgaagaattt gttttcggag tcgtttatga 240 

gggtttaggg gattaaaggt aagggttgag tttaggattt ttagtttagg ttttttattt 300 

taScoiol? ?S?a2252 825S555SS " taggtttt ttatttagtt cggagggttg 360 

agttcgggat ttttagttta ggttttttat ttagttcgga gggttgggtt cgggattttt 420 

^ii^SSi* ttttat ttag ttttgagggt tgggttcggg atttttagtt taggttrtttt 480 

atttagtttg gagggttgta gtttagaggg ttgggttcgg gatttttagt ttaggttttt 540 

tatttagttc ggagggttgg gttcgggatt tttagtttag gttttttatt tagttcggag 600 

ggttgggttc gggattttta gtttaggttt tttatttagt tttgagggtt gggttcggga 660 

SE^gS* a ggtttttta tttagttttrg agggttgggt tcgggatttt tagtttaggt 720 

HHB attta 9 tttt: 9 a ggg "gggtttag gatttttagt ttttttggtt ttttattttg 780 

tttttagagt tagtgatttt gtattaggtt tagttagtga taggtggtac ggattttgat 840 

H?H gatta gtatggtcgt ttttttttcg ttgatgtgtt ttgacgatrtt tatagttttt 900 

5HI9 0fla S 9^ ttt: g a tag gagggtgaga gagaggagtt tggaatcgta ggttcgtttt 960 

"F^SB 99 aflOOttgttt ttttttaggg agtagggggt tagggttttg 1020 

?2««2B£ Cg Sgtgttttag fl«aataatt t^tttcgata gtcggtggtg ttcggtattt 1080 

n* gg " g * ag ga "J a ^ g 3 flgagttggat gttcjgtcgtg tttatgggtt cggggaggaa 1140 

gacgtaggag J«tt_ttt.tt ^ttggtcgtc ggagtt$att ttgtaattat agtaggtagg 1200 _ 
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tgagtcgcgt attagttttg taggacgttt ttaaggattt agggtttttt aggggtttaa 1260 

ttacggtata gtgaaaaagt aaaattgttt tagttaagtt tgaattttaa aagaattttt 1320 

ggggttttcg tgcggtttta aggatagagt taggttggta agattttata ggtgagagaa 1380 

ttaatagatt tttttttttt cgattgatga attttatatt tttggttttt gttttgtatt 1440 

ttatattttg tgatcgtttt tttagttagt ttgaaaataa aaggttttgt gtttttaggg 1500 

tcgttttagt tttagttatg ataaagtttc ggttatgata gcgtttgggg ttcgagtagg 1560 

gtatgtttta gtgtggtgtt tttttgtgtt aggagtatag agtttgggag gttcgtatac 1620 

gtgttcggga tttggtagtg tggcgttttt tcgtgttcgg gatttagtga ggagtaagta 1680 

atggggattt ttgttagggt tgttggagtt ttttgtttta taaggacgtt ttttagattt 1740 

ttttttgggg tttttttttg ggggtttgcg cggagttggt ttttaatttt cgtttggatt 1800 

aagtttgtcg atcggttttt tgtttttacg gattataggt cgttggcggt aagagagtcg 1860 

tgagaagggg taggtggcgg tatggcgtgg ttaggtattt atttgaattt cgttttttgg 1920 

tcgggtagcg cgtttttttt tagtattacg ttttttttta gttagcggtg ttttgtgatt 1980 

tttgtggcgt tgttatttaa ggagtaggtg aggagtattt tggagttaag gttttttacg 2040 

gtagtgaaga ttgtgtcggt tgtaaggttc ggcgagacgc ggttagtaga gtggaggagt 2100 

acggttcgcg ttgtttgttt gatttttagg aatttgcggt tttttttcgg gaaggattgg 2160 

tttttatttt aaaaatatat cgtggttttt acgtttcgag gatcggaggt ttaaggggaa 2220 

tttttggtta cgttttcggg aagggaagag tttagcgttt gttgaggagg trttttcggg 2280 

aaggtatttt tttcgggaag gtatttttcg ttggtagttg gcgggttttg ggttaaggcg 2340 

ttgggaattt agaggtttcg tttggcgatg tttttgttta gagatatacg aggttgggat 2400 

tagggtgtgg ggatagataa ggtatttatt tacgtttttg agtagggatt agggtatcga 2460 

gtttttgtag tagttttagt ataaattgga aatttttttt taaaaacgta gttagttggg 2520 

cgcggtggtg gtcgggcgcg gaggtttacg tttgtaattt tagtattttg ggaggttaag 2580 

gcgggtggat ttcgaggtcg ggagatcgag attattttgg ttaatacggt gaaatttcgt 2640 

tttttttaaa aatataaaaa attagttggg tatgttggtt tatgtttgta attttagtta 2700 

tttaggaggt tgaggtagaa gaatcgtttg aattcgggag gcggaagttg tagtgagtcg 2760 

agattgtgtt attatatttt atttgggaat agtaagattt tatttttagg aaaaaaaaaa 2820 

aaaatagtta tagtattttc ggttaaggtt ttcgtgtttt tttttaggag gtatttaggt 2880 

ggttttatat ttttttagtt ttaggttgac ggtattttag tagtgttggg agtatttcgg 2940 

tttttttgtt aatttttaat taagtaaatt ttttggtttt tggttagttt ataatatttt 3000 

cgttaggaat tgaaatttga aaatttttta gaacgagatg ttgcgaagga tatagaaggt 3060 

tttgagattt gtaagtttgg gttgagtttg gagggttttt taggtttgta gtatggacgt 3120 

ttagggaggg agtcgggtag gcgttcgggg aagtagaaac gacggaacgg atggtacgag 3180 

ggtxgagggg aggacgagga tttatttgtt ggagggtaga gttttgtttg tttcgagtta 3240 

ggtcgtttag cgttggtggt taggcggcga gtttattttt tttaggttcg ggttttttcg 3300 

tttaggcggt gggtttaacg gggtatagcg ggacggagag gggaggaggc gtttgggttt 3360 

tattgagtag gagtcgggag gcgagcggta taggagggaa attgagggag gcggggaggg 3420 

aggtgttcgt tarttacgtt ttaggattag tacgattgtt tgggcgcgga tttatttgat 3480 

tttgggcgtt cgggttaggt ggttgcgatt cgggtcgttg ttggttcggt attcgtaagt 3540 

gttcgtgttt ttttttacga gcgtgtcgat ggagatggtg ttggtcgcgt gttggtggta 3600 

ggtggcgtgg atgtggacgc ggtttagtcg ggcgtcgttt tagagttggg agtaggtgtc 3660 

gttgggattt tgttttttaa attattattg gatttcgggt atcgggttgt ttacggtttc 3720 

gtagtgtagt tttatattgt tttttattta tttttgttgg gatagcggcg tttggacgaa 3780 

gtcggttgtg gggagagtac gcgtggggtt ttgttagtgt ttggggtatt tgggagggta 3840 

gtttttgggg aaggtttcgt ttataggatt tttagttagg gttggtggag aagttatttt 3900 

tttttagttt gtigggagagt gatggggttc ggttggaaat agtatggtat gggaaagatc 3960 

gtgggcgttt cgtttattgt tattttacgg tagttacggc ggtttttgag ggtataattg 4020 

tgggattata cgtggtttta aaatataaaa tatatgaaat tgggtagaag agttaggagt 4080 

tttgagtttc gagggttgtt tagagtttag gttttaagcg tatcgttttt tgatttttgt 4140 

tcgatatttt gattttagag attcgggtga gttgttgaga aaaataaatt ttaaagttta 4200 

aaggggtagg cggtttttgt ttagaagttt ttagttcggt ttcgttagga tagttcgtgg 4260 

gtggaaagtt cgtgttttta gtaatttttg agggcggcga tttagagaga ggcggatttg 4320 

ggaagttggg agttaggtag ggttttaggg ttttttattt ttagttatac gttttttttt 4380 

ttttcgaaat gtaagggaag gtttttagag ttgttaagga 4420 

<210> 318 
<211> 4258 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemi pally treated genomic dna (Homo, sapiens) 
<400> 318 : K u <<- . v 

tttttaaagt gttgggatta taggtttgag ttattgcgtt tggttt^aag taattgtttt 60 

tttattggtt tttttgtttt tggtfttatt aaacgtttta tjt.tta.aatta ttgtagat;aa 120 

ggggttttgt tfcaaggagga gagtttatta gttaaaattt tttagtagtt tttaattatt 180 
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ttaggataaa 
tatatggttt 
tttttatttc 
ttatttgagg 
tgttttttta 
gtgtagaaaa 
aatgtttaat 
aatttatgta 
agtagttaaa 
tttataaatg 
cgtgatttgt 
taggagatag 
ttgtttgggg 
ttttagatta 
gcggcggagg 
ggattcgggt 
cggaggtttc 
ggtatttatt 

tggggatttc 
gttcgttggt 
cgttttttga 
gggtttgagg 
aatagaagtt 
gtgggagtgg 
ggtgcgggtt 
agttgttttt 
gagaagagtt 
ttttagcgga 
ggcgtcggtg 
cgtattcgcg 
tcgggagcgt 
ggagaaggtg 
gttgggggtt 
attatgatgt 
ttttttattt 
ttttttttta 



tgtagattta 

tagttatatt 

gtttttttgt 

ttttcgtagt 

tgtagggtac 

gatatttagt 

ttttatatag 

atttatataa 

taatttgttt 

aattaagaat 

gcgttttata 

aggttgggag 

ttcgggagta 

aattataatt 

gcgtaggcgg 

tttggtttcg 

gcggcgtacg 

atatgggggg 

gtgtcgtatt 

cgcgttttta 

gaagggtatt 

ggtttttttt 

ttcgggaggt 

tggggagagg 
ttgtggattc 
cgcgggttga 
tttatttatt 



attgtttatc 

atattgtacg 

aagtggggag 

tataggattt 

gtcgaagtgg 

gggggtggtt 

tgtagtatga 

ttgggttaga 

ttgtgatttg 

gttgttggat 

gaagtatata 

acgggatttt 

agacgttgat 

aggataaggt 

gattgggttg 

gggtagtttc 

tttttttttc 

gattattgag 

gatatttggg 

gtagggtttt 

ggttgtttcg 

ttgtgataaa 

ttttagttgt 

cgtttgtgtg 

gttcggtttt 

gttatgaggt 

tatttattgg 

ataatgtttt 

ttgttttgag 

tttgatttag 

aaattatttt 

tgtaggttta 



tttttagttt 
ttgaggagcg 
tttcgggttc 
tttaagatcg 
cgtaaggggg 
ttttgtttaa 
tttacgttgg 
ttttcgtgga 
ttttttttta 
gtattaaggg 
tggtaaggta 
tttggggttt 
aattttaggt 
gggtttgatt 
gttttttgtt 
gttatatatt 
gtagggtgag 
tagattaagg 
gattataagg 
gaggttgggg 
gagattttat 
ttcgaggttt 

ggggttttgg 

gggattcggg 
gaagttgtta 
ggttagtttg 
aagtgggtgg 
gtagggaagg 
ttaattatgg 
tgtacgttag 
taaaaatgtt 
aaattattgg 
agtataagta 

tttttatttt 

ggggtggttt 
tttttttttt 

agatgtggtt 
tgggagttga 
gattttgagt 
ttgaattttg 
ggtattttgt 



tttttcgttt 
tttagattat 
aagttttatt 
tttttttgaa 
ggaaagattt 
tagcgatttg 
tgttaatttt 
ttttaaaatt 
tagatattaa 
tggaaggagg 
tttaagattt 
ggatacggag 

agggcgtggt 
tttaggattt 
ttgggcgttt 
gtttcgcggt 
ttcgtagttt 
tattcgggtc 
tttttaagtc 
gggttcgggt 
cgggatttcg 
tcgatagttt 
taggtttttt 
tcgttcgggt 
ggatttttat 
gtttgttagt 
tatatttttt 



tagcggttat 
gtttcggggg 
gttcgacgtc 
ttattttttt 
tagttagttt 
ggtttttttt 
tcgtttttag 
ttttggagtt 
gaaaagtgtt 
ttttcgttgt 
gagaggattt 
agagtggtag 
ttttagaagt 
acgtggtcga 
tagtgtttat 
cgtgtttttt 
gggttgagag 
attaggcgga 
atgtggtgat 
tagtaaagga 
tttttatagg 
ggaattcgta 
gatttgagat 
ttaagggttt 
gtattttggg 
attttgtcgg 
attttgttag 
tttgggtttg 
tattattgat 
gaagtttagt 
tttgtatatt 
tggaggggtt 
tggtaatgcg 
tatgttattg 
gatgtgttgg 
ttgtatagag 
attattatcg 
gttgaggtgg 
ttttttttgt 
agatggttat 
ttttgttagt 



seq_prot.txt 
tttgttgtcg tttttttttt 
ttgggttgtt gtttgttttg 
taggttgtat ttatatagtt 
agtttttatt atagttaatg 
atttaattta ttgtttaatt 
tattatttgt aggtatgagt 
acgtttggta ttgttttagg 
gatgatattt ttttatttta 
ggggtcgagt tggggtttga 
ataagagttt cgagaaggag 
tttagttttt agggagttcg 
agtttcgaga gtcgtgtgga 
ttggatgcgc gggcgttcgg 
agcgggcgtt gtcgtttacg 
ggcgagtgga ttgttcgagt 
aagtggggcg attttagttt 
ttattatagc gcgggcgcgt 
gaattaagtg attcgcgtgg 
ggttttaggg gtttagggtt 
ttgaaggcgt tgggtaggta 
gggcgttggg gcgaggtttt 
ttttatcgtt agtagagttt 
gggcgcggtt tgtgtgtatt 
ttggggagat cgatgtatag 
aatttttttt ttttcgtttc 
attttcgacg tattttggtt 
tgttttattt tgggtcgttt 
cgggtttgaa aggagtaaga 
cgggttttcg tttcggtttt 
ggttatcgat atatattacg 
ttataattag tcggttatcg 
agggttcggg atgtaggcgg 
tttttttagt ttttagttcg 
gacggtagtt attttttggt 
tttttagata tttaggtttt 
cggtatttgt agtaagaatt 
ttttttttta ttattggttg 
taggttagcg ggttattttg 
acgattgttt gtttaaaggt 
taaaggtgtg tatttacgga 
ttagttgggt ggtgattttt 
gcggttttgt gaatttttat 
agaagttgat agattttttt 
gttgggtttg gattattttt 
cgaggtttag tattgttagt 
ttttttggta gagggtttac 
gaggtcgggt ttgttggggg 
atgaaaagtt cgttcgttat 
tgttggttac gtattatttg 
ttatttggga atattaagtg 
gggggttgag gttgtggggt 
gtggggttag gggcgtgggt 
tattatttgt attcgttttt 
gtgaggtaag aatggtttag 
agtttttgtt cggggtatga 
atttttagtt agataatttt 
agtatttttg gcgttagtag 
gttatatgtt atttaggggg 
tttgattgaa ttttcgtttt 
tttcgtgttc gtattaatgg 
gattatattt tgtcggagtt 
tttgggggcg gataggaatc 
ttataatgga gagttattta 
ttaataatga tgtcgatttg 
atgggatggg ggtttaggta 
tttattttgg gatttggttt 
gagttgttta ttaatggatt 
agt tg aggag tttgaaattg 
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ttttagattt 
ttatatattt 
gttttagttt 
atattgaaat 
tttttttttg 
gaataataat 
tatttatata 
ttaagtattg 
agtttggtta 
ttaggtaggg 
ttttatagag 
ggaagcggtg 
gacggtacgt 
cgacgttacg 
tttttcgttg 
atttagttcg 
agacggggtt 
ggggtttcgt 
ggtgtcgtac 
ggggtagttt 

cgggttggaa 
cgggttgggg 

tggggagacg 
gtggagagat 
ggtagatggg 
tttgaagtcg 
gggtttttag 
cgatgatttt 
ttttgggatt 
tggagatggt 
acgcggtagc 
gagggagagt 
tttaacgttt 
aagatttacg 
agtttcgttt 
tttagtgagg 
taattttatt 
tttcgggggt 
gttagggtta 
gtttggaagg 
atgggtaggt 
attttttttt 
tcggtgagtg 
ttttatgatt 
tggtgcgata 
gtgagagtcg 
tgggaagggt 
tttttggata 
gcgttgtatg 
agatagagga 
tggtgttttg 
agttttgatt 
tattaaagaa 

ggggtgggta 

tttgcgggga 
ttgttatagg 
tattgtttag 
gtagaattgt 
tattcgtaga 
ttatgtggtt 
gtttaagaat 

ggggtgtttg 

gatatttttt 
attattaggt 
ttgtttttga 
ttgattagat 
tggggttagt 
agaaatag . _ 



240 
300 
360 
420 
480 
540 
500 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4258 



seq_prot.txt 

<210> 319 
<211> 4258 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 319 

ttatttttta attttaagtt ttttagttgt tgatagaggt aggatattta ggtttgtagt 60 

tggttttaga tttattaata agtagtttat ggttatttta ggatttagag aatagtttgt 120 

ttggttaggg attaagtttt agaatggggt aggaaggggt ttaaggtttt aaattaagtt 180 

agaaatagtg tttagatttt tattttattt attttagttt aatttttatt tagggtaaat 240 

ttgatgatta gatcgatatt attgttggcg atggtgatga ttatatttgg gatattgtag 300 

ggagtgttta ggtgattttt tattgtggtt ttatgtagag gggaaagatt agtaggtata 360 

agtatttcgg tttttgttcg tttttaagtt agtatattaa gttattttat tttatggtat 420 

ttttgagtag tttcggtagg atgtagttta gtggtatagg gatgaagggg aatcgggtag 480 

ttatatggtt attgatgcgg atacggggcg tattgttata tttgtgttta tataggcgtt 540 

tgcggatagg agcgagggtt tagttagggg tttttttatt agtggtttgt aattggaaat 600 

aattttgttt ttttaagtaa tatatggtaa tgtgtagaga tatttttgtt tgttataatt 660 

aggtagtatt gttggcgtta gggatgttgt tgaatttttt aacgtgtacg gggtagtttt 720 

tgtgataaag aattatttgg ttggaaatgt taatagtgtt atggttgaga aattttgttt 780 

ttcgtagatt atgtttcggg taagggttta gatttaagtt ttttttgttt tagatgtttg 840 

tttatttttt gagttatttt tgttttattt ggtaaagttt atttattttt taataatttt 900 

ttttggtgag agacgagtat agatgatgtc gataaagtta ggttggtcgg agaaaggagg 960 

ttaggattat ttacgttttt agttttattt taagatgttg gtaatttcgg aattgtttta 1020 

aagtattagt tttatagttt tagtttttag gtttttgatt cggattttta gtttagtttt 1080 

ttttgtttta tttaatgttt ttaggtgggt tttaggtttt agagttttat tttgttttta 1140 

tgtagcgtta ggtgatgcgt ggttaatatg cggattttaa gtttcgaagt tagcgttttg 1200 

tttaagaagt agcggacgag ttttttattt tgtaaagagt gaggttttag aatttcgtgt 1260 

tttttttatt tttagtaggt tcggtttttt ttttgttgtt ttaattttta tgtgttttcg 1320 

gtttttacgt aggttttttg ttaagagggt tattatattt ttgtggttat ttagtagttg 1380 

tcgtattagt tggtagtatt gggtttcgtt cgtttggttt ttgatttgta ggttatagag 1440 

ttatgaggaa gggtggttta agtttaattt tttagttttt tattttgttt tggtttagta 1500 

tttatcggag ggaagtttgt tagtttttgg aaggtacgga tatatagttt atgttgtaag 1560 

gaagaggtgt gggaatttat aaagtcgtat gggtattgag tagagaatag ttatttttat 1620 

ttatttatgg ggattattat ttagttggtc gattacgtga ttaaattttt atttcgattt 1680 

ttttaggttt cgtgaatata tatttttggt ttttgaaggt ttggggttga attttgtgtg 1740 

gttttgatat ttttaggtaa gtaatcgttt gttattttgg gttttagatt ttttatttgt 1800 

tttcggggta gggtggttcg ttgatttaag gtttttttta ttttattacg tgtagtatgg 1860 

tggggttgta gttaatgatg aaaggaaggt agcggaagtt tttgatgcgg tgagtaattt 1920 

ttattggaag tttttgttgt aggtatcgag tatttttttg ggagaaaatg agaagggaag 1980 

gcggggttgg agtttgagtg tttgggagag ttttaggatt tacgaggaaa atagagggcg 2040 

tgaattttat tagaaggtgg ttgtcgtttt gagagcggtt agcgtagagt attatggtgg 2100 

gcgttaggcg gattgagggt tagagaggga gagaggtttt gagtagagaa tttttaatat 2160 

tttttttttc gtttatattt cggattttgg gttggttgtt tttttgcgta tttttttcgt 2220 

tgtcgcgtcg atggtcgatt ggttgtaaaa ggaggtgacg gttttggagc gttttcgagt 2280 

tatttttacg tggtgtgtgt cggtggtcga cgtcgagcgg gttcggagcg cgagtgcggg 2340 

ttttaggagg ggtcggagcg gaagttcgtt ttcgggatcg ttttttagta tcgacgttag 2400 

gattatcgtt ttgttttttt tagattcggt ggtcgttagg attgaggatt cgttaggatt 2460 

gaggatttag gcgatttaaa atggggtaag ggggtgtggg tgggtagggg ttttttttcg 2520 

attttaggga ttagggtgcg tcgaggatgt tgataggttt agttcgcgga gagtagtttt 2580 

tatttatcgg ggcggggaag aggaagttgt aaggattcga atttatagaa ttcgtattat 2640 

ttttttattt gtgtatcggt ttttttagat tcgggcgatt tttttttatt atttttatcg 2700 

tttttttaga tgtatatagg t^gcgtttaa aggatttagt ttttcggggg tttttatttt 2760 

ttaattcgag gttttattgg eggtgggagg gttgtcgggg gaggagtttt ttagattttt 2820 

ttagttcgaa aatttcgttt tagcgtttcg gggtttcggg tatttttttt agggggcggg 2880 

gttgtttttg tttgtttagc gtttttaaat tcgggttttg ggagcgcggt tagcgaacgt 2940 

gcgatattag ttttgggttt ttggggtcgg tttgggaggg tgcggtacgg agtttttagc 3000 

gggatttttt acgcgggtta tttggttcgg ttcggatgtt ttttatatgg tagatgttag 3060 

tttcgtttgc gcgttcgcgt tgtgatggag gttgcggacg tgcgtcgcgg ggttttcgcg 3120 

gattgagtag gttggggtcg ttttatttat cgcggggtcg gagttagggt tcgggtttta 3180 

gcggaagggt tcgaatagtt tattcgttag gcgtttagtc gtttgcgttt ttcgtcgtcg 3240 

£gacgtcgcg tgggcggtag cgttcgttgg gttttgggag ttgtagtttg gtttgaggac 3300 

gtgtcgtttc gggcgttcgc gtatttaggt tacgtttttg ttttcggatt ttaaatagta 3360 

tegttttttt tatacggttt tcgaggtttt tcgtgttttt tttagttttt attttttgtt 3420 

Seite 261 



ttatgaaacg 
ttatttgatt 
attaggtttt 
gtgtttggta 
tatgagtgtt 
tattatttat 
ggagaggaaa 
tttagtatta 
attgagatag 
gtgtgtggta 
atttgggaag 
tagttaggaa 
gtttgtagtg 
gataattatt 



ggttttttgg 
ttttttcgga 
aaattttagt 
agatgaagag 
tagaatagtg 
ttatgtttgt 
ttgggtagtg 
ttggttgtga 
ttgtgtaggt 
ggatagataa 
gagaggggcg 
ttattgaagg 
atttagagtg 
ttaggttagg 



gagttggaag 
gtttttgttt 
tcgatttttt 
atattattag 
ttaggcgtgg 
agataatgta 
agttgaatag 
tgaggatttt 
gtagtttgag 
tagtttagat 
gtagtagaga 
gttttaattg 
gggcgtttgg 
cgtagtggtt 



seq_prot . 
attttagatg 
tttttttagt 
aatatttgga 
ttttggggtt 
aattagtatt 
agtcgttgat 
attttttcgt 
tagaggaggt 
taggatttgt 
ggtttgagga 
gcgaaggata 
gtgggttttt 
tgagattaaa 
taggtttgta 



txt 
taaagcgtat 
ttttggttta 
gtaagttatt 
gtgtggatta 
atataagggt 
tgggtgtttt 
gttttgtata 
tgcgggagtt 
aagggagcgg 
tatggttgga 
agtttgtatt 
ttttttagat 
ggtagagaaa 
attttagtat 



agattacgtt 
tttatgggta 
taattgttta 
tatgaattta 
taggtattat 
ttttgtatta 
agaaaatagt 
ttagatagaa 
ggtgggaagg 
gttatatggg 
tgttttaggg 
agaatttttt 
ttaataagaa 
tttgagag 



Sequence 



<210> 320 
<211> 4435 
<212> DNA 
<213> Artificial 

<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 320 



cgtaggcgtt 

tcgtttcgtc 

gttttcgcgt 

cgtaggtttc 

cgttcgcgtt 

gagtcgcggt 

ttgggtttgg 

ttcgcgttat 

gtgtgtgaga 

gtgtgtgtgt 

tttagttgag 

gaggaagcgt 

gggttttaga 

cggggtaggc 

gcggtcgggt 

gggcggggtc 

agtttcggcg 

tcgggtttcg 

gcgcgttttt 

ttgtagttcg 

gtcgtcgttt 

tcggggcgat 

gtcgtaggtg 

cggcggcgcg 

ggcgtaggcg 

tagtaagacg 

attttggtcg 

aggtggtggt 

tagtggaaaa 

gcggagggta 

ttggcgggga 

gcgagtggtt 

tggaggtgtt 

agggagtgcg 

tttttttttt 

ggtgcgtcgt 

gtagagtgaa 

ttttaaatgg 

aagggagttt 

ttacggggcg 

gaggtgagtt 

ttgggttttd 

tcgaaggtcg 



tagggtttat aggttcggag 
ggaggggttt tttcgttttc 
tcggttacgt tattcgttcg 
gtttagcgtt ttcgtttttt 
ttttacgttc gtattcgtat 
tttttgaagc gcgttatttg 
gcgttgcgcg gcggaatcga 
cggtttgcgg gcgagatttt 
gagagagaga gagagagaga 
gtgtgtgtgt gtgtgtgtgt 
cgttaggacg gggggcggat 
cggagatttt tttgttttta 
gaggggagtt ttaggcgtgg 
gggggcgcgg cgagagtcgc 
ttcgttttcg ggggatcggg 
gcggggtcgt cgtttaaaga 
gcggcggcgg cggcggtagg 
gagcgatcgt agggtagttt 
tcgcgcgcgg tttcgatgtt 
ttcggtatcg ttagttttat 
ttcgtcggtt tcgagggttt 
tcggcggagg cggcggacga 
tttaagggtt acgattggag 
tttaaagtta agtcgtatgt 
gcgcgtcgta agttggtcga 
ttgggtaagt tgtggcggtg 
tttttggttt cggattgcga 
tatacgtagg tttcgggttt 
atattttttg gttagtcggg 
gcgtttgggg tgtgtatttt 
atattttgcg gagggttatg 
ttagtttcgg gaggtacgtt 
ttttttttag aaagggttta 
gtaggcggat gggtgggtag 
ttttggatat tttcggttcg 
gtgtacgttt tggcgtggtg 
gaatttgttg tttaggatag 
tttttttggt ttggggtttt 
ggagtttgcg tatttgtatt 
ttgtggacgg taggtttttg 
tcgggaagag ttttaaggag 
ttagggtttt tgggtgtttg 
gtatcgtrtt-xgggaag tt-t- 



cggttttcgt 
gagttttagg 
gcggttttag 
ttagttggtc 
gggtagggcg 
gacgtgcgtt 
ttttgttttg 
aggcgcgaga 
gagagagaga 
gtgtcggggg 
ttttaggagt 
ttcggttttt 
ggcgtttggt 
ggagtttttt 
gcgcgtttcg 
aatttgttgc 
ggcgagggtc 
cgggcgtcgg 
ggatatgagc 
gtcgtacgtg 
gggtcgcgcg 
gcgtttttcg 
tttggtgttt 
gaagcggttt 
ttagtattcg 
agtgtcggcg 
gcgggggttt 
tgtatttcgg 
gttcggcgtt 
cgtggcgggt 
gttggtattt 
cggtacgata 
gtttttttag 
taggtttgtt 
aagtagggag 
gggtcggtta 
gtcgttttta 
ggttttcgta 
ttttgttttt 
tttttttaag 
tgtgttttgt 
ggaatggagg 
cgtttttgtt 
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ttttgtcgga 
ttacgttcgg 
ggatttagcg 
ggttttcggg 
gttcggggtt 
gttcgtagat 
gatcgtaggt 
cgggttgtgt 
gagagagaga 
aggggttcga 
cgcgcggcgt 
ttgtttcgtt 
ttacgtggtg 
ttatgggtgg 
ttaatgggta 
gggtttcgta 

ggggttatcg 

tttcggtgcg 
gaggttcgtt 
gaggattcgg 

ggggtcgcgg 

gtttgtattc 
atgtcggtgc 
atgaacgtat 
tatttgtata 
tttcgggggc 
ggagggcgta 
gcgggtgtta 
agttatggga 
gcgttttgga 
cgacgtgttt 
gtaagttgta 
tttttggatt 
tgcgagtatt 
ggtaggagtg 
gttttttaga 
tcgagtgttt 
ttttgttttt 
tgtttggatt 
ttttgtcgta 
ggtttagtga 
gatagtagtt 
-tatcgtcggt 



ttatttaggg 
aaggttcggg 
tcggggttag 
atcgcgtttt 
ttttagtttc 
tttggacgag 
agtagttgat 
gtgtgtgtgt 
gtgtgtgtgt 
ggattcgttt 
tcgtagagga 
ttttttttcg 
cggcgtcggg 

gggggcgggg 

gtcgggcgcg 
gcgggattcg 
cgcggcgatt 
cgttttttgt 
tttagtcgtt 
attcggacgc 

tggggggcgt 
gcgacgtcgt 
gcggcggcgg 
ttatggtgtg 
acgtcgagtt 

ggggttcggg 

gagttcgcgg 

gggcgggttt 

gtcgggattc 
gttagttttt 
tttagtcgtt 
tgcggttttt 
atttatgggg 
tttttcgtag 
gagatgacgg 
aattaggatt 
tttggttggg 
tatttggaag 
tagggcgtag 
taggcggttg 
ggacgggcgt 
tagttgttcg 
tgggttttgt- 



3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4258 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
246t> 
2520 
2580 



seq_prot.txt 

tcgttttttt cgtttttttt agaaagtttt agagggaggg aatagtaggt tcggaatagg 2640 

gggaaataga aggaggaaga tgggaggcgt ggggtttgtt tttttttggt tgtcgtaagt 2700 

atattaattt ttttaaatag gcgtcggtcg gtttttttag aggagtgtat tgtttggtgt 2760 

tacgggaggt ttcggagcgt acggtaggcg ggtttttttg gttagagttt tttggtttcg 2820 

gtttttgttg tcggtttttt ttgtggtgtg tgtttgcgtc gagggtatag tgggcgtttt 2880 

ggttattttt gtttttgttt tgtgttgtag tttgttgagc gagagcgaga agcggttttt 2940 

cgtggaggag gtagagcgtt ttcgcgtgta gtataagaag gattatttcg attataagta 3000 

ttagttacgg cgtaggaaga gcgttaaagt cggttatagc gatttcgatt cgggcgcgga 3060 

gttgggattt tattttggcg gcggtgtcgt gtataaggtt gaagtagggt ttggagatgg 3120 

gtattattat ggcgattata taggtgggtt ttaggttttt cgtatatttg agggtttttg 3180 

tatgggaggt agttgtgggg tttttggttg agaagggtgt atgatggtgg aggggggtag 3240 

gcgttttcga gagaatcggg tttttttcgg gttttttgaa aaagttttta aggtttcggt 3300 

cgtttttgtt tttggtgtac gagggatttg tttagttatt ttgagtgttg aggggttgat 3360 

agttttgttt ggtagggatt tcgagaggtg gtgtttttag taagatttag tttaatgcgg 3420 

gttttttttt ttatatcgag ttgttgttta tacggtattt agtagtttta gttttagggt 3480 

tagattgata gggtggagtt ttgtcggggt ttcggttagg ttggttgacg ttcgttagtt 3540 

ttagagtgtt tggttggttc ggggaggaat agggtttgtg tggtttttta gcgttttgta 3600 

aattttagtt ttggttaggt tgggtttgtc ggtttttttt ttttattttg gtttttggga 3660 

ggggtttcgg gtttttttat agtatagcgt tttatggaat tttttcggtc gaggagtgag 3720 

gtttaggttt gattttgttg gaatttttgg gaaatgtaaa gaaataaaaa gtcgttgatt 3780 

tttttggtag ttagcgcggg cgtttttttt tttttagttt ggtcggtttt attcgtttta 3840 

tagaaggggt atttattttt gaagtttgtt tttttgtttt tagggtagat ttacgggtcg 3900 

tttatttcgt ttattatttt taagacggag ttgtagtagg cgggcgttaa gtcggagttg 3960 

aagttggagg gacgtcggtc ggtggatagc gggcgttaga atatcgattt tagtaatgtg 4020 

gatatttcgg agtttagtag cgaggttatg ggtattatgg acgttttcga cgtttacgag 4080 

ttcgattagt atttgttttt gggcggtttc gttttattcg agtcgggtta ggtttatggg 4140 

ggcgtttatt tttacgtcgg ggcgtttttc gtgtgggttt ataagagtgt ttcgtcggtt 4200 

ttcgcgtcgt ttatcgagac gggtttttta cggtcgtata ttaagacgga gtagtcgagt 4260 

ttcggttatt acggcgatta gtttcgaggt tcgttcgatt acggtttttg tagcggttag 4320 

tttagcgtta tttcggtcgt tttcgtcggt tttttcgtcg gtttataggg cgattatggc 4380 

gatttgtagg tttttagtta ttatggtgtt tattttggtt acgtattcgg ttttt 4435 

<210> 321 
<211> 4435 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA CHomo sapiens) 

<400> 321 

4i 

agaggtcggg tgcgtagtta gggtaggtat tatagtagtt ggaggtttgt aggtcgttat 60 

agtcgttttg tgagtcggcg aaggggtcgg cgggggcggt cggggtggcg ttggattggt 120 

cgttgtagga atcgtagtcg ggcgagtttc ggggttggtc gtcgtagtgg tcggggttcg 180 

gttgtttcgt tttgatgtgc ggtcgtgggg gattcgtttc ggtgggcgac gcggaggtcg 240 

acggggtatt tttgtgggtt tatacggggg acgtttcggc gtggaagtag gcgtttttat 300 

aggtttggtt cggttcgggt ggggcggggt cgtttagggg taggtattgg tcgaattcgt 360 

ggacgtcgaa ggcgtttatg gtgtttatga tttcgttgtt gagtttcgag atgtttatat 420 

tgttgaagtc gatgttttgg cgttcgttgt ttatcggtcg gcgttttttt agttttagtt 480 

tcggtttggc gttcgtttgt tgtagtttcg ttttgggggt ggtgggcggg gtgggcggtt 540 

cgtgggtttg ttttggggat aggagagtag gttttaggga tgaatgtttt ttttgtgggg 600 

cgggtggggt cggttaggtt ggaaggggag ggacgttcgc gttaattgtt agggggatta 660 

acggtttttt gtttttttat attttttaga aattttagta aagttagatt tagattttat 720 

ttttcggtcg agaaaatttt atgaagcgtt gtgttatgag aagattcgag gtttttttta 780 

gagattaagg tggggggaag aagtcgatag gtttagtttg gttagagttg gggtttatag 840 

ggcgttgggg gattatatag attttgtttt ttttcgggtt agttaagtat tttgaggttg 900 

gcgggcgtta gttagtttga tcgaggtttc ggtagga&tt tattttgtta gtttggtttt 960 

gaagttgggg ttgttgagtg tcgtgtgggt agtagttcgg tgtgggaaga ggagttcgta 1020 

ttggattgag ttttgttgaa aatattattt ttcggggttt ttgttaagta aagttgttag 1080 

ttttttagta tttaaggtgg ttaagtaggt ttttcgtgta ttaaaaatag gggcggtcgg 1140 

ggttttggga gtttttttag ggaattcggg gaaggtt^gg ttttttcgag ggcgtttgtt 1200 

tttttttatt attatgtatt ttttt tagtt agaaatttfa tagttgtttt ttatataggg 1260 

atttttagat atgcgggggg tttggagttt atttgtgtgg tcgttatggt ggtgtttatt 1320 

tttaagtttt gttttagttt tgtatacggt atcgtcgtta ggg?:ggggtt ttagtttcgc 1380 

gttcgagtcg gagtcgttgt ggtcggtttt ggcgtttttt ttgcgtcgtg gttggtattt 1440 

gtagtcgggg tggttttttt tgtgjitigtac gcgaaggcgt tttgtttttt ttacgaaggg 1500 

tcgtttttcg t ttttcgttta gtaagttg ; ta gtatagggta gaggt;aggga tggttagggc 1560 
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seq_prot.txt 

gtttattgtg ttttcggcgt aggtatatat tataggggga atcggtagta gaggtcgggg 1620 

ttaggaggtt ttgattaggg aaattcgttt gtcgtgcgtt tcggagtttt tcgtggtatt 1680 

aggtagtata tttttttggg agaatcggtc ggcgtttgtt tgggaaggtt ggtatgttta 1740 

cgatagttaa agaagaataa gttttacgtt ttttattttt ttttttttat ttttttttat 1800 

ttcgggtttg ttgttttttt tttttggggt tttttaaggg aggcggggga ggcgggtagg 1860 

atttagtcgg cggtggataa aagcgaggtt tttcgggaac ggtgtcggtt ttcggcgggt 1920 

agttgggttg ttgttttttt atttttagat atttagaagt tttggtaggg tttaggcgtt 1980 

cgtttttatt gagttatagg atatattttt tgggattttt ttcgaggttt atttttaatc 2040 

gtttgtgcgg taggatttgg gggagtaggg gtttgtcgtt tatagcgttt cgtggttgcg 2100 

ttttagattt agatagaagg taggaggtgt aaatacgtaa gttttaagtt tttttttttt 2160 

aggtgaggag taggatacgg gaattagagt tttaaattaa gagaattatt tgaaatttaa 2220 

ttagaggata ttcggtgggg gcggtttgtt ttgagtaata agttttttat tttgtagttt 2280 

ta 5£H tt99 aaaattgatc ggttttatta cgttaaggcg tgtatacggc gtatttcgtt 2340 

"Qttttttt tgtttcggat cgagggtgtt taggagaggg agaaattgcg 2400 

gggagggtat tcgtaggtaa atttgttgtt tatttattcg tttgtcgtat ttttttttta 2460 

tgggtggttt aggagttaga ggagttgagt tttttttaag gagggagtat ttttagagag 2520 

tcgtatgtaa tttgttgtcg tgtcgggcgt gtttttcggg gttggggtta ttcgtaacgg 2580 

ttggagggta cgtcggggtg ttagttatgg tttttcgtag ggtgtttttc gttaggagaa 2640 

ttggttttag agcgtattcg ttacgagggt gtatatttta ggcgttgttt ttcgcggatt 2700 

tc 9fH t " a tagttggcgt cggatttcgg ttggttagag gatgtttttt tattgggatt 2760 

cgttttgata ttcgttcggg gtgtagagtt cggagtttgc gtgtggttat tattttcgcg 2820 

a9 .H tt9cgt trttttaggtt ttcgttcgta gttcgagatt aggggcggtt aaggtttcga 2880 

atttcgtttt cggggcgtcg gtatttatcg ttatagtttg tttagcgttt tgttgagttc 2940 

ggcgttgtgt aggtgcgggt attggtcggt tagtttgcgg cgcgtcgttt gcgtttatat 3000 

tatgaatgcg tttatgggtc gttttatatg cggtttggtt ttgagcgcgt cgtcgtcgtc 3060 

gtcgcgtatc ggtatgggta ttagatttta gtcgtagttt ttgagtattt gcgatacggc 3120 

gtcgcggatg taggtcggga agcgttcgtt cgtcgttttc gtcgggtcgt ttcgggcgtt 3180 

^""cgcg attttcgcgc ggtttaggtt ttcggagtcg gcgggagacg gcggcgcgtt 3240 

cgagttcgag ttttttacgt gcgatatgga gttggcggtg tcggacgggt tgtagggcgg 3300 

ttgggagcgg gtttcgttta tgtttagtat cggggtcgcg cgcggagggg cgcgtatagg 3360 

agacgcgtat cggggtcggc gttcggggtt gttttgcggt cgtttcggga ttcgaggtcg 3420 

tcgcgcggtg gtttcgattt tcgtttttgt cgtcgtcgtc gtcgtcgtcg aggttcgggt 3480 

ttcgttgcgg gattcgtaat aagttttttt aagcggcggt ttcgcggttt cgtttcgcgt 3540 

HSSiP*^ attggcggag cgcgtttcga tttttcggag gcggagttcg gtcgttttcg 3600 

tttttttatt tataaagaaa gtttcgcggt tttcgtcgcg ttttcgtttg tttcgttcgg 3660 

cgtcgtatta cgtgggttag gcgttttacg tttggagttt ttttttttgg agtttcggga 3720 

aaggaagcgg ggtagggagg tcgggtggga gtaggggggt tttcggcgtt tttttttttt 3780 

tgcgggcgtc gcgcggtttt tgagagttcg tttttcgttt tgacgtttag ttaggaagcg 3840 

ggttttcggg tttttttttc gatatatata tatatatata tatatatata tatatatata 3900 

tatatttttt tttttttttt tttttttttt tttttttttt ttttttttta tatatatata 3960 

tatatatata gttcgtttcg cgtttgaggt ttcgttcgta ggtcggtggc gcggggttag 4020 

ttgttgtttg cggtttagga tagggtcggt ttcgtcgcgt agcgtttagg tttagttcgt 4080 

ttagagtttg cggatagcgt acgtttaggt aacgcgtttt aaggggtcgc ggttcggagt 4140 

tgggggattt cgggtcgttt tgtttatacg agtgcgagcg tggagagcgc gggcgggggc 4200 

SS 9 "SHi 9tt:a g tt; gg a gggg gcgggagcgt tgggcggggt ttgcgttgat 4260 

HiE? 9 £ 9ttg agttrtttggg gtcgtcgggc ggatggcgtg gtcggacgcg ggggtttcgg 4320 

SHHE 999 cg'tgatttag ggttcggggg cggggaggtt ttttcggcgg ggcggttttg 4380 

ggtggttcgg tagaggcgga ggtcgtttcg agtttgtgag ttttgggcgt ttgcg 4435 

<210> 322 
<211> 4784 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically, treated genomic dna CHomo sapiens) 
<f»00> 322 

tttggttgta ttgggattta tagagttata ttttggaggg tttttttttt gatattgata 60 

agggtgtgta ttatatttta gtgagcgtta tacggttggg ggttaacggg agttatattt 120 

tttagatttt tagtgtgttt tttatcgagg tttattttga agattatagt gagttgtagt 180 

fi?vf a 2 ga 3 a9 iHJ ttt:a g att ttggtg aggtggtatt atttgtttgt gttgcggatg 240 

aatttgtgat tgttttgacg gtgattttgg atgtcgattt tattaagatg attttaaagt 300 

aaaggattga ttttttgta?: aggatgcgga gttttttaga agtagagttt tataatatga 360 

aattagtgtc ggtggtgaat aatagattat ttgatatgtc ggtttttatg gttggttcgg 420 

gaaatgtaaa aaaggtggtg gagaatgggg tttttttttt ttggaagttg ggttgttttt 480 

t^aattaflaa tagtgtgttt gatattjcatg gtgtagaggt ttttgttagg gagggcgtaa 540 



seq_prot.txt 

tgtttgttta gtttggttat tttgtggtgg gttggtatat cgttaataag aagttttttt 600 

tttttaaacg cgttcggagg tagatttatg ttatatttat atttgttatt gttattgggt 660 

ttttaattac ggttatttag gagtttttat ttaggatcgt gttaattttt atatttttag 720 

ttattgtttt tttaatagag attatggttt ttttagttag ggattttgtt tttgggaaat 780 

ttacggttat tattcggatt cgaggcgtta ttatttaaat tttaatttta ggttttattt 840 

agtttattcg ggtgttagaa gttggtatta tagtttttgg ttagattcgt ttaacgatga 900 

ttatttttgg ttatgtggag tttattgtag ttgttatttt ttttataatt attattaaga 960 

agttacgagt atttatatta aaattagtaa cgtttttaat tgattttatt attattacga 1020 

ttcgtaggtt aattaagaaa ttacggatat ttcggttagt gtttcgggtt attattaaag 1080 

tttttattat tagattggaa attgttttat cgtttattcg tattcgtatt attattagtg 1140 

gagtgtttcg tggcggagaa tttaattagc gtttagagtt taagaattat attgataggg 1200 

tagatgtttg ggttggtatt tattttgagg tgaagatttc gttagatatt ttttatgatt 1260 

atgaggatat tattattgat aagttgaagt tgattttgaa attgcgggag tagtagttgg 1320 

tgggcgagaa gttttgggta tagtttaata gtaatagtta gtttatgtat ggttttttcg 1380 

atagtagtta cgtgggtaaa tacgagtatt ttatgtatgt tatagataag gggggtttgt 1440 

cggttgtgga tgttttcgag atttacgttt ataggcgttt ttaaggggat agggtttttg 1500 

taaggtttaa ggttaagttt gtgggtgatt cggtattggt gttgaatgat atttataaga 1560 

agattgtttt ggtaaagaaa ttggttttcg tttttggaga tcgaaattgt agtattatta 1620 

ttttgtagaa tattattcgg ggttttatcg tggtggaatg gattaataat atattgtttt 1680 

nggagttttg ttttaaggag tagatcgttg ggttgagtcg tcggatcgtt gaggatgatg 1740 

gaaaatttcg gtttgttttt tttaacgttt tagagtttga ttttaaggtt ataagtatta 1800 

ttgtgacggg ttttggtagt tgtcggtatt tatagtttat ttttgtggta ttatttagga 1860 

gagtgttttt agaggcgtcg tttatagaag tgtttgatag ggattttgag aagagtagtg 1920 

aggatgatgt ttarttgtat atagttattt cggtcgtggt ggtcgtagtt attttgttta 1980 

ttgttggtat tattigttatg atttgttatc gtaagaagcg gaagggtaag tttatittttg 2040 

aggattaggt tatttttatt aagaaggggg tgtttattat ttttgtagac gaattggacg 2100 

attttaagtt tttatttttt tttagtatgt tatttatttt gtaggaggag aaggtttttt 2160 

tatttttttt tgagtatttt aattagagtig tgttcgagat tatttttttg aattaggata 2220 

ttatgggaga gtatacgttt ttgcgggatg aggattttaa tgcgtttttt tattagtttt 2280 

tatcgttttt tatagtartt atggagggta agggttttcg ttttaagaat atgattttat 2340 

atcggtrtatt ttttttttat gttttatttt aattcgtaag cgtttgggtg gaggtagggt 2400 

agggtagggg tttggagacg atatggtgtt gtttgtggag atcggtggtt tgtagattat 2460 

tgtttatcgg gagtcgatat ttgatttagt atatattgat ataggggttt ggataagttc 2520 

gttttttttg gtttttttaa attttaaagt agttggagag attttgggga tttttttatt 2580 

tttatttttt gtttaatagt ttttggtttg tttatagaga atttttcgtt ttatttttga 2640 

tggttggttt tgaaagtatt atgtggagtg gaggtggagg gagcgaggaa ttatgaatga 2700 

attcgtaggt agtgtcgggc ggttttttgg ttttttgcgt tttgttttta atattaattg 2760 

tattgttttt tttatttacg tgtgtttagt tgtaggatgt aatatggaaa atagtaatta 2820 

aagattaaat ttaaaggatt tttagaagtt aaggttaagt ttttacgttt aatttgttgt 2880 

ttatttaaat ttgtatgtat aatttttggg tgggtatggg gaattgtttt gttaaaaata 2940 

agtttttagg gtgttttaaa tttagagaag attaagggat agtatttttt attaaaggaa 3000 

tattattttt ttatattacg ttaatttggt tgttttgata ttttagagtt tgattggggg 3060 

tttttcggtt ttggtttacg ttaagttttt ggtgttgggt ttgttttttc gttgttgtta 3120 

ggggttggaa gttggagggg ttttttgggt tatggatatt tttattttta gtttatgtat 3180 

attagtggtt tacgattaag gggtttttat ttttatgaaa aagggatttt aagaggtagt 3240 

ggtggttgtg gtttttaatt ttggtgtttt agggtgggtt agttgtttgt gggggtattt 3300 

gggaggttaa aggtttttat tatattaatt tattttgttt tatttttttt ttgtgtattg 3360 

tttttttttt tttttttaaa aggaatatta cggttttttg aaatatttag tgggggatat 3420 

tttggtgaag atgtaatatt tttatgttat gtgatgtttt ttttttattt gattttggtc 3480 

gttttgtttt aatagtttat agttttgttt cgatttattt tatttttttt ttttggtatt 3540 

ttagttttag gttttgggtt tgaattattg gaaaaggttt ggcggttggg gaggagtgtt 3600 

agtaatagtt tataataaaa atttgttagt ttttaaagtt aattttttat taaagttttt 3660 

atatagtttt aaattgtttt attaaaaaaa agatttaaaa tggtgatgtt tatagtagtt 3720 

tgtacgagtt tttaagtgtt gattttatgg aattgacggt tttgtttgtt ttgatttttt 3780 

tttttttatt ttttttaatg gtttaaattt tggaattata ttggggtttt tttgtttttt 3840 

ttagtagaat attcgttcgt ttatttgtat ttttgtttta tgatttaggg gcgtttattt 3900 

tgtttcgatt ttttttttgt ggaagaaatt attttgagta tgattttttt tgatgtttga 3960 

agcgttattt tgggtatttt ttagggagga atgtttttcg taataatgta tttatttttt 4020 

gattgagggt gggtgggtgg atttaggttt tttttgtata tagagtagtt atttttaagt 4080 

tatatcgatt gttttgtaga ggatttgtgt gtgttgtttt aggaggggag ggttggtagg 4140 

agggggggag aggtttttgt tttattgttt tttagagggt attttttttt gcgttttttt 4200 

ttatagggtt tagttttttt tttttgtttt agtttttagg gggtattttg gagtgagtag 4260 

tgtttttgtg ggggagtttg taaatgcggg tttagtggat tattggtgat tgggtttatg 4320 

tttttaagtt agagtttttt tggtgtttta gagataggag tataagtggg atttgatttg 4380 

gtgagattat ttttgatgat tttattaaaa aataaataat ttttaatgtt ttaggtgagg 4440 

gttttgaaag gttttttaaa tagtttcgtc gtttttagta attttattat tgggtattgt 4500 

tatgtagaga cgtggttggt ttagaatggt ttgttgttat agtaattgga ggcgatgggg 4560 

tatjtgaatag aataataat^ gt^ataatgt ttttgtaggt agtttgtttt tttgagcgtt 4620 
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gggttggtga tggtcgttgg attttgtgag atggagagtt aattttatat ttaagtgttt 
attaattatt gatgtgtttt tatttttttt tatatgattt taagatgtgt tttttgtatt 
ttgtaaagaa atatattaaa ttaaataaaa gtagtgtttt tatt 

<210> 323 
<211> 4784 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 323 



4680 
4740 
4784 



aataaagata 
tttaaaatta 
gattggtttt 
ggttgtttgt 
tgttatggta 
gagttgttag 
gggaattgtt 
tgtgtttttg 
agtggtttat 
attttttggg 
aaatgttttt 
ttttgaggta 
tttgtgtgta 
attgcgaaag 
gttatgttta 
gttatgggat 
ttagtgtaat 
taaagtcgtt 
attattttaa 
aattagtttt 
cgttagattt 
gatggggtgg 
ggaaagagta 
gttttaaaaa 
ggtaaaataa 
agttggttta 
ttttttttat 
tggggatgtt 
gtaaatttag 
ggggtattag 
tattgttttt 
attttttata 
aaaaatttaa 
tgttatattt 
taaaacgtag 
gtttttttta 
gaatttttta 
aagttttttt 
ttgtgttagt 
cggtttttat 
ggcgtttgcg 
gggacgggag 
cgtattggga 
agtggtttcg 
ttgtagaatg 
aaagatgata 
ttttcgtttt 
gattattacg 
gtttttgtta 
agggataaat 
aaagttaggt 
tcggcggttt 

ggtttatttt 
tcggttttta 



ttgtttttat 
tatagaagga 
ttattttata 
aaaggtattg 
ataggttatt 
gggcgacgga 
tattttttga 
tttttggggt 
tgagttcgta 
gattggggta 
tggagagtag 
gtatatataa 
aagggagttt 
gtattttttt 
aaatggtttt 
agagatgtag 
tttagaattt 
agttttatgg 
attttttttt 



ttagtttgat 
aataaaaata 



gagagttaat 
tttttagtag 
gtcggggtag 
ttatatgata 
atcgtgatat 
aataggttga 
ttttggagta 
ggaaatgaag 
tatggtttaa 
tattagggat 
agtaattaag 
tggttttttt 
tttatttaaa 
ttttaatttt 
tgtagttaga 
agagttaggg 
tttttatttt 
tgaataaatt 
agttgttttg 
gtgtgttagg 
agataatatt 
ggttaaggtg 
tttttgtttt 
tttttatttc 
ggtatatttt 
agtggtatgt 
ggtatttttt 
ttgcggtagt 
gtcggaatga 
ggtatttttg 
tgtaggtgtc 
tttagggcgt 
agtttagcga 
attacgatgg 
aaggcgaagg 



gagtttaacg 
ttgttgttgt 
ttgggttagt 
gttgtttgga 
tgaggttatt 
attagggaaa 
tttataggtt 
ggggagaggg 
taggatagag 
attttttgta 
gggtttattt 
ttaaaaagta 
ttttatagga 
atggacggac 
aaattattag 
aattaatatt 
taataaaata 
agatttttat 
tttaggttta 
gattgtggat 
taaaaatatt 
tttttttagg 
tgtggtggag 
ttaaagttgg 
attttttggt 
gagatttttt 
ttggcgtgag 
ttgacgtagt 
taagtttgaa 
aattatatat 
tgaaagtttt 
tatacgtgaa 
ggtcgttcgg 
atatggtgtt 
aaaagttgtt 
gggtttggga 
ttaggtgtcg 
atgtcgtttt 
ggatataggg 
ttatgggtgt 
gtaggggcgt 
ggttggggta 
tggaggaggg 
ttttgatgaa 
agattatggt 
ttgtgtgtag 

tgggcggcgt 

gataattgtt 
tggagaaggt 
tttgtttttt 
agtttcgggt 



atgttttttt atagaatgta 
tattagtggt tggtgaatat 
attattatta gtttagcgtt 
tattttgttt attgttttat 
tacgtttttg tatggtagtg 
aggtttttta aagtttttat 
agaaataatt ttattaggtt 
ttttgatttg gaggtatgag 
tttttatagg ggtattgttt 
gttgggtttt gtgggagaag 
attttttttt tttttttatt 
aaatagtcga tatggtttag 
atttattttt aattagggaa 
tttaaaataa cgttttagat 
ggagaatcga agtagagtgg 
ggatgttttg ttaaaaaagg 
gaaaagtagg gggaaaagaa 
taagagttcg tataaattgt 
atttgaggtt gtataaaaat 
tatgaattat tgttggtatt 
aggtttggga ttggagtgtt 
tgttaggata aagcggttaa 
gtatttttat taaaatgttt 
aggaaaaaaa aaaataatgt 
atttttgatt ttttaggtgt 
gggttatagt tattattgtt 
cgtgggttat tagtgtatat 
tagtttttag tttttggtaa 
ttagggtcgg gaggttttta 
gtgaaaaaat agtatttttt 
atattttggg agtttatttt 
ataagtttag gtaaatagta 
ttgaatttaa tttttagtta 
tagaaaaaat agtatagtta 
tattgtttgc gagtttattt 
tttagagtta gttattaaaa 
aggtaaaaaa taaaaataaa 
ggattagaga gggcgggttt 
gttttcggtg ggtaatggtt 
taggtttttg ttttattttg 
aggaggtgat cggtatgggg 
tgtgaagggc ggtgggggtt 
gtattttttt atggtgtttt 
tttaggaggg ggtaggggag 
tgggggtttg gagtcgttta 
ggtggtttgg tttttaaggg 
aatgatgtta gtaatgagta 
gtagatatta tttttattgt 
ttttgagggt atttttttgg 
agagttcgtt atagtgatgt 
aggtcgaggt tttttattat 
ggggtagggt tttaagggta 
gatattttgt agggtgatgg 
t»t_aaggta._at.trtt.ttgiL 
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gaaaatatat 
ttgaatgtga 
taggggagta 
cgtttttagt 
tttaatggtg 
ttggaatatt 
agattttatt 
tttagttatt 
attttagagt 
gcgtaagggg 
agtttttttt 
aagtagttgt 
tggatatatt 
attaagaaaa 
gcgtttttga 
taaaagaatt 
ttaaaataag 
tgtaagtatt 
tttagtaaaa 
tttttttagt 
agagaaaagg 
ggttaagtga 
tttattgagt 
atagaaaaag 
ttttataagt 
ttttggagtt 
gggttggaag 
tagcgggaga 
attaggtttt 
tgataaaaaa 
tagtaaagta 
gattaaacgt 
ttgtttttta 
gtgttaaagg 
atggtttttc 
atgaagcgaa 
aaagttttta 
gtttaggttt 
tgtaggttat 
tttttattta 
ttatgttttt 
ggtagggagg 
ggtttagagg 
tttttttttt 
gttcgtttgt 
taagtttgtt 
ggatggttgc 
tttttttagg 
gtggtattat 
ttgtggtttt 
ttttagcgat 
gtgtgttgtt 
tgttatagtt 
ggatgt-tatt— 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240— 



seq_prot.txt 

taatattagt gtcgggttat ttataaattt ggttttgaat tttgtaggag ttttattttt 3300 

ttgggggcgt ttgtggacgt ggatttcgaa ggtatttata gtcgataggt tttttttgtt 3360 

tgtggtatgt atgaaatatt cgtgtttgtt tacgtggttg ttgtcgggaa ggttatatat 3420 

gagttggttg ttgttgttga attgtattta ggatttttcg tttattagtt gttgttttcg 3480 

tagttttagg gttagtttta gtttgttagt ggtggtgttt ttatggttat agaaagtgtt 3540 

tgacgggatt tttattttaa agtaggtgtt aatttaggta tttattttgt taatatggtt 3600 

tttgagtttt gggcgttggt tgggtttttc gttacggggt attttattgg tggtggtgcg 3660 

aatacgagta ggcggtgagg tagtttttaa tttggtgatg gaaattttgg tggtgattcg 3720 

gggtattggt cggggtgttc gtggtttttt ggttggtttg cgagtcgtgg tggtggtgga 3780 

gttagttgaa ggcgttgttg gttttggtgt ggatattcgt ggttttttgg tggtggttgt 3840 

ggoaggggta gtaattgtag taggttttat atagttagga atggttatcg ttgggcgaat 3900 

ttggttagga attgtggtgt tagtttttga tattcgagta ggttggatgg ggtttagggt 3960 

tggggtttga ataatggcgt ttcgagttcg gatggtgatc gtgggttttt taggaatagg 4020 

atttttgatt ggaggagtta tggtttttgt tggaggagta atggttggag atgtgggggt 4080 

tggtacgatt ttggatgggg gtttttggat agtcgtggtt gggggtttaa tggtagtgat 4140 

aggtgtgggt gtagtatgga tttgttttcg gacgcgtttg ggaagagggg gttttttatt 4200 

ggcgatgtgt taatttatta tagggtagtt aagttgagta gatattgcgt tttttttggt 4260 

aggggttttt atattatgaa tgttaggtat attgttttgg tttagggagt agtttagttt 4320 

ttaggagaga agggttttat tttttattat tttttttgta tttttcgggt tagttatgaa 4380 

ggtcgatatg ttaaatagtt tgttatttat tatcggtatt aattttatgt tgtgaagttt 4440 

tatttttgag aagtttcgta ttttgtgtag gaggttaatt ttttgttttg gggttatttt 4500 

ggtgaggtcg gtatttaaaa ttatcgttaa aatagttata ggtttattcg tagtataggt 4560 

agatgatatt attttattag ggtttgggga ggttgttttt atcgattgta gtttattgtg 4620 

gtttttaggg tagatttcga tggagaatat attggaggtt tgggggatgt ggttttcgtt 4680 

ggtttttagt cgtgtagcgt ttattgaaat gtaatgtata tttttattag tgttaagggg 4740 

gaggtttttt agggtgtggt tttgtgagtt ttagtgtagt taag 4784 

<210> 324 
<211> 4337 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 324 

tagtatgagg aaatttagtt gtaagaaagg agatttggga ttatgtgggt gttgtgttta 60 

tttttttgat agtatttttg ttgtgggagt agatgtggta gattttgaaa gtcgggagga 120 

gaggaaaaga ggtgttttga ggaggttgat tttggttggg ggtgaagagg agtttttttg 180 

taattgtagt aaatttatgg agcgatggtt tttattgtta tggaaataat tagtagaggt 240 

ttttattttt ttggattttt aattatagga ggaattatta tatttttttg taattttgtg 300 

tatatttagg aaaagttgaa gtagtttttt taaaattatt tattttttga gattattgtt 360 

tagatagttt ttatgttttg tttgtttttt aatgttgttg ggatttatta aattgatttt 420 

agaatttgtt aatggtttag gttgtagttg gaaatttttg gttttagtag agttgaattt 480 

tattcgtgga gatgttagga gttaggagga ttaggggtgg gtgtttttgg gtttttattt 540 

tagttttagg ggtttttttt ttttgtcggg ttacgtgtag gttgtgaagg ttttattttt 600 

ttagtagtat attaaatttt attattttat ttaagttaat tttgtttttt ttttattcga 660 

gggtttcgcg tggatagaat attaagcgga taagttcgta gggtgtttag attaagtgtt 720 

gagtttgtag tgaatgttta gttatgattt gtagtggatt gatttgtgta aaaattggat 780 

ttttaagttg ttagttttag tgttagtacg agttcggtcg agtttatttg gtcgtaggaa 840 

taagggggtc gttaggagta agagcggagg gggcgtttaa agggaggaga ttaaggttta 900 

attattattc gttttttaat tttttaggcg tatagcgggg acgtagcgtg tttttaaatt 960 

aacgttatta gtagagggtt ttagaggagt cggaaggggt tagggtttgg gcggcggtgg 1020 

gagggtttga gtagtggggg cggggagttg ggggtcggtt ttcgttcgcg ttttagcgtt 1080 

tcggcggtcg tcggggttat tgatttcggg tttggttttt gcggtagcgt ttcgagttgt 1140 

cgtttttcgc gttgttagcg ttcgggaagg aggaaggggg aagggggcgg gtcggtcggg 1200 

tttacggtcg attttttttt ttagtttcgt ttttcgtcgt ttgcgggtga gttttgttta 1260 

agtcgaggtt gcggggtcgg cgtcggcggg aggattgcgg tgtttcgcgg aggggttgag 1320 

tttgttaggg tttatttgat tttgtttttt atttttcgtt ttttaggttc ggaggcgggg 1380 

gttttcgggg cgattcgggg gcggatcgcg gggcggagtt gtcgttcgtg agttcggtcg 1440 

agttatttga gttcgagtcg cgggatatcg tcgtttttgt ttttcgaatg ttgcgtatcg 1500 

cgatgggttt gaggagttgg ttcgtcgttt tatggggcgc gttgtcgttt cggttatcgt 1560 

tgttgttgtt tttgttgttg ttgtttttgt tgtagtcgtc gttttcgatt tgggcgttta 1620 

gttttcggat tagtttgttt ttgggtgagt gtcggggatt cggggacgccj ggtcggggga 1680 

aggaaggttg tcggggacgt gtttcgtgcg gaggtggtcg tgataaagga ttaaggaggg 1740 

giatttgtcgg aggtacjggga tttattagga tttttttttt ttttaaggcg ttgcgcggag 3,800 

ggagggagag gagaggggag ttgtggggtc gttggttttt ttttttagtt ttttttttgc 1860 
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gtagggatgg 
tcggcggggt 
tcgttttcga 
gggttttggt 
ttcgattttt 
atggagagat 
tgatattttt 
atttttaagt 
ttgggttttt 
gttttttttt 
gggttttaga 
gttttttgtt 
gtaaggtgtg 
tttaatttgt 
atagattgaa 
ttgtttataa 
taagtaattt 
ttttagttat 
tttttttatt 
ttagttcgtg 
ttatttagtt 
ggttaggata 
agttaggatg 
tataattttt 
tagatgtttt 
tgttaaaaga 
gttagaaggt 
agttagaata 
ttcgtgattg 
gtgggtttgt 
atgtagttta 
tgtttttggg 
aatagtgtag 
tgttattata 
ttatttttat 
gtaattgaat 
ggtttattag 
cgtcgggttt 
attggatatt 
atgtttcgta 
aattttttga 
aattattgtt 



ggtttagttc 
tcgtttgttt 
gggcggttcg 
atttttagta 
ttttttcgtt 
ttttaggttt 
aagttttggg 
tttgggcgcg 
tttttggttt 
tttttttttt 
tttagagggt 
tttagtgttt 
ttattcgggt 
tggttgttat 
attgttttag 
agggaaatag 
tatttgaatt 
ttagtaatta 
tgtaaaatag 
tttgaaaagt 
tagagtttat 
gggttggggt 
aataggtgta 
tttgttttta 
aatggatttt 
cggtaggttt 
aaaattggag 
tttggttatt 
tttttaggtt 
attttgtgta 
tatttagttt 
gttaggtttg 
tggtatttag 
ttttggagtg 
ttacgtgtag 
aggtttattt 
atgtttttgg 
agggttttag 
atatattttt 



gttttgtcgt 
atttatttgt 
gggtagaggg 
agtcgtagta 
tttttatttt 
tgggagcgat 
agcgaataga 
attttttttt 
ttatatttgt 
tttttattcg 
tttggatttt 
tatttttttt 
atgaagttta 
attgtatgat 
atttttattt 
tagaattagt 
tttgtttgaa 
tgtgattttg 
gtataataat 
cgttaggata 
ttttgtatat 
gttagtttga 
attagtttta 
gattttttta 
atttatggtt 
tatgtttaat 
agtagcgagg 
attattaatg 
tttgttacgt 
ttgtattaag 
tttaatttaa 
ttcggagtgg 
ttatttgaat 
tttttttttt 
gttttagggt 
ttgagtgtgg 
tagtgtgttt 
tgtttttttt 
tttttttttt 



aggatttaga tgtttaggga 
agtggagcgt ttagtttgtt 
gatgaat 



seq_prot. 
tttttttagt 
ttttgatgtt 
aggttattgt 
gggtcgggag 
tggtttgtta 
tagattgcgg 
ttgcggatgg 
agagtgattt 
ttttgtggtt 
gttgtttttt 
aagtttattt 
atttgtaaaa 
taagtgttta 
ttggaataaa 
ttttatagta 
atttaaatta 
atattgtaga 
agtaggttat 
agaatttatt 
gggtttggta 
tttgtttaga 
ttgtagtttt 
ggtttgttgt 
tttttgggag 
ttgggaaagt 
tttggaattg 
ggaatcgagg 
tagttgtagg 
attagtattt 
gtgtttagtt 
gagttttgga 
tttatttttt 
ttgtgagtac 
gggttttttt 

tggttgggag 
tttaaggcgg 
attttgggta 
tgtatgtatt 
gttatttatt 
tagtggggtt 
ttgtatttgg 



txt 

ttttttagga 
gattagagtc 
ttgtagtttt 
ttgtggtggg 
gtgtagtttt 
atgggggtag 
gggtagtttt 
gggagtagta 
attttttgtt 
tagttttttt 
tttttattgg 
taggtagtcg 
atagatgtat 
gtatgtatga 
tttttatttt 
aggtagggtt 
tgatttgggt 
ttaaagtttt 
ttaaggtagt 
cgggaatcgt 
tttggggtga 
ttgggtgttt 
tttgttattt 
aaggtagggt 
tgggttagtt 
gatgggataa 
ggatttgttt 
atttggttta 
atttgttagg 
gagggttgag 
gtaaaagatt 
tagttttatt 
gtatttatag 
ttggttttta 
ttcgagaggg 
ttttttgttt 
ggttcgtggt 
ttaggagaaa 
tttttattga 
atgtttgtgg 
ggttttttat 



ggatgtattt 
gatcgcggtg 
ataaatattg 
cgggtaggcg 
aagttttagt 
ttttgggcgt 
gggcgttgat 
ttttggttgg 
ttagtttttc 
tggggtattt 
gtaagtaagt 
taggtgtttt 
tcgtaagttt 
attagaggtt 
taaaagtatt 
ttagcggttt 
ttaaatgtta 
tgggttttga 
gaggattaaa 
tgaatgagtg 
ttttgtttgg 
tagtttgtga 
ttgttgtggt 
gagatggggg 
tatgggatta 
ttatagggta 
ttcgaggagt 
tttttataag 
gttgtcgtag 
aggatttgaa 
ttttttgtta 
aaagttttgg 
tcgattgagt 
tatttatttt 
gaatgttttg 
tattttaggg 
gattatggga 
gagttatgga 
atttttattt 
tttaaaataa 
agggaatttt 



<210> 325 
<211> 4337 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 325 

gtttattaat 
ttttatttta 
gggtttttac 
ggtgtgtaat 
gggttttggg 
ggaatattta 
tgagtttgtt 
tacgtaggta 
tttagggtat 
ggtgttatta 
atttagtttt 
ttgagtgtgg 
ataaaai:gta 
ttgggaa^ikg 



aataattagg 
agggattttg 

ggggtatggg 

atttagtttt 
ttcggcgttt 
atgagttttt 
tagttattag 
agagtaggag 
agtggtaatt 
tattatttta 



A_tlXT___U.«_ 



attgtat£tt 
agtttattta 
ttaczgggttt 
WMJtgtt 



gttttttatg 
ttttaaatta 
tgggaattta 
atgatttttt 
tatggttatt 
tgggatgagg 
ggtatttttt 
gtgggtatga 
tagtcgatta 
gggttttggt 
tagaaggggt 
aggttttttt 
cggtagtttj: 
gtagaagtgg 
tt£cgggaag 



aaggatttta 
tagatatagt 
gtagaggagt 
tttttaagat 
acgagtttgt 
t&ggaaatcg 
tttcggattt 
gggttaggag 
tagatacgtg 
gggattggaa 
tttttgtttt 
aatttttagt 
g^taggtgaa 
attaggtttt 



gatgtagggt 
tttattgttt 
gagtggtagg 
gtatgtaagg 
ttaggatggg 
ttttaggtta 
ttagttagtt 
gaagtttaga 
tttataagtt 
aaatgggtta 
aggattttta 
tgggtatttt 
tgttggtgcg 
gtagttgtat 
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aagttggacg 
ttaggtattt 
aggaaagagg 
gagggtatta 
tatattgtta 
tatttaaggg 
ttaggatttg 
gagggggtat 
taggtagtta 
tttcgggtag 
ggttggagag 
ggtataatgt 
tgatag^agt 
tggtaataat 
cgttgttttt 



1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4337 



60 
120 
180 
,240 
300 
360 
420 
480 
540 
600 
660 
720 
,780 
840 
-9Q0- 



tagttttgtt 
tttatcgttt 
gtttattgga 
gggtaaaggg 
atttgtttat 
ttaattttgt 
aattttgagt 
ttttaaatac 
ttttatagat 
ttattaagtg 
ttaggtaggg 
tttttttttg 
ggatagtttt 
ataattaata 
cgagtaatat 
tattgggagt 
ttttgaattt 
aaaggagagg 
ttaggaaagg 
gtttaggatt 
aggatttggg 
tttggaggtt 
ggagagggag 
tgggaatatt 
gtcgttttcg 
taggcggatt 
ttagatttta 
tttttttttt 
cgtgttttcg 
ttttcggtag 
taggttgatt 
tagtaggagt 
gttttttagg 
ttcgggttta 
cgagtcgttt 
aagtgggttt 
gtttcgtagt 
aggaagtcgg 
■cggtagcgcg 
gtttcggcgg 
ttattgttta 
tttttgttag 
agggggcgga 
ttttggcgat 
gggttagtag 
gtatttatta 
ttgtttacgc 
atttaatgta 
gggaattttt 
taatattttt 
gattattagt 
aatatagaaa 
aatttttttt 
agtttaaaag 
taaatttgtzt 
gagtattttt 
ggatgttgtt 
taagtttttt 



ttttagttat 
tttgatatgg 
gtatttgttt 
agttgtggtt 
tttgatttta 
tttaatttta 
tgagtaatat 
gagttgattt 
gaagaaatta 
gttgaggtag 
ttatttgaag 
taaataggat 
agtttgtaat 
agttaaaggg 
attttgtaag 
agaaggtatt 
aaagtttaag 
gaagggcgga 
aatttagtta 
tgggaatgtt 
aatgttagcg 
tttttatgtt 
gatcgggcgt 
agaattttag 
ggggcggtat 
tcgtcgaaag 
tttttacgta 
tttttttttt 
ataggttttt 
tttttttttt 
cgggggttga 
agtagtagcg 
tttatcgcgg 
ggtggttcgg 
cgggggtttt 
tggtaaattt 
ttcggtttgg 
tcgtgagttc 
aggggcggta 
tcgtcggagc 
gattttttta 
tggcgttggt 
tggtgattag 
ttttttgttt 
tttgagagtt 
taagtttagt 
gaagttttcg 
ttgttgagga 
aagattggag 
acgggtggag 
agattttgaa 
ttatttaagt 
aagtgtgtat 
agtgaaagtt 
gtaattgtag 
tttttttttt 
aagagggtgg 
tatgttg 



tttgtgattg 
ttttatgagt 
ttattttatt 
ataatagaag 
taagttgaag 
ggtagaatta 
ttatttagcg 
aatttttatt 
agatttagaa 
tatttgaatt 
tcgttagggt 
gtttttagag 
ttttaattta 
tttgcggata 
gtatttacga 
tatttgttta 
tgttttaaga 
gggttgaaat 
attaggatgt 
agcgtttaga 
tttagagttg 
ggggtttggg 
ttgttcgttt 
tgtttatagg 
cgcggtcggt 
tgtatttttt 
gaaagaaaat 
cgcgtagcgt 
tttttggttt 
ttcggttcgc 
gcgtttaggt 
gtggtcgagg 
tgcgtagtat 
tcggatttac 
cgttttcgga 
agttttttcg 
gtagagttta 
ggtcggttcg 
gttcggggcg 
gttgggacgc 
tcgtcgttta 
ttggggatac 
gttttggttt 
ttgcgattaa 
taatttttgt 
atttagttta 

ggtgggggaa 

ggtaaaattt 
tggaagttta 
tttagttttg 
attaatttag 
aataatttta 
aggattatag 
tttgttagtt 
gaaagttttt 
tttcgatttt 
gtatagtatt 



<210> 326 '■' 
<211> 4388 ' 
<212> DNA 

<213> Artificial Sequence 



seq_ prot . 
ttttatttag 
tggtttagtt 
ttgttttttt 
tagtaagata 
tatttaaggg 
ttttagattt 
gttttcgtat 
attttgaggt 
attttaagta 
taggttattt 
tttgttttaa 
gtggggatgt 
tatatgtttt 
tatttgttgg 
ttgtttattt 
gtggaagaaa 
agagttggag 
aggaggtggt 
tatttttagg 
gttgttttta 
tttttattcg 
gttgtattgg 
attatagttt 
attgtaagta 
tttggttagt 
taagggaatt 
tgaagaaagg 
tttgaaaaaa 
tttgttacgg 
gttttcgggt 
cggaggcggc 
cggtagcgcg 
tcggagagta 
gggcggtagt 
tttggggggc 
cggggtatcg 
ttcgtaagcg 
tttttttttt 
ttgtcgtagg 
ggacgaaagt 
ggttttggtt 
gttgcgtttt 
ttttttttta 
atgaattcgg 
ataggttaat 
ggtattttgc 
gggtaagatt 
ttataatttg 
gaggtattta 
ttgagattag 
tggattttaa 
aaggataagt 
aaaaatgtag 
atttttatgg 
ttttattttt 
tagggtttgt 
tatatggttt 



.t 



txt 

ttttagagtt 
tttttaaggt 
ttaaaaatga 
gtaggtttgg 
gttatagtta 
gggtaagatg 
taggttttgt 
aggttttgtt 
gtttgtttaa 
gtagtatttt 
tttaagtgtt 
tgtgaggaga 
attttaagtt 
gtatttataa 
tgtagatgag 
tgggtttggg 
aagtaatcgg 
tatagaagta 
ttattttgag 
ttcgtagttt 
tagtttggtc 
taagttaggg 
tcggttttgt 
gtggtttttt 
attaaagata 
aggggggacg 
ggttagcggt 
gaaggggttt 
ttattttcgt 
tttcggtatt 
ggttgtagta 
ttttatgggg 
ggagcgacgg 
ttcgtttcgc 
gggaggtggg 
tagttttttc 
gcggagggcg 
tttttttttt 
ggttaagttc 
cgatttttag 
tttttcggtt 
cgttatgcgt 
ggcgtttttt 
tcgagttcgt 
ttattgtaag 
gggtttgttc 
agtttaagtg 
tacgtgattc 
tttttaattt 
gggtttttaa 
tagtattaag 
aattttggga 
tagttttttt 
tagtagagat 
agttaaaatt 
tatatttgtt 
taggtttttt 



ggatatgaaa 
tataagtgga 
gagggtttga 
ggttggttat 
aattaatatt 
tatagaagta 
tttaacgatt 
attatatttg 
ggttatatgg 
agataggggt 
gattttatta 
tggaagtttg 
atataatatg 
attttatatt 
aaaggtgggg 
atttaggatt 
atggaaggga 
gatgtgagga 
gaggggtcgc 
gttcgttttt 
gtttttaggg 
atggaggggc 
tgcgatttgt 
tttatttcga 
agtagatggg 
atagagcggg 
tttatagttt 
taatggattt 
acgaggtacg 
tatttagagg 
ggagtagtag 
cggcgagtta 
tgtttcgcgg 
ggttcgtttt 
aaatagggtt 
gtcggcgtcg 
gggttgaagg 
tttcgggcgt 
gggattagta 
tttttcgttt 
tttttgggat 
ttagggagtt 
tcgtttttgt 
gttgatattg 
ttatggttgg 
gtttaatatt 
ggataatagg 
ggtaggggaa 
ttttggtttt 
ttgtagtttg 
aaataaataa 
aagttatttt 
tgtagttaag 
tatcgtttta 
agttttttta 
tttataatag 
tttttatagt 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4337 
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<400> 326* S* 
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seq_prot.txt 



agttatgagt 
aaatatatta 
gtagttgggt 
tgttagtatt 
ttggttgatt 
gtttttgttg 
tttttttttt 
aatattttgg 
ttggttttag 
acgtgaatta 
atagtatttt 
gtaggttttg 
ttttttttat 
atatattttt 
ttattagtag 
gtagggaaag 
tttttttttg 
gataggataa 
gtgtttggag 
agagtgttga 
gttgttgtta 
aggtttattt 
gtttttttaa 
gtaggaagag 
aaggtattta 
gcgttttgaa 
ttgttttttt 
ataggatttt 
aaagtggttg 
aatatacggt 
attttaaagt 
tttttttttt 
tttttttgtg 
gtaagaacgt 
atgcgtttat 
ttgcgggttt 
gtttttttgt 
tattttattt 
tgtatttaat 
gtggtatatt 
taatagtttt 
agttgaattt 
agatttaggt 
gaataagtta 
ggttaagggt 
gtggtatatt 
tttttttata 
ttttaatttg 
tatgggtttt 
gttaagtttt 
aagaaagttt 
tattgtattt 
ggtgatgtat 
agttttgggg 
agagggaaag 
tggttttgtt 

tgggggtgtt 
ggttggaaga 
tattttgggg 
tttggttaga 
tggggaagaa 
tagtttattt 
tatagggtag 
ttttaataaa 
ttattgagag 
ggtatgtttt 
ttaa&ggatt 



ggattagatg 
gttttgattt 
gtttttattt 
agtaagttta 
ttattggtta 
gaatcgtgtg 
ttttttgata 
ttttttatta 
ttatatgttg 
ttaggttgta 
aaatgaaaaa 
aatagttttg 
taaaaatgta 
tttaaaagaa 
tgtggtgggg 
gagaatgtta 
tttttttgaa 
agtttgtttt 
gcgggagaat 
gaaaaataga 
ttgggtagcg 
tagtaagtag 
gggtagtata 
tttggagata 
gatcgtattt 
ttgttttatt 
agaagttttg 
atttgaggrt 
tagggagagt 
tatgagtttt 
gggtttttaa 
ttttttaggg 
tttttttatt 

ggtgtttttt 

tagtcgtgga 
tattttatcg 
tacggtttga 
ttattatgta 
attaatggtt 
ttttgttttt 
tttttttttt 



ttaaaatttt 
tgagttggga 
gttgaaagga 
attagttaat 
cgtttggtag 
aaaagttgga 
tatatatata 
agttaattta 
ttatttagag 
taaggaatat 
aaaaaaaagt 
atgttgtgtt 
aattttttta 



attttaaatt 
tagtagagtt 
tttgtttata 
cgtgaattga 
tttaggaatt 
aaatataaat 
acggtgtgaa 
gtaggttttt 
gaatagtaag 
aaaattagtg 
tggatggaag 
tgagttaaat 
tatttaaaat 



agtttatggg 
ttggagagta 
tagtaaagaa 
ttattatagt 
ttaaagtttt 
ggaatataag 
atatattgta 
agttaattta 
tttatatgag 
tttttttttg 
ggattttttc 
gatgagggta 
tatggagttg 
tagggtagaa 
ataggttgta 
ttgggatttt 
agtagttgtt 
tacgggagaa 
ttacgtatgt 
aatttttttt 
tttttaattt 
ttttttttgg 
aaagaatgtt 
gtgatgtttg 
gtttggaaaa 
gagaatttgg 



gcgattttat 
acggaggttt 
gtgagggtgg 
agattagggt 
ataggaagaa 
attttagtat 
tatagtaata 
tatgatttgt 
ggggtattcg 
tattttttgt 
gagttttttg 
ttattttagt 
ttttagatga 
tgttgggaat 
tgttatgtta 
ttggggttta 
gatattttgt 
tttaaagtgg 
atttttttgt 
tatatgattg 
gggtatatag 
gatttagatg 
tgttttttat 
ggataagttt 
aatttgtagt 
ggcgtgtgtg 
aatggtagtt 
aatagtattt 
attggatata 
gttatttagg 
tggggtagtt 
atagttgttg 
gggtggggat 
ttttggtggt 
tttttttttt 
tttttgattt 
gtttttgttt 
tgattttttt 
tgttagttaa 
gtggatttta 
gagggtaaaa 



tggttgagag 
atttcggtga 
attgttgaat 
attaagtaaa 
agggtagtag 
gaggtitgttt 
tatatatata 
aatcgtagaa 
aatttaatat 
agtggtagat 
aatatttaaa 
tttatggtaa 
agattttttt 
agataatagt 
gtggttagtg 
tttttttttg 
gttttgggta 
cgtattgagg 
aaataaaaag 
agaggaaggt 
tcgaggaggc 
tgtaggtttt 
tggtttcgga 
ttggattagt 
gggaagtgac 
atttattttt 
ttgaatgtat 
atggaagtta 
gatcgttttg 
ttttgaaagt 
agaaagattg 
ttttggggtt 
aaggtatgga 
tgttggagat 
tggaggagac 
aaaagtatta 
gtttttatac 
taaattttgt 
gtggttattt 
agttaaaggg 
ttaagcgtga 
gataattttg 
ttttgataag 
tttttttaat 
atttttatgg 
tagtagagta 
ggaaggtaga 
tgttgaaagt 
ggatttatgg 
ttgatttttg 
aaggttagtt 
tgttttatat 
gaaaggagta 
gtgtaggatt 
gttttgaggt 
aatatttttt 
tagttttttt 
tagatagttg 
cgttgttgtt 
ttagagataa 
cgtttcgttt 
ataggttaaa 
ttttagtata 
tgggtgattt 
tgatttaggt 
tttatttttg 



aataatttta 
tgttgtagat 
ttttaaattt 
tttgtattaa 
tagggagaaa 
tttttttttt 
tatataattg 
gtttattttg 
tagattttat 
ttagggggta 
taagtaattt 
gtttttaatt 
ttggttttgt 
tgtttttttt 
gaggagagta 
tttatttttc 
ttgttttagg 
gtgtgagtgt 
gagaatttta 
tgagtttgta 
ggtatttgaa 
ttaagttttg 
gagtatatta 
tgggttttgt 
gtgagagatt 
gtttggttga 
ttaggtttta 
tttgtatagt 
gtaggaggtg 
ttttagtttt 
ttaagtgttt 
cgggttggtt 
gtatttgtat 
gttggacgtt 
ggattaaagt 
tattacgggg 
ggtttagata 
tttttgtata 
atttgtttgt 
attttaaggt 
ggattttcgt 
tgtatttaag 
tattttttaa 
tggtgatttg 
taatgtattt 
tttggtgatt 
ttttttagtt 
tttgtttttg 
agagtagtaa 
tttttatttt 



ttttggagga 
ttaatgtatt 
attaattgtt 
atatataatt 
gttttgtaga 
tttttttttt 



ttaggatttg 

aggggtgaat 
ggggttttgg 
gttgtggtta 
atagrtagat 
gtagatttcg 
ttatttttta 
tgtttttata 
attatttagg 
tttaaaatta 
tttatcgttt 
aaagaaaggt 
attatgggtt 
tattgtttgt 
gagttgttcg 
attgtttagt 
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taataataat 
ttaagtaagt 
ttgattgtaa 
tttaattttt 
agattatgtt 
tgagttgaat 
tatttatgat 
tttattttcg 
gaggaggaga 
gttgtttatt 
tttagtaggg 
gtgtacgtgt 
ggtagtgata 
gaggtttgag 
gatcggagga 
ggggagttta 
ggggagagag 
tgaatatgga 
taataatgga 
atattttatg 
ttggagggga 
aaatattttg 
gaaaatgaaa 
tttagttttt 
ttggagtttt 
ttaaagtagt 
agtatgaagt 
tatcgtttat 
tatttggtta 
gaggtagagg 
atttttgttg 
tatttcggta 
ttagttttta 
taaatttttg 
agttttttat 
ttatttgtag 
atggttttaa 
gagaaagtta 
ttttatgaaa 
gttaatttat 
ggtaagatta 
gtttttcggt 
gttgatttta 
tgtgttataa 
ttttagtggg 
tgtttattgt 
tgttgtattt 
ttagagaata 
ttgtttaggg 

tattgttttt 
gcgtggtttt 
attggtttag 
ttgattgggg 
gggtttggtt 
ttatttttgt 
ttattttagt 
attttttttt 
aattgaaatt 
ggtttttttt 
taagtgatta 
jcstattiaaa ... 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
_4Q2Q_ 
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tttggggata attttatgta tttgtgttaa ggatatgttt aagaatataa tttttttgtt 4080 

gttgtttgtt taagaagtat tttagtttgt ttaagaagta ttttatatag tataatatat 4140 

atttttttga aattatattg tttgtttatt agataattga atgtagtaat tttgttttgg 4200 

atttaatttg attgggttaa tatgtaaaaa ttaaggaaaa atatttagtt tttttttttt 4260 

tttttgtata ttttttaagt tattttgtta tgtatatagt tatttatgtt taaagtttgg 4320 

tgattattta tttaaatgaa gattatattt tatattaatt tttgtattta tagtagataa 4380 

aatagtat 4388 

<210> 327 
<211> 4388 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 327 

gtgttatttt gtttattgtg gatataaaag ttgatatgaa atgtgatttt tatttaaatg 60 

aataattatt aggttttagg tataaatgat tgtatatatg ataaggtagt ttgaaaagta 120 

tataaaaaaa aaaaaaaaaa ttaaatattt ttttttggtt tttgtatgtt aatttagtta 180 

aattaaattt agaatagaat tattatattt aattgtttga taaataagta atgtaatttt 240 

aaaaaaatat atattatatt atataaggtg ttttttaaat aaattaaggt gttttttaaa 300 

taaatagtaa taaaagaatt atgtttttaa atatattttt aatatagata tataaaattg 360 

tttttaaatt taagtgtata taaaaatttt tttttttttt ttgttttttt agatttttag 420 

ttttttggtg attatttaat tggataatta gaaatgaatg aaatttattt ttttaaataa 480 

gatatattaa gagaagttcg agtagtttat ttgggttatt ggttaatata gatattattt 540 

tttaataggg ttttaattat agataataag gttatttaga ggaaattaaa tatttttttt 600 

tgttaagaaa aaggaagtaa tttataattg tgttagagaa gtaaaggttt agggaaggtt 660 

gttttgtggt tggaatgagt tttttttttt tagtttatag gttaaaggaa attgaaggaa 720 

atgagttggt aggagtggag gcggtaggga acggggcggg ggaggaaggg ggaggatttt 780 

tttttttaaa ttaaattttg attttggatt atttttaaat tattaagggt atacgtaatt 840 

tgattaggtt ttagttagtt tgagtagtga tagtaacggt ttttattttt ttttcgtgtt 900 

ttagggtgtt gggttaattg taggaatata gttgtttata gtagttgtga tagttgtttt 960 

ttttaattag ggttacgttg ggaaatgaag aagagttgga ttattttagg gattttaggg 1020 

tatttttaaa gggtaatgcg gggtttgtaa ggaatgtttt taggtagttg tagtttgtgg 1080 

tagggttatt ttgagtaagt ttggttgtgt tttagaattg tatttagttt ttgttttatt 1140 

tttttttttg ttttttagta gttataataa ttttgtataa gtattgttta gttttatgtt 1200 

ttagggttaa atgtaatatt agggtttttg ttttttttaa ttattattta tttttattat 1260 

gtattattat agtgaatagt ttatttttgt gtaaggtata tatacgtcgg gaagatttaa 1320 

gtatagtgtt tattggaata ggttttgaag ttgattttat tgtaagttta gggagggagg 1380 

gttttttttt gtaatataaa aataaaaata aaaattagga atttattttt tatataggga 1440 

gatttaatta agattaattt gttgtttttt ataggtttat gggaggtatg aattaattgg 1500 

aatttatgat cggaaagtta gaggtagagt ttttagtata tttgaatttg tagtgtatta 1560 

agttaaaata gttttgttaa ttaggggatt tgtttttttt gtgtgttttt tgtattttta 1620 

tagggggaat gaattgataa ttattagata ttttgttata gttatataaa agttttaagg 1680 

tgtattattt ttataaaagg atgtattgtt ataggaatat aagagggtgt tataataaat 1740 

ttttgattta gtttttttta agttattaat taaaggagtt attttaaatt ttttgttgtg 1800 

gtttattttt agagttattt aaaaagtgtt tattagaggt aaaatgttta ttttttaggt 1860 

ttgggttttt ataagtagtt taaatgtata gagttatttg agttttaatt tatagattga 1920 

gtttagttgt gaagagttac gggaattttt acgtttagta atatagtaag ggggaaagag 1980 

agttgttata aagatttagt tttggaattt ttttggttgt ttttaataga agatagaaga 2040 

tgtgttatta agaattgagt aagtaaatga atggttattt atttagaaag ttattggtgt 2100 

tggatgtatg tcggagtgta tgtaggagat agaatttggt taaagtggtg tatgatgagg 2160 

gtaaaatgta gtagggatta tttgaatcgt gtgggagtta gggagttttt agatcgtggt 2220 

agggaaattt tttgtttttt tcgtgatgta atatttttgt aaggaatgcg atgaagtaga 2280 

gttcgtagtg gttaagtggt tttggttcgt tttttttacg gatgtttttt tacggttagt 2340 

gggcgtatgt aggcggtggg cgtttagtat ttttagtagt aggttataga ggggtattac 2400 

gtttttgtat tttatgttgt atagatgttt tatgtttttg ttattgtagg tgggaagata 2460 

tagaaaggat tattttagag ttaattcgag ttttaggagt gttgaaattt ttagaagggg 2520 

aaggaagagg aattttaaag atatttagta attttttttt ttttttgttt aaagatttat 2580 

tttaggatgg gagttgggaa agttgagggt ttttagaagt tttaatttgg aatttatggt 2640 

cgtgtgtttt ttatttttta ttttttatta gggcggtttt atttttatat tttttttgta 2700 

gttattttta gggtatttgt tatgtaggtg atttttatgg attttcgtgg ttttaaatga 2760 

gattttattt ttttttaata ggatttaaat gtatttaaat aggatcgtta aagtttttaa 2820 

aaagataata tgagatattt agttaagtag gagtgggtat tttaaatgag taaagtagtt 2880 

taagacgttt tattgttaaa ttttttacgt tattttttat ttaatttaga atacgattta 2940 

aatgtttttt tatgtttaat aaggtttaat tggtttagtt tttatttatg tttttaagtt 3000 
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ttttttgttt ttttttttta atgtgttttt cggagttata ttgatttttg tgttgttttt 3060 

aaaggaatta agttttttta agatttaggg gatttatatt tgttgttttt atttgttgqg 3120 

atgaattttt tttcggtttt taaatgtcgt tttttcggaa aggtttatcg ttatttagtq 3180 

gtagtagttt taagtttttg taggtttagt tttttttttt tatatcgttt tgtttttttt 3240 

agtattttta ttattgtttg aaattttttt ttttatttat ttgtgtttat tttttcgttt 3300 

ttaaatatat acgtgtatat atttatattt ttagtgcgaa tttttagaga agtaggtttt 3360 

attttgtttt ttgttaggtt taaaatagtg tttagagttt agtttacgtt tagqaaaata 3420 

aggggggaaa tgggtagcgg gaaatgggta gagggaaatg taggtaaatg gtattttttt 3480 

ttttttattt tttttttttg ttttttttta ttagttataa ttttgttgtt ttattatatt 3540 

attggtggcg ggggtgggga ggagaataat tattatttag tttgaaattt tttttaaaag 3600 

aaatatgtat tatgagtagt aggattaaga gaaggtttta gaaaaatttg tatttttagt 3660 

aggggaaaat ttagtttaag ttggagattt gttatgaaaa tataatatta aagttattta 3720 

gggtttatag tatgatttaa attatttgtt taagtgttat tttttttttt ttttatttaa 3780 

aatgttataa aagttaaatg ttttttggat ttgttattgt gttttttgtg taatttgatg 3840 

atttacgttt atagttagat gaaatttgat attaaatttt ttgagtagta atatatggtt 3900 

ggggttaagt ttatttaata aaatagattt ttgcgattta gattggttta gtgaggaatt 3960 

aaaatattat tattattata attgtgtgtg tgtgtatgtg tgtgtgtgtg ttagagagag 4020 

agagagaaag agagagagag agaaagagaa gtaatttttt tagtttttta tacgatttta 4080 

atagaaattt tatagagttt tttttttgtt gttgtttttt gttagacgtg gttagtaaaa 4140 

ttagttaggg ttgtatattt agtgtaaatt tatttagtat taattaatta ggtttgttaa 4200 

tgttgataag tagttaatgg atttgaaaat ttagtaattt tttttaataa ataaaaatat 4260 

ttaattgtaa tgtattgaat ttataatatt atcgaaattt ttagtttaga gttaaagtta 4320 

atatgttttt ttttagggta aggttatttt tttagttagg gattttggta tttaatttat 4380 

ttatggtt 4388 

<210> 328 
<211> 4200 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 328 

gaaaatgagg gttggttttg aagttggata gatttgtttg atttgtttgt ttttagtttg 60 

ggttaattat attgttttag taagttattt atattgtttt tatatttttg aaatgaggta 120 

tatgtttgtt ttatttggga atgtgaggat taaggagaat aatatatata taatgttgaa 180 

tatttatgtt ttttaattat tttgaagttt tagaaatatt tttagttttt aggtgagttq 240 

tgattaaatt tgtttattaa tttaatatat atttattgaa tgtttatttt gtgttgtagt 300 

ttttggtagg tgtgtttgat agtgggtgtg taatatgttt agggtttgta ttttatgagt 360 

gtggggattt tttttatttt ttggtatata tggttttggg ggagaagtta aggggaaggg 420 

ttaggagttt atatggtaga tttagtaagt ttttagtata ataattttaa ttgtaaaaat 480 

aaatagtttt gttaattgtt tgagagtagt gtttgtttta taaaaattaa taataataaa 540 

aggaaaaaaa atttaaagta aaatgttata tttaatggtt gttggataag atattgttgt 600 

ataaagtttt gtgtgagatt gttttggtgt tgttttagtt tgtttttggg ggttatagaq 660 

gaaggtgtgg ggtgtgtgtt agttataaaa gattgttttg aattttaggg tagtgtttat 720 

ttattgggtg gttttagaag atttttttta aagtgtggtt gataagggtg atagtgtttg 780 

gattgtggga attgtttgtg gagttggtgt tgaatagggt gtgtgtggtg ggtgttttaa 840 

ggaaattgga agtgggattg gaggttggtt tttgggtgtg tgaggaggag ttggaagaag 900 

agtggggagg ggaatggttt ggaatttgtg tgtgttgaat ggtggtgttt taatttataa 960 

atttattttt tgtttagaga gaagtgaagg agaggtttag tgagtaaaag ttgaggtttt 1020 

tgagttgttt tttggtgtgt gttttttgtt tgaggtgtat tttgttgtgt tgtttttttt 1080 

tattgttgtg tgtgaagtgg tggttgtata tgttgtaagt aaaaggtttt ttgttggtgt 1140 

gggtgtgtat gtgtgtggtg aggtggttgt tgtggttgaa gttgtggagg tagatgtggt 1200 

attggaaggg tttgtggttt gtgtggatgt gtaggtggtg attgagtttg ttggagtgtg 1260 

taaagttttg tatataattt tttattgggt aagtgaaggt tttggtgtgt ggttgtgggt 1320 

agaagtagtg tgtgttgtat ttgttgttgt ggtgtttttt gtgttgtgtt ttggtttggg 1380 

ggaaaggggt gggtggtggt ggtggtatgg gtggtgtagt tatgttattg tttttaggga 1440 

tgtttgttat tagaggttta gggaagtttg ttgtggtggt gttgtgaagg tttagtgggg 1500 

aaagttgagg ttgtgtattg gttagaaatt ttttgttgtt gttgttattt tttttttttt 1560 

tattaggagg ggttaggagt ttagggaggt ttttagtttt tttttttaag ttatttgggg 1620 

ttagtgggaa agtgttatag gttttgttgg gatagagttt gttggttggg atggttggta 1680 

gttttgtagg gtagttgatg gatagtaagt ttttaatttt ggtttttatt atgggaaaat 1740 

gagtttttgg ggtggtttgg tagttttttt gtgatttata gttttttggg gtttttatag 1800 

aaagtagttt ttagggtgtg tagggatttt tgaaggtaga gatagtgttt agtgttggtg 1860 

aggtgggagg tgtttggagg ttgggtttga tgttgggtgg ggagagttga ggtttatata 1920 

ggtattgtga gg gggtattt gtgtaaggtg aggtttttta gaa tgttttt qggaaagggg 1980_ 
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tagtgtttag atttggggag taaaggtttg gtggatttgg tagtaaggta ttggattttg 2040 

taggaaaagg ggtatttagt ggggatttgg atgttgttgt ttttggattg gggaagggtg 2100 

ttaggtttaa gatgtttgat atgaggttga agagtgtttt tgggttgtgt gggtgtttgg 2160 

gtattgtttg aatgaagaag ttattgttgt agttgaggtt gggagggggt gtgggtgtag 2220 

gtttttttag gaagtaggag ttggttaagt ttttatttgt gttgtagttg tttaaagttt 2280 

agtttaagaa gttgtttgtt gaggagggag taggaattag ttttgggtat attaaagggt 2340 

gtatttgagt ttatagtttt tatttataaa atttttggtg tttatggata tatataaagt 2400 

gttttttgta tattttgatt tagtaggggt ttgtatatgt tttaaggagt atatagaaat 2460 

atgtatatgt aaaatatatg tgtataggta aaagggtgtt ttgataattg tataggtttt 2520 

tgtggaggtg ttggtggttt agtgtgggtg ggaatggggg tgtgtatttt aggattttta 2580 

agttttttat tgtttttatt tttatagttt taatgtttta gttttttttg gtttttaggt 2640 

atggtgtgtt tttgtgttgt gttgttgttg tttgagtatt tttgtttgtt ttttttattt 2700 

ttagggtagt tggtggttgt gggagtgttt ttggtaggta gttggggtaa tttagtgttg 2760 

ggtttggtgt aatagttttt agtggatttg atgaggagtg tgttgggttt ggaaaatttg 2820 

ttaaggtgga gtatggtgtg gtgttggttg tggggtgttt ggggttttgt ttgttgggtt 2880 

tgggggtgtg tgggtggtgg ggaggttggt ggtaggggtt ttttgtagtg tatagattta 2940 

ggtgtttggg tttttggttt tgggtattta tttttgggaa aggagttggg gagttgtggt 3000 

gtttttgttt gtttgtattg gttttgggtg gtggtttttt gtttttttaa agtgttgttg 3060 

ggttttttta gtttatagtt tttttattta tatagttgtg gtggtgttgg ttttgggttt 3120 

ttgatttttt gggttattgt tattttggga ggttttggtt ttgttgttgt gatgtaaatg 3180 

tttaaatatg gatataggat gtgtgttggg gatrtttgaa aaggaaagtt tgagttgtga 3240 

tgttgttgtt gttgttgttg ttgttaggtt tgtgttgttg tgtgtgtgtg tgttttttag 3300 

gtgtggagtg ttggggttgt ggatgtgtta gtttggggtg ttgtgtttta ttgggttgat 3360 

tgtttggtag atgtgttttt tatttggagg tagaggttga ggggtagtag gggaggattg 3420 

tgttgggttt gaatttgttt tttgttatta attagtttga tatttagttt tttttaatgt 3480 

tagtttttta tttgtgtatt ttttgtggag agaaattgag agtatataaa tgtttaggtg 3540 

tatttagtaa attatagaga ttattgttat ttattttttt tgtttatgtt tgtgtgtttt 3600 

tttattttaa gttttttttt tttaaatttg tgttaggaaa aataatttaa attaaaaagg 3660 

attgtaatat ttgttaatag taggaggaaa gttaattagt ataatagggt atattttaaa 3720 

ttgaaaagta aaagtttttt ttttttattt atttttattt ttagtttttg aggtagttat 3780 

atttaagttt: tttgggtatt tttttagaag aattttatgt atatttaatt atatatattg 3840 

ttttgtaata ttattattat tattattttt ttatttgaaa taatttagag attgtgttat 3900 

gttttgatat aaagattttg ttattttttt ttaaggttgt atagtatttt attgtaaggg 3960 

aggattttga gttagttgat ttgtttttta tttatgagta tgtaggtagt ttttaatttt 4020 

ttatttatat taatgttgtt ggagttaata tttttgatta tataaatgtg aatgtgtttg 4080 

tgagataaat tttaatgagt gtaattgtgt gttaaagtat atgtaaaatt ttgatagata 4140 

attggtaaat tgttttttag aaaagtaggg ttaatttttg tttatttttt ttttatttat 4200 

<210> 329 
<211> 4200 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) : f 
<400> 329 

gtgggtgagg gaggagtgag taggaattgg ttttattttt ttggaagata atttgttagt 60 

tgtttattaa aattttatat atattttggt atataattat atttgttgga atttatttta 120 

tagatatatt tatatttata tagttaagga tgttgatttt aataatattg gtataagtaa 180 

aaagttggaa attgtttata tgtttatgga taggggataa attgattaat ttaaggtttt 240 

tttttataat ggagtattat atagttttga aaaagaatga taagattttt atgttaggat 300 

atggtataat ttttaagtta ttttaagtga aaaaataata ataataataa tattgtagaa 360 

taatgtgtat ggttggatat atataaaatt tttttgaaag aatatttaag aaatttaaat 420 

gtggttattt taggagttag gaatgaaggt aggtggagag gagggatttt tattttttaa 480 

tttagaatat attttgttgt gttagttgat ttttttttta ttgttaataa gtattatagt 540 

t ; ttttttaat ttaaattatt ttttttagta taaatttaaa aaaaaaaaat ttaaaatgga 600 

aaaatatata ggtgtgagta aaagaaatga atggtaataa tttttgtggt ttattgagtg 660 

tgtttggata tttatgtgtt tttagttttt ttttatagga aatgtatagg tgagaaattg 720 

atgttaaggg ggattgagtg ttaagttagt tagtggtaga gggtagattt aaatttaata 780 

tggttttttt ttgttgtttt ttggtttttg tttttaggtg ggaagtgtat ttattggatg 840 

gttggtttgg tgaggtgtag tgttttagat tggtgtattt gtggttttag tgttttatgt 900 

ttggggagtg tgtgtgtatg tagtggtgtg agtttggtgg tggtggtgat aataataatg 96Q 

ttatagtttg agtttttttt tttgggagtt tttggfcatat attttgtgtt tatgtttggg 1020 

tatttatgtt atggtggtag ggttggggtt ttttaaiaatg gtagtggttt ggggagttgg 1080 

aagtttggag ttagtgttgt tgtagttata taagtggggg ggttgtgggt tgggggagtt 1140 

tggtagtgtt ttggagaggt gaggagttgt tgtttgaggt tggtgtgggt gagtgagggt 1200 
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gttgtggttt tttgattttt tttttagagg tgagtgtttg aagttaggag tttgggtqtt 1260 

taggtttgtg tgttgtgggg aatttttatt gttagttttt ttgttatttg tgtgttttta 1320 

agtttagtgg gtgaggtttt gggtgtttta tagttggtgt tgtgttatgt tttattttag 1380 

tgagtttttt gaatttgatg tgttttttgt taagtttatt gaaggttgtt gtgttgaatt 1440 

tagtgttgaa ttgttttggt tgtttgttag ggatgttttt gtggttattg gttattttgg 1500 

aggtaaggag ggtgagtagg ggtgtttaga tgatgatggt gtagtgtggg ggtgtattat 1560 

atttgagaat taggagggat tgggatattg gagttatgag aataggggtg atgggaagtt 1620 

taggagtttt ggggtgtgta tttttatttt tatttatatt gggttgttag tgtttttgta 1680 

ggaatttgtg tggttattgg agtgtttttt tgtttgtgta tgtgtgtttt gtgtgtgtat 1740 

g H53 t * t ? t 9 tttt:t tggg atgtatgtgg gtttttgttg aattagaatg tgtaaaaggt 1800 

?5Hi§S 9ta tatttgtggg tattaagagt tttgtaggta ggggttgtgg atttaggtgt 1860 

attttttgat gtgtttagag ttgatttttg tttttttttt agtaggtgat tttttgagtt 1920 

? gg £:" gaa 9* aa 9*9999 atttagttga tttttgtttt ttggaggggt 1980 

ttgtgtttat attttttttt ggttttagtt atagtggtag tttttttatt taggtagtgt 2040 

H?-22HHi g ta i9atttg gaggtatttt ttaattttat gttgggtatt ttaggtttgg 2100 

tatttttttt tggtttagag gtagtagtgt ttagattttt gttggatgtt tttttttttg 2160 

tggggtttga tgttttgttg ttgggtttgt tggattttta ttttttggat ttgggtgttg 2220 

553 t 5 tt " t agaggtgttt tgggaggttt tgttttgtgt gggtgttttt ttgtagtgtt 2280 

tgtatgagtt ttagtttttt ttgtttgatg ttaagtttgg tttttgggtg ttttttgttt 2340 

tgttagtgtt ggatgttgtt tttgttttta agggttttta tgtgttttgg gagttgtttt 2400 

^i 99999 * tttagggaat tgtgggttat agggagatta ttaggttgtt ttggaggttt 2460 

gtttttttgt aatagggatt aagattgagg atttgttgtt tattagttgt tttgtggaat 2520 

tgttggttgt tttagttaat agattttatt ttagtggggt ttatgatgtt tttttgttgg 2580 

ttttgggtga tttaggggag ggggttgagg gtttttttgg gtttttgatt ttttttagtg 2640 

gggagggagg gagtagtggt gatggtggag agtttttggt tagtatgtag ttttagtttt 2700 

""9 tt99 9 tttttgtagt gttgttgtgg tggatttttt taaatttttg gtggtggata 2760 

tttttggaag tagtggtgtg gttgtattat ttgtgttgtt gttgttgttt attttttttt 2820 

tttaggttaa ggtgtgatgt aaggggtgtt gtggtggtaa atgtagtatg tgttgttttt 2880 

S^SHS" 9 tat 9 ttaa 9 gtttttgttt gtttggtgga gagttgtgtg tggagttttg 2940 

^HiSPHH 3 tgfgtttaat tgttatttgt gtatttatat gggttataaa tttttttagt 3000 

g I §i at i tg tt1:t tgtaat tttagttgta gtgattattt tattatgtat gtgtgtattt 3060 

atattggtga gaagtttttt gtttgtgatg tgtgtggttg ttgttttgtg tgtagtgatg 3120 

agaagaaatg gtatagtaag gtgtatttta agtagaaggt gtgtgttgag gagtqqttta 3180 

agggttttgg tttttatttg ttgggttttt tttttgtttt tttttgagta agagatgggt 3240 

H232222: t9 9ggtgttgtt gtttggtgtg tatgagtttt gggttgtttt tttttttgtt 3300 

£££"1*22 i t H tttttg tat 9tttgag ggttggtttt tggttttgtt tttagttttt 3360 

ttgaagtgtt tgttgtatat gttttattta gtattagttt tgtggatagt ttttgtggtt 3420 

HSHHESS* 52H tttg i t a 9ttgtgttt tgggggaagt tttttgagat tatttagtga 3480 

ataggtatta ttttgggatt taagatagtt ttttgtaatt ggtatatgtt ttatgttttt 3540 

ttttataatt tttagagata ggttggggta gtgttaaggt ggttttgtgt gggattttgt 3600 

a * ag 3 ag ! gt H tatttagt agttattgga tgtaatgttt tgttttgggt tttttttttt 3660 

t 5l2H 9 i tg ttaa tttttg taaagtagat gttattttta agtagttgat aaaattgttt 3720 

atttttgtaa ttaaaattat tgtgttaaaa gtttattgaa tttgttatgt aagtttttga 3780 

JSS*? 1 *** ^astt^ttt tttaggatta tatgtgttag aagataaagg agatttttat 3840 

gtttatggga tatagatttt gaatatatta tatatttatt gttaggtata tttgttagga 3900 

23H5 g .§£2S a 9 a 9taggta tttagtaaat gtgtgttggg ttaatgaatg aatttaatta 3960 

^HPH* aa 99gttgga ggtgtttttg gaattttaaa gtggttaaaa ggtgtaggtg 4020 

tttaatatta tgtatatatt atttttttta atttttgtat ttttaaataa agtaggtata 4080 

tattttattt taaagatgtg gaagtaatgt agatagtttg ttggaataat gtagttaatt 4140 

taggttagga atagataggt taaataaatt tgtttaattt taaagttaat ttttattttt 4200 

<210> 330 
<211> 4491 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 330 

tjtagtaatt ttttatgttg aaatttggtt tagtatattt tgattatata tttatattta 60 

attatagttt taaaatagtt aagtagtttt aaaaatttaa attaaatttt tgtggaattt 120 

2:22222522 2 aaaaata tt attgagtaat tggtaatata aaatataaaa tgtttaattt 180 

H g 2H tga 2 S t 2! ag 5 gat taatgaatat ataatggttg tagttttttt 240 

tataattatg ttatttaagt tattgatgaa tatttatgat atatgtagta aaaatttaga 300 

taaatatl:tt ggttataaat tggtttatta aataatagtt tgtattatag tttgagttaa 360 

-9MtAatajg^tj:ataat.tg_.tgtttta&gg ta t taagaag-tt^tttt-gag-gattttt-taa -420 — 

Seite 274 



seq_prot.txt 

aaggtatttt ttgttttgaa agattgtttt ttttttggaa gttttaggtt ttatgagaaa 480 

ttatttattt tgaatttgtt ataaaaaatg atttgtaaat aagttttgtg tagtgaatta 540 

tgtgttttta gtaagaggat tttgaaagtt ttttgtaaaa tgaagttatt tttaaaagag 600 

agttatttag aatataatta aattgattta tatgttttta tttattttaa atatgaaggg 660 

. aaattgtttt taaatatatt taaattttta atagaatagt agtatgttat tagtgtggaa 720 

ataatttatt tgttaaataa atatttgggg tgtatttgtt gtatattagg tattgtaata 780 

tagggattaa tgaaagagaa aaaaaataaa gtatgaaagg atagtagaaa tagttttata 840 

attttatttt ttaggaataa tttttatgga tattttgttg tgtgtttatt gtttttttaa 900 

tatatatatt ttttaattaa atggaattgt tttaaatatg ttatttaata gtttgttttt 960 

ttttatatta tttgaaaatt aagaaatgat tataatatgt tttatttaaa aattttaagt 1020 

ttaggttggg tatagtggtt tatgtatgta attttagtat ttggagggat tgaggtgggt 1080 

ggattataag gttaggagat tgagattatt ttggttagta tggtgaaatt ttgtttttat 1140 

taaaaatata aaatttagtt gggtatgatg gtatgtgttt gtagttttag ttatttggga 1200 

ggttgaggta ggagaattgt ttgaatttgg gaggtggagg ttgtagtgag tggagattgt 1260 

attattgtat ttgtttggtg atagagtaag attttatttt aaaaaataaa taaagaaaat 1320 

ttgaagtata gtattttttt aaattttaaa tagataatag aaattggttt ttttttattt 1380 

aaattagaat ttaagtttaa ttttatatat ttttgatagt ttggattttg ttttttaatt 1440 

ttataaaatt gggaatttaa gtattatttg gtttgattta aatgtaatgt agaatttgta 1500 

ttaaaatatt atattaaagt tttagatttg tagtagttaa tagtattttt atgtatgtgt 1560 

tagggattgt tttaaatatt ttatatatat taattttttt attttgtatt tttgtttttg 1620 

ttttatatag taggaaattg aaatattgag aggttaagta attaaagtta tagagttaga 1680 

gtgataggag taaagtttta atttaggtaa tttagatttt tagagttttg atttttatta 1740 

ttaagttgtt agtatagttt ttttggtaat tatttttaat ttaaatataa tttgagtgat 1800 

ttatttaata agttattatt ttgataattt agtgatttgt aatgtaaaat tatttattgt 1860 

aatttattta atattattgt ttttttgtgt tgtaaaaatt atagtaattg agatgtaatt 1920 

tattattttt ttttttattt ttggtatttt gtgttaattt ttttgttttg tggatttttt 1980 

ttgatttttt attatgtgtg ttaattgtta ttaatttttt tgtttgttgg ggattggggt 2040 

tgtgagggta tatttttgag gggtatgggg ttagggttag gtaggttgtg tggttgggtg 2100 

gggttttgtg ttttattgtg gagtgtgggt tgggaagtgg agagagaagt agttgtgtaa 2160 

tttgttggat gtggattagg gtgtttttta tttttgttgg gagtttgttg attggttggg 2220 

tgtgggtgta tgtgattgat atgtggttgt attggtgtag tttgttaggg tgttattgga 2280 

gatagaatgg aggtgttgtt ggatttggaa atggggtagg tgttggagtt attatggtta 2340 

ggtttgttgt ggggggaggg gggaaggtgg tttttttttg tattgtttta aattgatggt 2400 

ttttttttgg tatagggttt attgtagtat gttaaatgag gaggtagggg tgttgttttt 2460 

ttgtttttta ttgtagtatt ggagatggat tttttgtatt ttggatttag ggtttttgat 2520 

agaagaggaa gaagggggag gggtagaagt gttaagggga gtttgttgag aaaagttgtt 2580 

tttgaagtta gaaggggttt ttgtttttat aatgttattt gatagagtgg aataatagta 2640 

tttaaggaaa tgggtagagg ataataaaga atggagtata tttatggtga ggagtaaaag 2700 

ttttatttta ttgaaaggtt tttttttttt tttggtgata aggatatatg tattggtggt 2760 

taaaagagag aggagataaa attgttgtag atggttg^tg tgaatttagt ggaaagagtt 2820 

attggggatg agagaaagag gaggaggtag gtattgtaga gtgtgagtgg tggtgttggt 2880 

tggtgaaata ttggttatta gtagtgtgtt tgtttttgta aaatatttaa gtaaattttt 2940 

tgtgaatagg gtggtaaata gatattagtg tttttgttag ttataaaatg tagtggtagt 3000 

ggttttttgt ggatgattgt agtagtgttt tttttttttt tgttaggttg aaaagataat 3060 

tgtagaggaa taagaaattt tgtagtaaat gttggggtag aagtttattt ataagaagtt 3120 

atagtttata aatgtagttt gaatagtaga aaaaaaatta ttgtttttta aagtaggaat 3180 

aatgttaggt tatgaatgtt ttgttattgg aatgtattgg atattttgat tttatattat 3240 

gaaagtgatg tttaaatttt ttgatttaat ataaatttta tatgaaattt taataaatta 3300 

tgtatgaaat agtggatttt tttttttgtt agtgaagttt ttatgttatt aattaggtta 3360 

tttaagagta aattatttta taatgtaaat tattttgtta aaaattatgg tgaataaatt 3420 

tttgtaggtt taatatttaa gatttatagt taagtaattt tatatttttt tttggttgtt 3480 

tttagataat attgaataaa tatttaagat attaatttag tgtgtaaata ttttaaatta 3540 

aagtaatatg gttttttttt tagatgtatt tttgtttagt gagtatttat gagatatatt 3600 

tttgatataa agtgtttttt attgagtttt tttttttttt atttaaatgt aaaagtttat 3660 

ttttatgtat atttatagta gttagtttgt tatatttttt tattttttga ataaggatgg 3720 

tagtggtttt gtaaagtttt ttagggttag tgatagtgtt ttttaaataa tgtttatgta 3780 

atagaaagtt ttaagatgat ttattatatt gttttagtaa tatttttaag ataatgtgga 3840 

attgggttgg tttagataaa taagattata taagtttttt attgataaat ttaaatagtt 3900 

ttttaaaaaa ttatttttgt tttaagaatt atttgattat ggatattagg gttaatagta 3960 

tttaggagtt gggtgtggtg gtttatgttt gtaatataag ttaaatagga ggttgaggga 4020 

agttgaggta ggaggattat ttgagttttg gtgtttaaga ttagtttggg taattaagta 4080 

agattttatt ttaaaaaaaa gtatttagag tttttttttt aaaatttgat ggaaattatg 4140 

gttagaatta aagatgttat attaaaattt taaatgatat ttaatagtta atattattta 4200 

gttttttgaa ttttattagt atagtgtttg agagtttata aaattattat ttttatataa 4260 

attatttttt attattttaa tagtttattg tagtattatt tttattttaa atgttagaga 4320 

tttttattta a^tatgttgt gkttt£attt aaggttaaat att^tatakt aatgtaaaga 4380 

atttaagt;tt tttattttat ttikattirfca taaaattata atgtttttta ttktatatat 4440 

ataaaataat atatkataga ggtaatttga aaaatagtgt aagttattga t 4491 

Seiti* 275 



seq_prot.txt 

<210> 331 
<211> 4491 
<212> DNA 

<213> Artificial Sequence 
<220> 

<^23> chemically treated genomic dna (Homo sapiens) 
<400> 331 

gttagtgatt tatattgttt tttaaattat ttttgttata tattattttg tatgtatgta 60 

ataaaaggta ttatagtttt atagaattag aataaaatag aagatttaag ttttttatat 120 

tgttatataa tatttgattt tggataaagt tataatatat ttgaatagga gtttttaata 180 

tttaaaataa aaataatgtt ataatgagtt gttaaaatgg taaaaggtaa tttatgtaaa 240 

aataatagtt ttatagattt ttaaatatta tgttaatgga atttaaagaa ttgagtaata 300 

ttagttgtta aatgttattt ggaattttaa tgtgatattt ttagttttag ttatagtttt 360 

tattagattt taaaaaagag attttaagta tttttttttg agatagggtt ttgtttggtt 420 

gtttaggttg gttttgaatg ttaaggttta agtgattttt ttgttttagt tttttttagt 480 

tttttgttta gtttgtatta taggtatggg ttattatgtt tggtttttaa gtattattaa 540 

ttttgatatt tatgattaag taatttttaa agtagaaata attttttaaa gaattattta 600 

aatttattag tgaaaaattt atgtagtttt atttatttaa gttagtttaa ttttgtatta 660 

ttttgaaaat attattgaaa taatataata ggttatttta agatttttta ttatataaat 720 

attatttaag agatattgtt attgattttg gagggtttta taaagttatt gttatttttg 780 

tttagggagt ggggaggtgt ggtaggttgg ttattataag tatgtataaa gataggtttt 840 

tatatttagg taaaggaaaa aaaagtttaa tgaaaaatat tttatattaa gagtatattt 900 

tatgagtatt tattaaatag aaatatattt agaagaaaag ttatgttgtt ttaatttaaa 960 

atatttgtat attgaattaa tattttaaat atttatttag tgttatttaa aagtaattaa 1020 

agagaaatat aaaattattt aattataggt tttaaatatt aggtttataa aaatttgttt 1080 

attataattt ttgataaagt aatttatgtt gtaaagtggt ttgtttttga atgatttaat 1140 

taatggtata aaagttttat taataaaaga aaagatttat tgttttatat ataatttatt 1200 

aagattttgt ataggattta tgttaaatta aagaatttga gtattatttt tgtgatgtag 1260 

aattaaagtg tttaatatat tttaatgata aaatatttat agtttgatat tatttttatt 1320 

ttaaaaaata gtaatttttt ttttgttgtt taggttgtat ttataaatta tggttttttg 1380 

taaatgggtt tttattttag tatttgttgt aaggtttttt atttttttgt agttgttttt 1440 

ttggtttaat agagggagaa aaagtattgt tgtagttgtt tgtaaaaagt tattattatt 1500 

gtattttgta attaataaag atattggtat ttgtttgtta ttttgtttat agggagttta 1560 

tttagatgtt ttataaaagt aggtatatta ttggtgatta gtattttatt aattaatatt 1620 

attatttatg ttttgtagta tttgtttttt tttttttttt ttatttttag tagttttttt 1680 

tattagattt atattagtta tttgtagtgg ttttgttttt tttttttttt ggttattaat 1740 

gtatgtgttt ttgttgttag ggaggaaaag aagtttttta atggggtaga gtttttgttt 1800 

tttgttatga atatgtttta ttttttgttg ttttttattt gtttttttag atattgttat 1860 

tttattttgt taaatggtat tataaaaata aaaatttttt ttggttttaa aaatagtttt 1920 

ttttagtaga ttttttttaa tatttttatt tttttttttt tttttttttt tgttaaaaat 1980 

tttggatttg aagtatagga aatttatttt tagtgttgta gtggggggtg gggggatgat 2040 

gtttttgttt ttttgtttgg tatgttgtag tggattttgt gttaaggaaa ggttattggt 2100 

ttaagatagt gtgagggaaa attatttttt tttttttttt tgtagtaagt ttggttatgg 2160 

tggttttagt atttatttta tttttgagtt tggtagtatt tttattttgt ttttagtgat 2220 

attttggtgg gttgtattaa tatagttata tgttggttat gtgtgtttat atttagttaa 2280 

ttggtgggtt tttgatggga atggggagtg ttttggtttg tatttagtgg attatatagt 2340 

tgtttttttt tttgtttttt gatttgtatt ttgtagtggg gtatagggtt ttgtttaatt 2400 

gtatagtttg tttagtttta gttttgtatt ttttgggggt atgtttttgt ggttttagtt 2460 

tttagtaagt aaggaagttg atggtagttg atatgtatag taaagagttg ggggaggttt 2520 

gtagggtaga aggattagta taagatgttg gaggtgggag ggagagtaat aaattatatt 2580 

ttgattgtta tgatttttgt agtatagaga aataataata ttaaatgaat tataatgaat 2640 

aattttatat tataagttat tgagttgtta gagtgatgat ttgttagata gattatttga 2700 

attatatttg aattaaaaat agttattaga aaagttatgt tagtagttta gtagtggaga 2760 

t^agaatttt ggaagtttgg gttgtttgag ttgaagtttt atttttgtta ttttagtttt 2820 

gtaattttag ttatttaatt ttttagtgtt ttaatttttt gttgtataaa atgggaatag 2880 

aagtatagaa tagaggagtt aatatatatg aagtatttag agtagttttt gatatatata 2940 

tagaagtgtt gttagttatt ataaatttga ggttttaatg tagtatttta atgtaaattt 3000 

tatattgtart ttaaattgaa ttaggtgatg tttaaatttt taattttatg aggttgaagg 3060 

ataaaattta aattgttaag aatatataaa gttaaattta aattttggtt taaatggggg 3120 

gaaattagtt tttattattt atttaaaatt taaaaggata ttatatttta agtttttttt 3180 

gtttgttttt tgagatagag ttttgttttg ttattaggtg agtgtagtgg tgtaattttt 3240 

gtttattgta atttttattt tttgggttta agtgattttt ttgttttagt tttttaagta 3300 

Qttgggatta taggtatgtg ttattatgtt tagttaagtt ttgtattttt agtagag^tg 3360 

g99tt_tta« 3420- 
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ggttttttta agtattggga ttatatgtgt gagttattgt gtttggttta aatttaaagt 3480 

ttttaaatga agtatattat aattgttttt taatttttaa atagtatgaa aagaagtaag 3540 

ttattaaata gtatatttga ggtaatttta tttgattaaa aaatgtatat attagaaaag 3600 

taatagatat ataataaaat gtttatagag gttatttttg gaaagtgaga ttatggagtt 3660 

atttttgttg tttttttgtg ttttgttttt tttttttttt attagttttt gtgttataat 3720 

gtttgatata tagtaaatat attttaaata tttatttaat aagtgagtta tttttatatt 3780 

gatggtatat tattgtttta ttaaaagttt aaatatattt agaaataatt tttttttata 3840 

tttgaaataa ataaaagtat gtaaattaat ttgattatat tttaaataat ttttttttga 3900 

gaatgatttt attttataaa aggtttttaa aattttttta ttgaaagtat atgatttatt 3960 

gtataggatt tatttataag ttattttttg tgataaattt agaataggta attttttata 4020 

aggtttagaa tttttagaag gaaaatagtt ttttaaaata aagaatattt tttaaaaggt 4080 

ttttaaaaga attttttagt attttaagat ataattgtaa gtttattatt tttgatttag 4140 

attatgatat aagttattat ttagtaaatt agtttgtagt taaaatgttt gtttaaattt 4200 

ttattgtatg tattatagat atttattagt gatttaaatg atatagttgt aaagaaagtt 4260 

atgattatta tatgtttatt agttattatg ttttaaattt aaaagaaagt taagttaaat 4320 

attttgtatt ttatgttgtt aattatttag taatattttt tttattttta gagattttat 4380 

aaaagtttaa tttgaatttt taaaattgtt tagttatttt aaaattataa ttaaatatag 4440 

atatataatt aaagtatatt gggttaaatt ttagtatgga aaattattga a 4491 

<210> 332 
<211> 4256 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 332 

atatttttta ggtttatgat ttgagagttt attaaataag agatggttat ttttttggtt 60 

tttaaattat tttggaaata aagttatttt tagagaggaa ttttaaaata ttgtttgtag 120 

ttatagtaat tttaaaattt gagtgttgta tggtggaagt agataattta ttttaggata 180 

attgttattt gttatattag tttgaggatg gtggtgttaa agaggagtta tttattttta 240 

ggtatatttt atattaaata taaattgtat aatttgttta aattaaggaa ttatattaaa 300 

ttatattatg gttattaaat tttgttttga gaaagtgaaa ttgatttagt ttttaaagag 360 

ataaagagaa agtataagta aattaaattg tagttataaa aagaaagata aaatgttgta 420 

gtatatttat tgttttgtgt atttaatgaa gttttttgtt ttggttataa aattagtttt 480 

aaaggttttt tttatatttt atagtatgaa aaatttaaaa agtaatttat atgtaaatat 540 

ttaaattatg atagaaattt aaagtaaaaa gaaaatgaat taattgaatt aaaatgtgta 600 

ggatgtttaa atttatttga taatatattt atttgataat atattaatat gaatttagta 660 

ttttaaaatg ttatataaat aaatgttttt atattaaata ttaatgtagt taggatttta 720 

agttaatatt attttttttt ttttatatgt tttttttttg tttttattaa aaattgttaa 780 

aattatttat tttttttttt tttttttgtt tttaaataaa taaggttttt tttaagatat 840 

tgtaggatta taaagttaaa tttttgggtt taagttgttg gtaaaatttt agagatgtta 900 

agttatttat gtattaatta tttttaaatt tttttttaat ttttttataa aataggagta 960 

gggagaggag aaatattttt gtttaaaaat gaggaattga aaatttttat tataaataaa 1020 

ttatattaag taagttaaag atagtaaaag agtaaaaatg ttagtagata tttttaaaat 1080 

ggtaattata tattattttt ggaatgatta tatgaatgtg gtttattatt ttttaagttt 1140 

ttatagtaaa tatatattta tttgttttat ttagttaaaa ataaatataa tatgtagttg 1200 

tttttgaata attttttttt tttttttttt tttttttttt tttttgataa agttttattt 1260 

tgttatttag gttggagtga agtggtttta ttttgttgtt tattataatt ttagtttttt 1320 

gggtttaagt gatttttttg ttttaatttt ttgagtagtt gggattatag gtgtttgtta 1380 

ttatttttgg ttattttttg tatttttagt agaggtgagg ttttatttgt tggttaggtt 1440 

ggttttgaat ttttgatttt aggtgatttt ttttgttttg attttttaaa gtgaagggat 1500 

tataaggtgt gaggtattgt gtttggttgt ttttgaataa ttttgattaa aatttatatt 1560 

tgatatttat tttaatatat attatagatt tttattgata atttttttta gtaagaaaga 1620 

taagttttat ttaggtattt gtgaattgga ggttaagtag ttttagtata ttttatattt 1680 

ttttaagatt ttttttttat tttaaatgtt tgtaaatttt gtatttgata aagagtatat 1740 

ttttatttaa tataaatatg tttttttttt tagatttttt tagtatttga gagatttgta 1800 

tgtgtgtggt tttttatttt ttttttttgg ttttttaagt ttttagggtg ttgttaggag 1860 

gaggtttgtg attataaatt ttttttgaaa ajttttttagg aagttttttt tttttttgga 1920 

gaattgaagt gttatttgat tttaattttt ttgtaaattt tgttttttag agttgtttgt 1980 

tattttttgt tttt^ttgta gattttttat ttatttggat tggt^itttga ttgtaattat 204 { 0 

ttggtgtgtt gggtdgtgtt tttgttttta gtagtgtttg tttttttttt atttgatttt 2100 

gggttgirggt tgtggttagt tagttagttg aaggttttat gttgtttttt gttgttggtt 2160 

ttatgttgtt ttttgttgtt tgttgtttgt tttttttttt tttgtagttg ttgagtgtat 2220 

gtggtttgtt ttattttttg dtgattagtt agtttttttt £t£l#ttttt Jitggtgttgg 2280 

tggaagagtt ttttttgatt ttgtttttta aattttttg^ f agggattgtg gtattttttt 2340 
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aggtaagggg 

tgaatgtggt 

gtagtttttg 

ttttatattg 

tgtgtttgag 

tgtgatataa 

atggtttttt 

ttattgagaa 

gatttttagg 

agagttggat 

ggggtaaggg 

aggtttattt 

ggagttattg 

tttagtgttt 

gtttttttta 

ttataaaggg 

atgatgtatt 

ttagtttttg 

gaaaatgttt 

tgaggtgggt 

ttttgttttt 

agttatttag 

agttgagatt 

aaaaaaaaaa 

ggtggtttat 

ttaggagttt 

aaattagtgg 

agaattgttt 

ttttagtttg 

agaaagatta 

ttataaagaa 

aattttrtttt 



atgttgtgag 

atttttgaag 

ttagagttag 

ttttttggtt 

tgtgtttatt 

tttttttaat 

tttttaatta 

ttgaaattat 

agtgtggggt 

tgtttttttt 

ttttttagat 

aatatggatg 

gaaggatgga 

tagaagtaga 

tattttttat 

ggaagggaat 

aatatatata 

aattatttgt 

tggagtagga 

agattatttg 

attaaaaata 

gaggttgagg 

gtgttattat 

aatgttttgg 

gtttataatt 

gagataagtt 

ggtttggtgg 

gaatttagga 

ggagataaga 

agaagaattt 

aagttaaata 

gtgattatta 



tgagtgtttg 

ttgttttagg 

tgttggtaag 

tttttgtttt 

ttgttaaaaa 

tgttaaattg 

taagaaaaag 

ttgtatgatt 

ttattaagtt 

tttttttttt 

gtgtatattg 

ggggagaatt 

ttttattttt 

gttgtatagt 

attttttttt 

atgagtgttt 

agttttaaga 

tttgaggatt 

tgtggtggtt 

aggttaggag 

taaaaattag 

taggagaatt 

tttttagttt 

atagaattat 

ttagtatttt 

tgattaatat 

tgtatgtttg 

gaaggtggag 

gtgaaatttg 

attttttgaa 

gtattaaaga 

ttttttaagt 



seq_prot. 
gaggaggtgt 
ttgggttttt 
gaaggaggat 
tagttgtgtt 
gaaatttgtt 
aattggggtg 
taaaattatt 
agaaagtgtt 
agaaatttta 
tttttttttt 
tttggtataa 
ttgtttgtag 
aaagttataa 
gatttaaaga 
tttattgaaa 
tttgttttat 
atagtgttat 
ggtttgtaat 
tatatttgta 
tttgaggtta 
ttggttatgg 
gtttgaattt 
gggtgataga 
tattattata 
gggaggttga 
ggtgaaattt 
taattttagt 
gttgtagtga 
gttttaagaa 
aagattatgg 
gtataataag 
ttattaaaaa 



txt 

tattaatttt 

tttgggggta 

tgggtttttt 

tttttgtttg 

tttgttttgt 

tttggtgtta 

tttttttaat 

tttttatttt 

gtttaaagga 

tgtgtgtaaa 

gagtagattt 

gtagatagga 

tttttagatt 

ttagttttaa 

atattttgta 

aggggttgtt 

atattttaag 

tttgttttga 

attttagtat 

gtttggttaa 

tggtgtatgt 

gggaggtaga 

atgagatttt 

taaaaggaaa 

gataggtgga 

tgtttttatt 

tatttggagg 

gttgagattg 

aaaaagaaag 

gtatttttta 

tgtaaggagg 

tatgtattat 



gagtatttag 
ttagttggaa 
tttatttgtt 
ttagtaaagg 
tttttttttt 
tagggaaagt 
tgtgagagtt 
tttaattttt 
ttttttttgg 
atggttgttt 
tgaaaagatg 
aaatggggag 
agaaaaagtg 
atattgtttt 
ttttttgtaa 
gtgagtttaa 
ttaatatttg 
ggtatagaaa 
tttgggaagt 
aatggtgata 
gtgtaatttt 
ggttgtagta 
gatttaaaaa 
gtttggatgt 
ttatttgagg 
aaaaaatata 
ttgatgtagg 
tgttattgta 
aaagaaagaa 
ttatttttat 
taaaagtttt 
gtttta 



<210> 333 
<211> 4256 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 333 

taaaatgtag 
tttttatttt 
gggtggtgga 
tttttttttt 
atggtgtgat 
tattagtttt 
ttttttagta 
aggtgatttg 
ttggattttt 
tgagttggag 
gtaatttttg 
ttatatatgt 
attattttgg 
tttaaagtgt 
tatgttttaa 
tattagttta 
atttataata 
gaaaaatgta 
tagtatttga 
ttttttggtt 
ttattttttt 
tttttagagt 
agttgtttta 
aaggaatttt 

gttgaggggg 



tatatgtttt 
tttgtgtttg 

gggtgtttat 

tttttttttt 

tttggtttat 

tgagtagttg 

gagatagggt 

tttgttttgg 

tttttatgta 

ttttattttg 

ttttttgggt 

gtgttattat 

ttaagttggt 

tgggattata 

aataagattg 

ggatgtgtgg 

atttttataa 

aggtattttt 

agttggtttt 

taggaattat 

atttgtttat 

ttgtttttat 

tatgtaggag 

ttgaattagg 

tag ggggata 



tgataagttt 
ttatattttt 
aattttttta 
tgagattaag 
tgtaattttt 
ggattatagg 
tttgttatgt 
ttttttagaa 
atagtgataa 
ttatttaggt 
ttaagtgatt 
ggttagttaa 
tttgaatttt 
ggtgtgagtt 
taagttagtt 
tattgttttt 
agtagggggt 
agtaggaaag 
tggattattg 
gattttgaga 
aggtagaatt 
attaggtaat 
gggaagggga 
gtttttgatt 
ttttttagtt 



aaaaagtagt 
tagtgttgtt 
gggagtaagt 
ttttgttttt 
gttttttttt 
tatgtgttat 
tggttaggtt 
tgttgggatt 
ttttatttaa 
tggagggtgg 
tttttgtttt 
tttttgtatt 
tgattttagg 
attgtgtttt 
tttaaagtgg 
aaggtttata 
atttatattt 
agaaatgtga 
tgtaattttg 
atggagtttg 
tttttttgtt 
gtatatgttt 
ggggaaggag 
tagtgaattt 
gtataggtfla 



agttatagga 
taattttttt 
tttttttggt 
gttttttagg 
gggtttaagt 
taagttttgt 
tgttttgaat 
atagatgtga 
agtatttttt 
tggtgtgatt 
agttttttga 
tttagtagag 
tgatttgttt 
gttttaaagt 
ataatttaag 
tgtattaata 
tttttttttt 
gaagtgtgaa 
tttttagaat 
tttttttaat 
tgtattaaat 
gagaaatttt 
agagtagttt 
tgtgtttttg 
tLtt^gatttt 
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aaaattagaa 

ttgtaagtga 

tttttttttt 

ttggagtgta 

gattttttta 

taattttgta 

ttttggtttt 

gttattgtat 

tttttttttt 

ttggtttatt 

gtagttggaa 

atggggtgtt 

gttttggttt 

attttttttt 

agttaatagg 

tattatttaa 

ttataattat 

ggagatagga 

attgagtatt 

gatttttttt 

aaattttatt 

tgttttagat 

gattttttaa 

aaaattaagg 

tggtggggtt- 



2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4256 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 

1380 
1440 

15G£)- 



seq_prot.txt 

tttataatta ggaaagaata gttttgtttt tttttatgat taaaagaaga agttatattt 1560 

tttttatgat attaaatatt ttgatttaat ttggtagtta ggaaggttgt attgtggagg 1620 

aaggaaatgg ggtgggggtg gatttttttt taatagagtg aatgtattta aatatgtttt 1680 

tgttggtagg tgggggagtg tggttgggag tagggaggtt ggagggtggt gtggggggta 1740 

ggtggggagg agtttagttt tttttttttg ttaatgttgg ttttggtgag ggttgttttt 1800 

ggttggtgtt tttgggggag atttaatttg gggtgatttt aggggtgtta tatttgttaa 1860 

gtgtttggag ttaatagtat tttttttgag tatttgttta tggtgttttt ttgtttggaa 1920 

agatattgtg gtttttttag aggatttgag ggatagggtt ggagggggtt tttttgttag 1980 

tattggagga agaaagagga ggggttggtt ggttattaga gggtggggtg gattgtgtgt 2040 

gtttggtggt tgtggagagg gggagagtag gtagtgggtg gtggggagta gtatggagtt 2100 

ggtggtgggg agtagtatgg agtttttggt tgattggttg gttatggttg tggtttgggg 2160 

ttgggtagag gaggtgtggg tgttgttgga ggtgggggtg ttgtttaatg tattgaatag 2220 

ttatggttgg aggttgattt aggtgggtag agggtttgta gtgggagtag gggatggtgg 2280 

gtgattttgg aggatgaagt ttgtagggga attggaatta ggtagtgttt tgattttttg 2340 

gaaaaagggg aggttttttg gggagttttt agaaggggtt tgtaattata gatttttttt 2400 

tggtgatgtt ttgggggttt gggaagttaa ggaagaggaa tgaggagtta tgtgtgtata 2460 

gattttttga atgttgagaa gatttgaagg ggggaatata tttgtattag atggaagtat 2520 

gttttttatt agatataaaa tttatgaatg tttgggataa aaagggagtt ttaaagaaat 2580 

gtaagatgtg ttgggattat ttagttttta atttatagat atttggatgg agtttatttt 2640 

ttttattagg agggattatt agtggaaatt tgtggtgtat gttggaataa atattgaata 2700 

taaattttga ttgaaattat ttagaagtgg ttgggtgtgg tgttttatgt tttgtaattt 2760 

ttttattttg ggagattaag gtggggggaa ttatttgagg ttgggagttt gagattagtt 2820 

tggttaatag gtgaaatttt gtttttatta aaaatataaa aagtagttgg gggtggtggt 2880 

aggtgtttgt aattttagtt atttgggagg ttgaggtagg agaattgttt gaatttggga 2940 

ggttgaggtt gtagtgaata gtgagatgga gttattttat tttagtttgg gtgatagagt 3000 

gagattttgt tgaaagaaag aaagagagaa agagagagag aaaaattatt tagaagtaat 3060 

tatatattgt gtttattttt aattgagtag ggtaaataaa tatatgtttg ttgtaggaat 3120 

ttaggaaata atgagttata tttatgtgat tattttagag gtaatatgta gttattattt 3180 

tgggaatatt tgttaatatt tttgtttttt tattattttt agtttatttg atatagttta 3240 

tttgtgataa gagtttttaa ttttttattt ttgaatagag gtgttttttt tttttttatt 3300 

tttgttttgt gagggagtta ggggaggatt taaaagtaat taatatatgg gtaatttagt 3360 

atttttaaaa ttttgttaat agtttgaatt tgggagtttg gttttgtagt tttataatat 3420 

tttagaagag attttatttg tttaaaaata aaaaggaaaa agaaaagtgg atagttttga 3480 

taatttttaa tggagatggg agaagaatat gtagaaaagg ggaaatgatg ttggtttaga 3540 

attttaatta tattggtgtt taatatagga atatttattt atataatatt ttaaagtatt 3600 

aaatttatat tagtatatta ttaaatggat atattattaa atgggtttaa gtattttata 3660 

tattttaatt taattgattt attttttttt tgttttggat ttttattatg atttaaatat 3720 

ttatatatgg gttatttttt agatttttta tattatgaaa tataagaaaa atttttaagg 3780 

ttagttttat gattaagatg aaggatttta ttgaatatat aaaataataa atatattgta 3840 

atattttgtt tttttttttg tagttgtaat ttggtttgtt tatatttttt ttttgttttt 3900 

ttgaaaattg agttagtttt atttttttag gataggattt aataattata atataattta 3960 

gtataatttt ttgatttagg taaattatgt aatttgtgtt tagtatgaaa tgtatttaaa 4020 

aataagtaat ttttttttaa tattattatt tttaaattaa tataataaat aatagttatt 4080 

ttaaaataaa ttgtttattt ttattatgta gtatttaaat tttaaggttg ttatgattgt 4140 

agatagtatt ttaaaatttt tttttggaaa tggttttgtt tttaagatga tttaggaatt 4200 

aaagaggtga ttattttttg tttaatgaat ttttaaatta taaatttggg aagtgt 4256 

<210> 334 
<211> 4414 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 334 

aatgtttgga gtatatattt taatgaatat ttattttatt ttattttttt ttatttttga 60 

attaagtaat tttgaattta aagttgttat gattagtatt gaaaagatta ttggattatt 120 

aattgtgtga ttttgggata gtaatttttt gtattttagt ttgtttatat gttatatatg 180 

a^ggttgaag tttgattttg ttttgtgatt attattttaa atatttgatg aaattaaatt 240 

ttagtgtttg gaatggtagt ataataaatt tattaagaat aaataattta ttgtaaaaat 300 

atattgattt ttaaatgatg taattgatag ttatattatt gtagagggtt gataaataat 360 

aaaagaaatg aaagatgtat atggtgagaa ttgaaattat tttgataagt tttttatttg 420 

tftattattt aaaattaatg attatgttga atgtttataa attataaaat ataaaagaaa 480 

ttttataaat gtgtatgtat aggagtttaa gttattaaaa gttttaaagt ataagtttaa 540 

aftaaattaa ^taaagaagt tgagaggaaa aattggtttt tatttttaat tatt^ttgtt 600 

ttgaggtttt atgtttaata taattttitta agta.gaggtt tiagagagaa gagttgtgag 660 
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gatattttta tatttgtgta gaaggaaaag tttgttattt attttagtat ttttagtgtt 720 

atattgatgt gtattttgga tttattttgt ttttattgta taaatttata tttgatttta 780 

aagaaaagga aaatttaaag tttttttttt ttaaggggat agaaattttt tgtgttaatt 840 

gtttgatttt tttttttgta aggttttatt ggaaattttt tgtaatataa tgtaggggat 900 

ttttttatgt gttgatgttg tttatatagt ggggtgggtt tgattgaaga aaaaaaattg 960 

tatatatgta tgaaagatta tggttttatt tttggaaagt atgaaaqqta attaatattt 1020 

ttaagaagtt tttgttattt aggaaaatta ttaaatattt tatttagaga tatttggaaa 1080 

gattgaagga aaggaagaat gaagaaagta gaatttagat ttatgtgggg agagatttgt 1140 

ggtagaggaa aagtattttt tttgaatttg ataagggatt tgtttggggg aattttttgt 1200 

ttagtttttt attattaggg ttttttgaag ttgggttttt tattgggtag ttttttggga 1260 

gtgtagtggg gaatttttat attttttttt taggtttttg aaggattttg tttttttagt 1320 

gtttttttta ggttggtagg agttttgagt ttgatatttt tttttgatgg gataggtaag 1380 

ttttgtgggt gtgtaaatat gttgtaatta agttttttgt tgattttata gttttgtgtg 1440 

tttttgagaa gaagtgattg tatttaattg tttattgttg gtttgttttt taagagtttg 1500 

ggggtttttt ttttttaatt tagaattagt tgtatggggg gtggggaaat gggggtgggg 1560 

aaggagtggg agggtagtgg tttttgtgag tagagtgatg ttattgagtg agtttttgaa 1620 

tggggagtgt tgttgttttt aagttgattg gtattttttg ttaggaagaa atgttaagag 1680 

gtgggagtgt ttggggaggg aggtaggtgg tttttattgt aggtgtgggg agttgttttt 1740 

ttgttttttt gtttgttttt taagtttgga tttttaggag tggttgaagt tgtggagtgt 1800 

ttttggagtt tgtgaatgaa tttttttttt tttttttttt tttttttttg ttgagttttt 1860 

tttttggttt tgatggtata gtgatataat gatgatgggt gttataattt gtatttgaat 1920 

ttgtaggtga gttgttttga gtttttttgg ggaagaattt taggtgtgtg gatgtaatag 1980 

ttgagaatat taggtgttgt ggataggagt tgggattaag atttttggtt agttttgtat 2040 

ttttttgtat tttttagtat tgttttgtat tttttgtatt tttttttggg ttattatgtt 2100 

ttttatgtga tttgtttggg taatgttgaa tttagttgtg tagtgttgta gtgaattttt 2160 

tttttaaatt gtaataagtt gttttttaag gtaattatgt tttttttgtt ttttttttaa 2220 

aaaataaaaa taaaaaattt atagaaaaaa atttgtgagt ttagaaaaaa gaagtaattg 2280 

gtagaaggtt ttaattaagg taaagagttg taaggtgaag ttaagaaaat gtaggtattt 2340 

aaaaaatgta ggtaattttt ataagggttt ttggggagag gtatatagag ggattttggt 2400 

gttgaaaaag atttagataa aagaaattta gggtggggtg gggggtaaaa tgattaatgg 2460 

aattggggga agggagggaa taaattgtaa agaaattata gaaaagtggt gggtttttga 2520 

gttggagaga agagagggat ttttggtatt ttgatttttt tgttgttgtt gtttttaata 2580 

tgtttgaggt aaaagtttga atggggatta ttaagatttg ttatagataa gtttttgaag 2640 

ttgttttggt gtaggttatt tggtttttta gtttttggtg tgtggttagt gtttggtgtt 2700 

tttggaaagt tatttttggg tagtttttga tagtgtgatt tggtgtttaa gtagtttggg 2760 

attttgtgtg gatttgattt ttttagattg taggtagttt gggaggaggt ttggttgggg 2820 

gaggtgtagg atttttgttg tgtttttttg atgatttggg gattgttatg gttttttttt 2880 

ggtgtttttg ggtttttttg ttttgtatgt ggtgtgaagg ggttagtagg gaaggagtag 2940 

aggatggggg gtggggttgt tggagttttg tggaggtttg ggaggttttt gggtgggaaa 3000 

agtttgtttt gaattggtag ggatgtgtaa taattttttt tattttgaag agtgaaatag 3060 

ggtttgttgt ttttatttta ataagtaaat tggtatttag agtgtattgt agataaaggt 3120 

ggtttgggga tttgaattag gggtttttgg gttagttttt ttgtttagat tataggagtt 3180 

ttttgttttt ttatatattt tttatttttt tttagttttt tgttttagtt gagtaatttt 3240 

attggtaggt ttagtggtag tggtgtgtgt gtgtgttggg agttttgggg gatgtttttg 3300 

gttggagtgt tttattgttt tgtagaggtg tgggtgttgt agggtgatat ttttggtgtg 3360 

tgtagattta gggtatttgg gttgttttgg tttgtggttt gtgttattgg gtgtggagtg 3420 

gtttgggtgt taggtagagg agagtggggt agaaaaatag tgttatagtt taaattattt 3480 

gttttttaat ttaattgtgt tttggtaagt tttgtatgtt tttgggagat gttgtgggaa 3540 

gggggagaaa tttatatggt gtttggagag ttgttttgtt gtgtgtatat ttgtggtaga 3600 

gtttttttgg gttgtgtttt ttatttttgt gttttttttt gtttttattt tttttgtttt 3660 

tttttttttt ttttagtttt tgtttttttt ttttagtttt tgtgtttttt tttttatagt 3720 

tgataaatga atggttagtg tgaaattttt gttttttttg tgttttaagg tagtagggag 3780 

ggaggagtga gggaggtggt gtgttttttt ggatttgttt ttagtttagt ttataagatt 3840 

tgtagaattt agatgtttag gaattgggag ttttgtggtg ggtgtgggtg ttttttgatg 3900 

gagaagtttt gtataggtgg agaaaaataa gttttttaga ttaagtgagt attttttata 3960 

attttgtgtt taaggatgga aggttttagt ttttttttaa gttatttatt ttgtatttta 4020 

taatttgtgg tgtatattta ggagtttgga ggattttgaa aaaaggtttt ggtttgtgta 4080 

aagtgtagag atgttttttt gtgagttggt tggaatgtat gtggtgtttt gggtttagtt 4140 

tgttgtttag tggatgaatg tatatggtta gggtggagtt aggttttgga ggtttagatg 4200 

tgttattttg ggtgttttta agaaagataa tatatatatt gtgtttttaa aatttatgat 4260 

tatttgaatt taatgttagg gttttgtttt aggatattgt aaaagaaggg tttttaattt 4320 

aaaatttaaa tttttttaat tttagggtgg gtgttggatg ggaaagtgga gagaaggtgg 4380 

gtagtgggag gaaaaaagaa agggaaggaa ggga 4414 

<210> 335 . : V ' l " . ' 1 ' [ . ' ' 

<211> .4414 ; : •:/. ; ... - 1 . ' 

<212> DNA : t - 1 ' , 

<213>, Arti ficia.1 .Sequence 1 1 ; ' ' f " 
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<220> 

<223> chemically treated genomic DNA CHomo sapiens) 
<400> 335 

tttttttttt tttttttttt ttttttttta ttgtttgttt tttttttatt tttttgtttg 60 

atgtttgttt tgaagttgag gaagtttaag ttttggatta ggagtttttt ttttgtaata 120 

ttttggggtg gggttttagt gttgggttta agtggttatg ggttttgaga gtatggtatg 180 

tgtattattt tttttaaaga tgtttagggt ggtgtgttta gatttttgaa atttggtttt 240 

attttggtta tgtgtgtttg tttattaagt ggtgaattgg atttagggtg ttgtgtgtgt 300 

tttagttgat ttgtgagaag gtatttttgt attttgtata gattaagatt tttttttagg 360 

gttttttaaa tttttagatg tatgttgtag attgtggagt gtaaaatgaa tgatttaagg 420 

gaaaattagg gttttttatt tttgaatata aggttgtgaa agatgtttgt ttggtttggg 480 

gggtttgttt ttttttgttt gtgtgggatt tttttattag ggggtgttta tatttgttgt 540 

aaaattttta atttttagat atttaggttt tgtaagtttt gtgagttgaa ttgggggtag 600 

atttgaagga gtgtattgtt ttttttgttt tttttttttt gttgttttgg agtatgggaa 660 

aggtagggat tttatattag ttatttattt gttagttgtg ggggagggga tgtagaagtt 720 

agagagagag aatagaggtt gggagagggg aagggaggta gagaagatgg agatagaggg 780 

agatataaag atgaggaatg tgatttagag agattttgtt gtgggtgtgt gtgtggtagg 840 

ataatttttt aggtgttgtg tagatttttt tttttttttg tagtattttt tagggatatg 900 

taggatttgt tgagatgtag ttggattgga gagtaaatag tttaggttat gatattattt 960 

ttttattttg tttttttttg tttaatattt agattgtttt atgtttagtg atatagattg 1020 

tgggttagaa taatttggat gttttaggtt tgtatgtatt agggatgttg ttttatagtg 1080 

tttatgtttt tgtaggatgg tggggtgttt tagttgagga tgttttttgg ggtttttggt 1140 

atatgtgtgt gttgttgttg ttagatttgt tagtaaagtt gtttaattag agtggagaat 1200 

taggagaggg tggggaatat gtaaggaaat ggaggatttt tgtagtttgg gtgagaggat 1260 

tgatttagag gtttttgatt tgggtttttg agttattttt gtttgtagtg tgttttgggt 1320 

gttggtttgt ttgttgagat gggagtgata gattttattt tattttttag ggtgagaaaa 1380 

attgttgtat atttttgttg atttagaata ggtttttttt gtttaggggt tttttagatt 1440 

tttgtggggt tttaataatt ttatttttta ttttttgttt tttttttgtt ggtttttttg 1500 

tattgtgtgt aggataaaag aatttagggg tgttgggaga gaattgtgat agtttttaag 1560 

ttgttaaaga gatatggtgg ggattttgta ttttttttgg ttggattttt ttttagattg 1620 

tttgtggttt gaaggggtta ggtttgtgta aggttttagg ttgtttgggt gttgggttgt 1680 

attgttagga gttgtttggg agtggttttt taggaatatt aggtattgat tatatattag 1740 

gggttgggaa attaggtggt ttgtattaag gtggttttgg ggatttgttt gtggtaagtt 1800 

ttggtagttt ttatttaaat ttttgttttg agtgtgttaa gaataataat aataaaaaaa 1860 

ttaaagtgtt aaaggttttt tttttttttt tagtttaaga atttattatt tttttatgat 1920 

ttttttataa tttatttttt tttttttttt aattttgtta gttattttat tttttatttt 1980 

attttgggtt ttttttgttt gaattttttt taatattaag gtttttttgt atgttttttt 2040 

ttaaaagttt ttatgaaagt tatttgtatt ttttaagtgt ttatattttt ttaattttgt 2100 

tttatagttt tttgttttaa ttaaagtttt ttattaattg tttttttttt ttaagtttgt 2160 

gggttttttt ttataagttt tttgtttttg ttttttaagg ggggaataaa agaaatgtga 2220 

ttattttgga aggtggttta ttgtagtttg gggggaaaat ttattgtagt gttgtgtgat 2280 

tgggtttggt gttgtttagg tgggttatat aggaagtgtg gtggtttggg gaaggatgtg 2340 

gagggtgtgg gatggtgttg gaagatgtgg gaggatgtgg ggttggttga agattttggt 2400 

tttagttttt gtttataata tttaatgttt ttggttgttg tgtttgtatg tttggagttt 2460 

ttttttagaa aggtttgggg tagtttgttt gtaagtttaa atgtgggttg tgatatttat 2520 

tattattata ttattgtatt gttagagttg aggaggagat ttagtgagaa gaaggaggag 2580 

ggagaggagg agggtttatt tataggtttt aaaagtgttt tgtagtttta gttattttta 2640 

agagtttagg tttggaaagt aggtggaggg gtggaaaggt agttttttgt gtttgtggta 2700 

gggattgttt gttttttttt ttaggtattt ttattttttg gtgttttttt ttgataagaa 2760 

gtattaattg gtttggggat agtagtgttt tttatttagg gatttattta gtaatattgt 2820 

tttgtttgtg gaaattattg tttttttatt ttttttttat ttttattttt ttgttttttg 2880 

tgtagttagt tttgggttag gggaaaggag tttttaggtt tttagggggt aggttagtaa 2940 

tagataattg agtatgatta tttttttttg ggagtatata aaattgtaaa attagtaaag 3000 

aatttggtta tagtgtgttt atgtgtttat agagtttgtt tgttttatta aagggaagtg 3060 

ttaggtttaa ggtttttgtt aatttgaaag agatattgag aaaatgagat tttttgggga 3120 

tttagaggga aagtgtaaga attttttatt gtatttttag ggaattgttt aatggggagt 3180 

ttggttttaa aagattttgg taataaaagg ttggatagga aattttttta ggtaaatttt 3240 

ttgttggatt taaagagaat attttttttt tgttataaat ttttttttat ataagtttag 3300 

attttgtttt ttttgttttt tttttttttt agttttttta agtatttttg agtagaatat 3360 

ttgataattt ttttgagtaa tagggatttt ttggaagtat taattatttt ttatgttttt 3420 

tggaa^itaag attataattt tttatgtgta tatgtgattt ttttttttta gttaggttta 3480 

ttttattgtg taaatagtat taatatatgg aagagttttt tgtattgtgt tataaaagat 3540 

ttttaatagg attttataga gaaaagggtt aaatagttga tataaaggat ttttgttttt 3600 

ttagaaaaga gggattttgg attttttttt tttttgaagt taagtatgag tttatataat 3660 

aggaataaaa t^aaatttaag gtgtatatj:a gtataatatt agggatatta gaatggatgg 3720 
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taaatttttt tttttatata aatatgaaag tatttttata attttttttt ttgaagtttt 3780 

tatttagaaa attatattaa atataggatt ttaaaatagt agtgattaaa gatgaaagtt 3840 

aatttttttt tttaattttt ttgattagtt tggtttaaat ttatgtttta aaatttttag 3900 

taatttagat ttttgtatat gtgtatttat aagatttttt ttatattttg taatttgtag 3960 

gtatttagta tggttattga ttttaagtga taaataggta gaagatttgt taggataatt 4020 

ttagttttta ttatgtgtat tttttatttt ttttgttatt tattagtttt ttgtagtaat 4080 

ataattgtta gttatattat ttggaaatta atgtgttttt gtagtgaatt atttattttt 4140 

agtaaattta ttgtattatt attttaaata ttgaaatttg attttattag atgtttagaa 4200 

tgatagttat agagtagaat tagattttaa tttttatgta taatatgtaa ataaattaag 4260 

gtgtagaaag ttattgtttt aaagttatat aattaatagt ttagtggttt ttttagtgtt 4320 

aattatagta attttagatt taagattgtt tgatttagga atggagagaa ataaaataaa 4380 

atgaatgttt attgaaatat atattttaga tatt 4414 

<210> 336 
<211> 4001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 336 

ttgatatgga atagagttaa gatttttttt tttttttttg atatggagtt ttattttgtt 60 

ttttaggttg gagtgtagag gtgtaatttt agtttattgt aagttttgtt ttttaggttt 120 

atgttatttt tttgttttag ttttttgagt agttgggatt ataggtattt gttattatat 180 

ttggttaatt ttttgtattt ttagtagaga tagggtttta ttgtgttagt taggatgqtt 240 

ttgatttttt gattttgtga tttgtttgtt ttggtttttt aaagtgttgg aattataggt 300 

gtgagttatt gtgattggtt agatttaaga tttgaattta ggtttttttg gttttagagg 360 

tttttgtttt ttaatttttt aggatggtat agtaatttgt tttataagag gtgtttgttt 420 

taagtgtgtt tagtatatgg aagtaagttt agaaatgtaa gtgtatattt gtaaagaggt 480 

gtgggagatg ggggggaggg aagagagaaa gagatgttgg tgttttttat tttttagttt 540 

ttgataggtg tttttgattt ttttttgatt agtatagttg tatttttggt tggggtattt 600 

taattagaat tgttaaattt agtatataaa aataaggagg tttagttaaa tttgaatttt 660 

agataaataa tgaataattt gttagtataa atatgtttta tgtaatattt tgttgaaatt 720 

aaaaaaaaaa aaaaaagttt tttttttatt tttattttta ttattaggtt taaggaatag 780 

ggttaggggt tttaaataga atgtggttga gaagtggaat taagtaggtt aatagaaggt 840 

aaggggtaaa gaagaaattt tgaatgtatt gggtgttggg tgttttttta aataagtaaq 900 

aagggtgtat tttgaagaat tgagatagaa gtttttttgg gttgggtgta gttgtttgtg 960 

gttgtaattt tagtattttg ggaggttgag gtgggaggat tatttgaggt tgggagttta 1020 

agattagttt tattaatgtg gagaaatttt gtttttatta aaaatataaa aaattagttg 1080 

gttatggtgg tatatgtttg taattttagt tgtttgggag gttgaggtag gagaattatt 1140 

tgaattaggg aggtagaggt tgtggtgagt agagattgtg ttattgtttt ttagtttggg 1200 

taataagagt aaaagtttgt ttaaaaaaaa aaaaaagttt ttttgatgtg attgtttttt 1260 

tttaaatttg tagatttttt taagattatg ttttttagat attttaaaga ttttagaaga 1320 

tatgttttgg gggttttgga agttataagg taaatataat atattttttt ttttgattat 1380 

taattttatt agaggatgtg gtgggaaaat tattatttga tattaaaata aataggtttg 1440 

ggatggagta ggatgtaagt tttttaggaa agtttaagat aaaatttgag atttaaaagg 1500 

gtgttaagag tggtagttta gggaatttat tttggatttt gggggagggg gtagagttat 1560 

tagtttttgt atttagggat tttttgagga aaagtgtgag aatggttgta ggtaatttag 1620 

gtgttttggt gttaggaggg atgtatttag gtttgtgtga agagagggag aaagtgaagt 1680 

tgggagttgt tatttttaga tttgttggaa tgtagttgga gggggtgagt tgggagtgtg 1740 

tttgttttta attataggag aaggaggagg tggaggagga gggttgtttg aggaagtata 1800 

agaatgaagt tgtgaagttg agattttttt ttattgggat tggagaaatt aggggagttt 1860 

tttgggtagt tgtgtgtttt tttttatggg gttttttatt gtgttgtgtg tttggttttt 1920 

attttttgta gtattttgtg ttttgtgttt ttttagttgg gtttagttgg agttatgggg 1980 

ttggagttgt agtgagtatt atggagttgg tggttttgtg ttgttggggg tttttttttg 2040 

tttttttgtt ttttggagtt gtgagtattt aaggtgggtt tggtgtgggg aggggatgga 2100 

gtagtggtgg gattttgttt tgtggatgtt ttgttgaggt tttgtggttg gtggggttag 2160 

aggggtttgg atgagttttt ttattttgaa gttgtggata gttgagatgt ttagggtagt 2220 

tgggttttgg ggtttttggg tgggaggggg tagttatatg gtagtggttt gagatggttt 2280 

atttaagaga ttggtgtttt ttaggttttg aggggttttg ggaatttgtt aaagaagttt 2340 

tttgaaattg tttagaaagt ttttttgtaa agggtgtatt gtgtagagtg tgtgtgtgtq 2400 

tttttttatt ttttgagttt ttttaagttt ttttaaagtt tttttagttg gtagtttttg 2460 

tttttggatt ggtttgggtt ggattttttg ggggggtttt ttgttttgtt ttttttttag 2520 

tttttttttg ttttttttta gatgattttg gtttgjgttgt ttttgttttt ggtggggttg 2580 

ggtgtgtgtg tgtgtggtgg agtggagggt ggtatagtaa tttgttttaa ttagagttgg 2640 

ggaggaaagg gtggtt-tgga gggtggtttt- ttgttggggt ttgggttggg wtgggggag^ -2700 
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atgtttgttt tgaatagatt tttggggtta gtttagggat tgtgttttgt gatttttgga 2760 

gtgtgtggat tatggagggg tgggggtggg tttrttgggg tgtaaagtgg gagagttttt 2820 

agagaaggaa gttaagaaat aaggttagat gggagtttag ggagggttgt gttgttttgt 2880 

tgtttttttt ttggtgttgt gtgtggggaa gggtgagtgg gggtagtgtg tattttgatt 2940 

tatttgttta tttgtgtgta ttaattataa aagttaatat atagtttggg ttaggtatat 3000 

tttgttagga attgtttgtg gtgttttgta tgtatttttt tttaatttta gaatattttt 3060 

atagtggaag ttttgttagt attttggatt gagtagtagt ttagaggttg agtagtagtt 3120 

agtaagtggt ggggttaaga tgggatttta ggtagtgtga tttttaatta tgtatttgaa 3180 

attgttatat ggatgagtgt atttggagta atgagggata ttgttttttg agttattggg 3240 

ttgtagggga gataaaatga aagtgttttg ggagttgtgg gtggttttta taggttagag 3300 

ggtttgggga gggagtgggt gttattgtgg ttgtgtgttg tttgaggggt tttttgtgag 3360 

tgagtgtatg gttgtgttat ttttgtaggt ttatgttagg gtgtttttta gttgtgtggt 3420 

trrcgtattt gtgtgtttgg gttttgtgtt gttaaatagt agtttttttg ttgatttggg 3480 

gatataggtt gaattttgtt ttttgtagga atttttttaa ggtgttgggt tagatttgtt 3540 

ataaatagag ggaggtagtt ttttatggtt atgttttttt gttgaggaag aaggtttttt 3600 

ttttttaggg agtatatttt tgtttttttt gttttttaga taagtatttt tattttttat 3660 

tttttgatga gaagggtgag gttatattga gttgttaggt tgagttgttg ttttttttta 3720 

ttttgggttg ggagttgatt agggaatggt agttgttgta gagttggatt tgagggttgg 3780 

gttttttgga tggggttttt ttatgttttt attttttaat ttgtattatt gattgtgttg 3840 

tgtaggagtt agttaaaaag ttattgtata gtttgggtaa taaggtaaaa ttttgtataa 3900 

aaaatataaa aattagttgg atgtgattat atgtgtttgt agttttagtt attttggagg 3960 

ttgaggtagg aggattattt gagtttagga agttgaggtt t 4001 

<210> 337 
<211> 4001 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 337 

aagttttaat tttttgggtt taggtgattt ttttgtttta gtttttggag tagttgggat 60 

tataggtatg tgtaattata tttaattaat ttttgtattt tttgtataga gttttgtttt 120 

gttgtttagg ttgtgtaatg attttttaat taatttttgt ataatataat taatagtgta 180 

ggttgagggg tgaggatatg aggaggtttt atttagagaa tttagttttt aaatttagtt 240 

ttgtagtaat tgttattttt tgattaattt ttagtttaag gtgaggaagg gtagtagttt 300 

agtttgatag tttagtatgg ttttattttt tttattagaa gatgagaggt gaagatgttt 360 

gtttgggaaa tagggagggt aaggatgtat tttttggaag agaggaattt ttttttttag 420 

taagaaggtg tggttataga aggttatttt tttttgttta tggtagattt ggtttagtat 480 

tttaagggag tttttgtaga ggatagagtt tagtttgtgt ttttaagtta gtagagagat 540 

tgttgtttgg taatataaag tttagatata taaatataaa gattatataa ttgaggaata 600 

ttttggtgta gatttgtaga gataatatgg ttatgtattt atttatagag ggttttttgg 660 

gtaatatata gttatgatga tatttatttt ttttttagat tttttgattt atggagatta 720 

tttatgattt ttagaatatt tttattttgt tttttttgta gtttagtgat ttagggagta 780 

gtgtttttta ttgttttagg tgtatttatt tatatagtga ttttgaatgt atggttgggg 840 

gttgtattgt ttggggtttt attttgattt tattatttat tagttgttat ttaatttttg 900 

gattgttatt tagtttagaa tgttggtaga atttttatta taggggtgtt ttaggattaa 960 

aggagaatgt atgtaaaata ttataaatag tttttggtag agtatatttg gtttaggtta 1020 

tatgttagtt tttatgatta atgtatatag gtggatagat gggttaggat atatattgtt 1080 

tttatttatt tttttttatg tatagtatta aggaaaaggt agtagaataa tgtagttttt 1140 

tttaggtttt tatttggttt tattttttag tttttttttt tgggaatttt tttattttat 1200 

attttaagaa atttattttt atttttttat ggtttatgtg ttttaaaagt tatagagtat 1260 

agtttttaag ttggttttaa gaatttgttt aaagtaaatg ttttttttgt ttttaattta 1320 

gattttagta agaggttatt ttttgggtta tttttttttt tttggttttg gttgggatag 1380 

gttgttatgt tattttttat tttattatat atatatatat ttgattttgt tagaagtagg 1440 

agtaattaaa ttaaaattgt ttgaagggga gtggggaggg gttggaggag gggtagggta 1500 

gaggattttt ttaaggaatt tagtttaggt tagtttggag gtggaggttg ttaattggaa 1560 

aggttttgag aaagtttgag ggggtttaag aaggggggaa atgtgtgtgt gtgttttatg 1620 

taatatattt tttgtgggaa aattttttga ataattttag agaatttttt tgataagttt 1680 

ttggagtttt ttggagtttg gaaagtgtta gttttttgga tgggttattt tgagttgttg 1740 

ttgtgtaatt gttttttttt gtttgagggt tttagggttt ggttgttttg agtgttttga 1800 

ttgtttataa ttttgggata ggagagtttg tttgggtttt tttggttttg ttggttgtgg 1860 

gattttggtg gggtatttat agggtagggt tttgttgttg ttttgttttt tttttatatt 1920 

agatttattt tgggtgtttg tggttttggg gggtaagagg gtgaggagga gtttttagtg 1980 

gtataaggtt gttagtttta tggtgtttat tgtggttttg gttttatggt tttggttgga 2040 

tttggttggg agggtgtggg gtgtggggtg ttgtgagggg tgggggttgg gtgtgtggtg 2100 
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tagtaaaggg 

ggttttaatg 

tttttttttt 

ttttaattgt 

tttgtgtagg 

ttttatattt 

tggagtttgg 

tattttaaat 

attaaataat 

tgttgtgttt 

tatttgaaaa 

aggatttttt 

gtgtgatttt 

ttttttgagt 

ttagtaaaga 

gatttttttg 

tttaaaaaga 

atttagtatt 

aattttattt 

gtaggggtgg 

atgggatata 

gtttttttat 

gtagttatat 

attagtattt 

tttgtatttt 

gataggttgt 

ttgggtttaa 

tgggaggttg 

gtgaaatttt 

tagttttagt 

tgtagtgagt 

ttaaaaaaaa 



ttttgtggga 
gaggggaatt 
attttttttt 
attttaataa 
tttgggtgtg 
ttttttggag 
gataaatttt 
ttttttgggg 
ggttttttta 
attttgtggt 
gtatgatttt 
ttttttttta 
tgtttattat 
agttgggatt 
tagggttttt 
ttttagtttt 
tttttatttt 
taatgtattt 
tttaattata 
ggatggaagg 
tttatattaa 
ttttatgtgt 
tggttaagaa 
tttttttttt 
taaatttgtt 
tatgttattt 
attttgaatt 
aggtaggtag 
gtttttgtta 
tatttaggag 
tgagattgtg 
aaaaaaaaat 



aggggtgtgt 
ttagttttat 
ttttttgtga 
gtttgggagt 
ttttttttag 
aatttttaaa 
ttaggttgtt 
agtttgtatt 
ttatattttt 
ttttaggatt 
aagagggttt 
aatgaatttt 
aatttttgtt 
ataggtatgt 
ttatgttggt 
ttaaagtgtt 
aattttttaa 
aaggtttttt 
ttttatttgg 
aagatttttt 
taaattattt 
taaatttggt 
gggattaaag 
tttttttttt 
tttatgtgtt 
tagggagttg 
tggttagttg 
attatgaggt 
aaaatataaa 
gttgaggtag 
tttttgtatt 
tttgatttta 



seq_prot . 
ggttgtttgg 
aattttattt 
ttgggagtaa 
ggtaattttt 
tgttgggatg 
tgtagaggtt 
atttttaata 
ttattttatt 
tagtaaaatt 
tttggggtat 
ataaatttgg 
tgtttttgtt 
tttttggttt 
gttattatga 
gaggttggtt 
ggaattataa 
aatgtatttt 
ttttgttttt 
agtttttgat 
tttttttttt 
attgtttatt 
agttttagtt 
gtatttatta 
ttatttttta 
gagtatattt 
agaaataggg 
tggtggttta 
taggagattg 
aaattagtta 
gagaatggtg 
ttagtttggg 
ttttatatta 



txt 

ggggtttttt 

ttatattttt 

gtgtgttttt 

agttttattt 
tttgggttgt 
ggtgattttg 
tttttttaag 
ttaagtttat 
gatagttaag 
attttttgga 
gaggagatag 
gtttaggttg 
aagtgatttt 
ttagttaatt 
ttgaattttt 
ttatgagtaa 
ttttgtttat 
tgttttttat 
tttatttttt 
taattttaat 
tgaaatttaa 
ggaatgtttt 
gggattggag 
tattttttta 
aaagtaggta 
gtttttggga 
tatttgtaat 
agattatttt 
ggtgtggtgg 
tgaatttggg 
agatagagtg 

g 



tggttttttt 2160 

ttaagtagtt 2220 

agtttgtttt 2280 

tttttttttt 2340 

ttgtagttgt 2400 

tttttttttt 2460 

ttttaggttt 2520 

ttgttttaat 2580 

gagggggatg 2640 

atttttgaag 2700 

ttatattgaa 2760 

gagagtaatg 2820 

tttgttttag 2880 

ttttgtattt 2940 

aattttaggt 3000 

ttgtatttag 3060 

ttaaggaggt 3120 

tagtttgttt 3180 

aggtttagtg 3240 

aagatattgt 3300 

atttaattgg 3360 

agttaagaat 3420 

aatgaaggat 3480 

taggtatata 3540 

ttttttgtgg 3600 

ttaagaggat 3660 

tttagtattt 3720 

ggttaatatg 3780 

tgggtgtttg 3840 

aggtagagtt 3900 

agattttgtg 3960 

4001 



<210> 338 
<211> 4334 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 338 

aaattaatta 
aatatgatat 
gatttttaaa 
tattttattg 
ttttaatatt 
ttgatgtggt 
tgttttttag 
ttttgattta 
ttaatattat 
aaaagttagg 
gtagtattgg 
tttttggtta 
gtgttttttt 

gggtsggtgg 

tggatgtttt 
tagtttagtg 
gttattgttt 
ggttgtgaag 
gtggttgtta 
taaagttgtt 
gtttgnaggg 
tttga-tggta 
gt^tttttgg 
gatrttttgtt 
gttgttgt-ta 



tgaaaaaaat 
tggttttaga 
attagataaa 
aaattgtaag 
gtttataatt 
ggataagaaa 
gtttaagttt 
tatttgttgt 
gatttttgta 
aattaaaatt 
atttaggatg 
attagagtat 
gaattttttt 
ggttaatggt 
ttttaatttt 
ggtggggtag 
tttgttgttt 
atagaagaat 
ggggatggtt 
ggtttttggg 
aagggaJggga 5 
ttttggagtg 
agagtgggga 
tattaatttt 
-t^gtttagtt 



tattaaaatt 
aaattaagtt 
gttataaggg 
aagatatttt 
tattgtgtaa 
atgtttaagt 
ttttgtttaa 
tgttatgtta 
gatgttagtt 
agaggtagtg 
ggtgtggttt 
gttttgtgtt 
gaatttaata 
ttaatttggg 
tgttaaaaat 
gttttgggaa 
ttatgtttaa 
ttttgtgggg 
gtagggaggg 
gtattgtttt 

gggagggagg 

gatgtttggt 
taaaggtgag 
tttgaggttt 
tt-ttttgttg 



aattgagttt 
tgaagaattt 
agatttaatt 
atgaaatatt 
taatagtaat 
tagtaagtaa 
tgaaaaatgt 
agtattaaaa 
ataagattta 
tggtttgggt 
ttggtttttg 
ttttggagtt 
taattttgtt 
atttttaggt 
taggtttttg 
tttgttgtta 
tttgggagtg 
ttgtggtttt 
gtttttttag 
taaatgdggg 

garttgtttag 
gtttt#ttt§ 

gtagttttgt 
attgtgtttt 
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atattgggaa 
ttttggttat 
tttttttata 
taatagtgtt 
tatttttgtt 
taataatgtt 
aatgtttatg 
tgtatttttt 
atgtaggatg 
agaaaagtag 
gaggaaaaga 
gttttgtttt 
tgtgtgtgtg 
tttgtttgat 
tgttgttatt 
ggtggttttg 
tttatgttgt 
ttgtttttta 
ggttgtttgt 
ataatgggga 
tttatgtttt 
ttttttgttt 
a^ttatatat 
ttgttgtttt 
ttrttgttggt- 



atatgatttg 

aaaatttttg 

aattttttaa 

tttggagtta 

ttgtttgtgt 

gaatttttat 

aatttttaga 

gttgattttt 

ataggttttt 

agaattagat 

ggattggttg 

tttttttttt 

ggaaggggag 

tttttttgtg 

atttttttta 

tggtgagttt 

ttgtgtgttt 

attattagtt 

agtttggttt 

atttggtggg 

ttattgtttt 

atgaat^gtt 

aaagtggagt 

gtjagtagggg 

tttagttagt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500" 
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tgttgttttt tttgagtttt gtatttattt gtttagtttg agttgtttga ttatattttg 1560 

agtattaata gatttgggtg tgtataaaag tgttaaatag aggtatgtga tttttgtgtg 1620 

gggttttgta ttggttgaga gtggtggtat gtgttgtttt tgttttttat aatttgtgtg 1680 

gagtgttgtt ttttgggatt ttgaagtatt ttgggaagtg tagttttgtt ttggagggta 1740 

ggaggataat gttgggggtt gtagggtgtt tggggagtat ttttaggtgt gggtttggga 1800 

ttaggggggt ggttttgttg tggaagaggg tgtggtagtg gattttggat taggtttgaa 1860 

ttagtttttt atttttgaga ataagggtag ggtggagtag gtggagggtg ggggggtttt 1920 

tttttatttt tttaaatttt ggttgtttgg attttatttt gtagagttat atgtttgttt 1980 

tgttttaagg gtagtatttg gatagagtag tgtttttttt agtgtgttat ttatatgggt 2040 

gtttaagatg tttgagtttt tgtgatagta taatgttatt ttatgttttt gtagggtttt 2100 

ttgtttttgg tattttatat tgatattggt gtgttattta tattgtgata ttattgtaat 2160 

ttttaattta tttagtttat aggttgtggg tgtggttttt tttttattgt gtgtgttttt 2220 

gttagtattt ttaggttttt gagttggtga ttagggattt gtgtgggaaa gtgtggggtt 2280 

ttggaatttg attgtttttg gtttaataaa gttatttagt ttaaatgtag attttatttg 2340 

tttttttaag ttttttgtgg atgatagggg aggaagagta atagttgttt tagattatat 2400 

agattgtagt taagtgttgt ttttgttttt agagtatgtg ttttttagag tagtattggg 2460 

ttttttttaa aaattgaaaa aggagttata aggttattgt tttttttgat ttggatagtt 2520 

tagaatttta gtaaaggtat aaaggaaatt ggtttgattg tgttttagaa taattggttt 2580 

ttttttttat aaaaagggga aaaaaataaa tgaaatgaat atggtttttg ttttttttta 2640 

ttttttgatt aagattgtgt ttttggttgg tggttaatat attaagatat ttgttttgag 2700 

attttgatta agtagaagag atatttttat tttattaatg agttgtattt ttttaaatta 2760 

agaaataaat gtgattaatt tttttgaaat aatagtttta atttagaaag atttttatgt 2820 

ttagttagtt attagggatg tgttagagat aaaatataat taaggggtgt rtaagataat 2880 

aaaaattgga ttatataagt agtagtatat tatataaagt aaataattag taatgtgttg 2940 

ttgtttgaaa tatgnttgtt gtggtatttt ttttaataat tgtttttgtg tggggttggt 3000 

ttttttttta agttaattga agtggttttt atttagtttg tgagtaggat taggaataat 3060 

ttagatattg agttttgaaa ggtttttttt tagattaaat ttagttttaa aatggttagg 3120 

tgtttttgtt tgttattttt ttgaatagaa ttttagatat tattagaaaa gttggagaag 3180 

gatgggtatg agatttttta ggaaagtgat agataggtag gtaatataaa tgagtaagga 3240 

atttaagttt aagaggtttt tatttattga ataggttttt tattttgaag tttttgtatt 3300 

tgatttggag gtttatagag gaaggtttta tttatttaat gtattttttt aagtataata 3360 

aattgttata taaaagtttt atatttttga ttattaatgt tgtgatagaa aaaagaaata 3420 

aattttttaa aatattgata atttgaaagt atttatattg ttttttttta tatttttttt 3480 

ttttttagag tagatgttta tttttatgga aattatagta aggaatgtag atgttagaat 3540 

ttatgtttat ttttaatttt ttttagtaat gttatttggt ttttgggttt gttgggagat 3600 

gtttggtttt gttagtttgt gataatgttt taagtttttt atagttgttt gaggattgag 3660 

agggttgggt taaagttttt tttagaatga gtttttgaat aaaaaggtgt ttttgaggtg 3720 

ggatttttgt ttttttatta ttattattat tattattatt attattatta ttattattat 3780 

tatttgattt aaaaaaaatt gagatagggt tttattatgt tgtttaggtt ggttttgaat 3840 

ttttgggttt aagtaatttt tttgttttag ttttttaaag tgttgggatt ataggtatga 3900 

gttattatat ttagtttgat ttttgttttt tagggtgggg ttgttttttt aagagttaga 3960 

ttatagttta ttattgttag tttttgtggg tgttttatat aagttttgga ataggaaggg 4020 

tttttaattg taaggagaga taatagtttt gttattttat tttggagagg ggtagaattg 4080 

ttttttttga aagtttttta aaatgaattt taagatttat ttttttttga agtggtagta 4140 

aggagttaat gtttatattt tggaattgtt tatttttgtt tataagaatt atttttgttt 4200 

ttgtttagat ggtttggttg gattgttatg gagttggttg tgattatgga gttatataat 4260 

gaagtggaga tagggtagtt atttagttta tttgatttgt gtagatagtt aaaggagagt 4320 

attttaggtt ttaa 4334 

<210> 339 
<211> 4334 
<212> DNA 

<213> Artificial Sequence 

<220> . 

<223> chemically treated genomic DNA (Homo sapiens) 

<400^ 339 ' •, :' 

ttgaagtttg aagtgttttt ttttagttgt ttatatagat taaataagtt ggatgattgt 60 

tttattttta ttttgttgtg tggttttata gttatagtta gttttataat aatttaatta 120 

aattatttga gtagaagtaa aaataatttt tatgggtaaa gatagatagt tttaagatgt 180 

gaatattaat tttttgttgt tattttagaa gagagtggat tttgaagttt gttttaggaa .240 

gtttttgaga agggtgattt tgtttttttt taaggtga&g tggtaaaatt gttgtttttt 300 

tttatagttg aaagtttttt ttattttaag gtttatatag gatatttata aaagttgata 360 

ataatgagtt gtgatttggt ttttgggaag gtagttttat tttaaagaat aagglattaga 420 

tt5gggtgt]gg tggtttatgt ttgtaatttt agtattttgg f galggttgaga tiaagaggatt , 480 

gtt'tgaattt aggagtttga g f gttaatttg ggtaatatag tgagattttg ttttaatttt ' 540 
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ttttaagtta aataataata ataataataa taataataat aataataata ataataataa 600 

aaggatagga attttatttt aaaagtattt ttttatttaa aggtttattt taggggaaat 660 

tttaatttag tttttttagt ttttaagtag ttgtgaggag tttggaatat tgttatagat 720 

tggtagaatt aggtattttt taataaattt agaagttaag tggtattgtt aaaaaagatt 780 

aaaaatgaat atgagtttta atatttatgt tttttgttat ggtttttatg gaaatagata 840 

5 t HH ttag 9gagaaagaa aatgtggggg gaagtagtgt aagtgttttt aaattattag 900 

ISPFS 33 ? satttatttt ttttttttgt tataatatta atagttagga atgtgagatt 960 

tttgtgtggt aatttattgt atttaaaaaa atgtattaaa tgagtgggat ttttttttgt 1020 

gaatttttag attaaatata aagattttaa gataaaaagt ttgtttagtg gqtaaqqatt 1080 

ttttgggttt gggttttttg tttatttgtg ttgtttgttt atttgttgtt tttttgaggg 1140 

gttttatgtt tatttttttt taattttttt aatgatgttt gagattttgt ttagaaaaat 1200 

gatagatgga agtatttagt tattttgaag ttaagtttgg tttgagaaga agttttttaa 1260 

g9 H£ agtat tt; 9 a attatt tttggttttg tttataggtt aagtgaaaat tattttgatt 1320 

SS^HSi 99 ggaggttgat tttatataag ggtgattgtt ggaaaaaatg ttgtaataag 1380 

tatattttag gtaatagtgt attgttaatt gtttgttttg tgtagtgtgt tgttgtttqt 1440 

atgatttagt ttttattgtt ttgggtattt tttagttgti tttttttttt ggtatatttt 1500 

J?9^ a " aa ^ a f9tatga gagttttttt aaattaggat tgttgtttta ggaaggttgg 1560 

ttatatttgt tttttgattt ggggagatgt agtttattaa taaaataaag atattttttt 1620 

Si ta agattttaaa gtaggtgttt tggtgtattg attattaatt gaggatatgg 1680 

ttttggttag agaatgggag gaggtgggga ttatgtttat tttatttatt tttttttttt 1740 

ttttgtgaga gaaagggtta gttattttaa ggtatggtta gattaatttt ttttgtgttt 1800 

t i§5i gaggt tttgagttat ttaggttagg gagaatagtg attttgtggt tttttttttg 1860 

H3 t1: H a 2 a agaatttagt gttgttttgg ggagtatgtg ttttgagaat aaaagtaata 1920 

tttaattgta atttgtgtag tttagaatgg ttattgtttt ttttttttta ttatttgtag 1980 

aaggtttgga agaatagatg gagtttgtat ttaagttgag tggttttgtt agattaaaqq 2040 

tgattaaatt ttagagtttt atgttttttt gtataggttt ttggttattg gtttgggaat 2100 

ttgaggatgt tggtaagaat gtatatagtg agggaagggt tatgtttgtg atttgtgggt 2160 

tgagtggatt agaaattatg atgatgttat aatatgggtg atatgttggt gttggtgtaq 2220 

99"tgttgggg gtagggggtt ttgtaggggt gtggagtggt attgtgttgt tataggggtt 2280 

iSSSiSH" aggtatttat gtgggtggtg tattagaagg ggtattgttt tgtttgagtg 2340 

ttgtttttgg ggtgaggtgg gtatgtggtt ttataaggtg gagtttaggt ggttaiagtt 2400 

tggaaaggta gggaaggatt tttttgtttt ttgtttgttt tgttttgttt ttgtttttga 2460 

gaatggggag ttggtttgga tttagtttgg ggtttattgt tatgtttttt tttatggtga 2520 

ga ^ a ""I JHiF^HS 9«tatattt ggggatgttt tttaggtgtt ttgtaatttt 2580 

tggtattgtt tttttgtttt ttgggatagg attatatttt ttgaggtgtt ttgqqqtttt 2640 

999999tggt gttttatgtg ggttgtgggg ggtgggggtg gtatgtgttg ttgtttttgg 2700 

" aa * g * gga gt "5 gtgt 2 gaggttatgt gtttttgttt ggtgtttttg tgtgtgtttg 2760 

ggtttgttgg tgtttagagt gtggttaggt ggtttggatt gagtaggtgg gtgtgqqqtt 2820 

tggaggaggt ggtggttggt tgaggttagt aagagggatg tggttggtgg gaggggttgg 2880 

gttgtggtag tgattttttg ttgtagggtg gtgggtgggg ttgtgggttt tgglggggtt 2940 

ggtgggtggg ggttgttttg ttttgtgtgt ggtttgggtg gagttttgtt tttgtttttg 3000 

™£HS 9 S 9tgtggttgt ttgtgggtag ggggttgggt gattattggg tgtttgtttt 3060 

2??. g HH 9 3 tgaggagata atagggggtg tgggtttttg tttatttttt tttttttttt 3120 

ttttttttgt gggttttgtt gggtttttta ttgtttgaag ggatggggtg qtgttttaqq 3180 

ga " agtggt tttaggatta aattgtgggt agttagggtt gtgatttttt ttgtgattgt 3240 

iPH^I 93 tt 9tagttgg tgattgaggg gtggtgtttt tgggttttat gagggttttt 3300 

"9tttttgt ggttggatgt gtggatagtg tgggtggtgg taggttgggt atggggatgg 3360 

5 9g9agg * 9g tggtgagttt attgtgggat tatttggggg tttgtttttg gggtttgttt 3420 

™HH § a a "gtgaagg gggtggtggt ggtggtttgg aggtgttttt ggtgggagtt 3480 

gg ??? ggg J g *" g ^ g J agg 999agttagg taggggtgga gttatttgga ttggattgtt 3540 

ag *"? gttt attttttttt tttttatgtg tgtgggtttt ggggtgttga gtttggggag 3600 

atttgaaaag gtgtggggag gaagggggtg gggttagggg ttggagtgta aggtgtgttt 3660 

tgattggttg ggggtgattg gttttttttt ttttgtttgg attagggtta tgtttatttt 3720 

gggtttggtg ttgtgtttaa ttttttgttt tttttaaatt ttgttgttgt ttttgatttt 3780 

SSSIHi* 8 ? 11 H ttgggaat "gttatttt atgtttttgg tattagttgg tgtttataaa 3840 

a !™H a S? ttaa aaagat taataagaga tatagtattt tttatgatat aatggtagta 3900 

aatataagtt aaaatttaga ggtttataaa tattttgttt gttgttgggt aaggaagttt 3960 

fSSi^JS 3 " gataatagga gtttaatatt attggttatt attagtttgg gtgttttttt 4020 

a ^H a i a 5S ttagatatag ataaagtagg ggtgggtatt ttatttgtat aatgagttgt 4080 

agg H 9 att aagatggttt tgggggtatt gttgagttga atttggaata tttttttata 4140 

SiHH5 tga aatgttaaag agtttatggg ggaaaaattt tttatttttg tgattttgtt 4200 

a??«^ aaa 2 a H5 aag *2 «ttatgatt gagaaagttt agttaatttg attttttgga 4260 

^J a *HH a ta Jttaggtt atatttttta atgtttattt agtagatttt gataattttt 4320 

tttgtggtta attt u • ■ aiio. 



<210> 340 

<211> 4528 - ' 
<212> -DNA — — 



-■ • . y 

n t w * 



Seite 286 



seq_prot.txt 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 340 

tgtgtagatt tgttaggaag agtataagaa gaaatatttg gatttttttg ttaattttgt 60 

ggaatttttt aagaagtgtt tggagagatg gaagattatg tttgtaaagg agaagtgaag 120 

tttgaagaga aggtaaaaag tgataaagtt tgttgtgata gggagattaa aaattatatt 180 

tttttgaaat gtaagaaagg gtaagaaagg aaagaaaaag gattgtaatg tttttagaag 240 

gttattattt gttttttttt tgttttgttt tgaatattgt ttaaagatta aaagtggata 300 

tttaggttta tttgttgtgg aaattgtaaa gaaattgggt gaaatgtggt ttgggtagtt 360 

agttaaagat aaataattat atgagtagaa agtagttaag ttataggaga gatatgaaaa 420 

gggtattgtt gtatattgtg ttaagggtaa aagtgaagta ggaaagaagg gtttaaagaa 480 

gaataaatta gaagatgagg aggaggagga ggagaaagaa gatgaagatg aggaggaaga 540 

gggtgaagat gaagaataaa tggttatttt ttaatgatgt ttgtgtagtg ggtttgtttt 600 

gttaagaatg tgaattttag tatagtttag tattagtttt agtataaaat tgtataaatt 660 

tttgtatagt ttataagatt ttttgtatag aaaatatttt tttttttttt tttttttttt 720 

gagatagagt tttgtttttg ttgtttaggt tggagtgtaa tggtgtgatt ttggtttatt 780 

gtaatttttg ttttttgggt tttggtttaa gtagtttttt tgttttagtt ttttgagtag 840 

ttgggattat aggtatatgt tattatgttt agttaatttt tgtattttta gtagagatgg 900 

ggttttatta tgttggttag gttgttttta aatttttgat tttgtgattt gtttgttttg 960 

gttttttaaa atattgggat tataggtgtg agttattgta ttttgtttaa tgtttttaaa 1020 

tatttaatgg tttttaaaaa atttattgtg tatggtagta tagtatattt gtaggaatta 1080 

gtattaatag tatattttgt gttttttaag atgttgtatt ttttaatatt ttgtaataaa 1140 

attatgtgta ttaaaaaaat aaagaaattt tgtgtgtagt ttatatttat agtatatttt 1200 

tgtttaggta tttgagagaa tgattaggag gggtttttgg aggaggtggt ttttgaatgg 1260 

agaatttatt tttaaggatt ttgtttgtaa tggttattaa gtattttttg agttattttt 1320 

atgtgttttg tagttttttg aaggggtgtg ggatttattg atgttaatta tttagtatta 1380 

tttttagatt ttaagaagtt ggggtgtgag ttagtaatta gtatagaaaa gagatattaa 1440 

aataagtttg agttggggag tgttttttta attttagttt tttggaagag attttttttt 1500 

tttttttgag atagagtttt gtttttattg tttaagttgg agtgtagtgg tatgattttg 1560 

gtttattgta attttttttt tttgggttta agtgattttt tttttttagt tttttgagta 1620 

gttgggatta tagatatgta tttgtaattt ttattaaaaa tataaaaatt agttgggtgt 1680 

ggtggtgtat gtttgtaatt ttagttattg gggaggttga ggtaggagaa ttgtttgaaa 1740 

ttaggaggtg gagattgtat taagatagtt tgttttagtt aaataatttg gtgttagtgt 1800 

aggaaaaggt ggaaggtatg gggttagtat aggagggttt aatattttta attttattaa 1860 

gttatatttt ggtaattttt gttttttatg agaagttttt gttgggtttg ttttagtgtt 1920 

gttttgaggt tttttttatg agttttgata gggtagaggt tgttttgagt gttttttttt 1980 

tttttggttt aagagtggtt taaaagaagg atttttgatt ggaattggtt attttgtgtt 2040 

attttttgat ttttgatttt gttttaaagg gggatgtggg ggaggggttt tggtaggggt 2100 

ggttttgttt tttttaggtt tgtaagttta ggtttttgtt tattgggttt agtttatttt 2160 

gtggttgttt agggaggttg ttggtatttg tgatttatga tttttttttt gagttttatt 2220 

gaggttatag ttgtggtttt gtttttttat gtttgttttt tgttttttgt ttgtgatggg 2280 

tgtttttgag gattaatgag tgtgttgtat ttattttttg ggtggggtta agtgttgatt 2340 

aattgttgtt tgggtgtttg gttgggttta aatgttttaa ttgttagtgg tggtggggtg 2400 

ggtagagggt tggggatggt aggtttaatt aatgggtggg tatgttgttt ttgtgaggag 2460 

gtgtgttttg tggttgggtg tgtggtgttt gtggtggtgt agggaggggg agggaggtaa 2520 

ataagatggt ggtggtgtgt tgggtgtgga agggggaggt ggtttggggt gtttgtgagt 2580 

gaggtgtggg gtggtgaagg gagtgtgggt ggtggtattt gttgttgtgg ttttggatgg 2640 

gttgggtttt ttttgttgtt ttgttttttt tatatgtgtg gtggttgtgg tgagggggat 2700 

gtgttgtttg gggtttggta tttttgggaa ttttttggtt tggagtttgt ggtttgtgtt 2760 

gttttggttg ttgggagttt tgtggagttt ttgttgttgt gttgttttgt ggattggatg 2820 

ttgagggtat ttggggtggg gtgtgtgttt gggtagatgt ttgtggggag gggggtgttt 2880 

gttgggtttt ggtgattatt ttgggggttg tgggttggtt tggggggtgt ttagtgtggg 294p 

tttttgtggg tgttgggtag tgattagttt tgagtggagt tgttggttgt ggtgggaggt 3000 

tttttggatg tttttagttt tttgaatgtt tgtttgggtt ggtgggagtt ggtgtttttt 3060 

gggaggttfcg tttggttgtt tgtggtggag tgtttgtttt tgggataggt ggtgggattg 3120 

gggtgttgtt ggagatgttt ttagtgaagt tgggtttttt aggtgtgggg gttttggggg 3180 

gtagtgatgt tgtggatttg gtttgtggga tgggtggttt ggagaagatt gtgtttggtt 3240 

gtgtttatat ttgtttgtgg gtttgaggtt tttggaggat gatttagtat tgaaaagttt 3300 

tggttggttt ttttagggtt tttgaggatg aagttgattt tgattgggtt gttttttagt 3360 

tttg,aggt;tt gggttttatt ggaatttgtg tttgagttgt tgttttgga ! t ttttggtgtt 342Q 

tgttggtttg t^gattttgt attgggtttg gatttgtagt tgggattgat gtgtagaata 3480 

attgtttttg t;tggaagaag ggtjttttttt tt^tttttgg ggtttttgtt gttttttttt 3540 

tttttttttt ttttgtaaaa ttttggagaa gggaagttgg aatataagga aggattgttt 3600 

ajtttgtgg^t t^gggttgg t;ggtgggatt t^aggatttt gggtttagta tggaggtggt 3660 
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ggatttgtag tagttgggta tgtttatgga gggtgagttg atgttggtgg gtatggatat 3720 

gtttatttat tgtattgatt ttattgaggt tatttattag ttgtgttgta agtgggttaa 3780 

gtttattggt aagtatttga tgggggattt gttgggggaa ggtttttatg gtaaggtgaa 3840 

ggaggtgttg gatttggaga tgttgtgtag gagggttgtt aagattttta agaagaagaa 3900 

gttgtgaagg atttttaatg gggaggttaa tgtgaagaag taagtatggt ttgttggggt 3960 

tggggttggg ttgggttagt tatggtgttg atggttttgt tttttttttt tttttttttt 4020 

tttttttttt tatttttttt taatattttg agttggattt gtttggtgtt tgtgtttttt 4080 

gtgttaggga gagtgtggtt gggggtttgt gttatggatt tttatttagg taaggttagt 4140 

tgttgtagtg gggtgtgtgt ttgtatgggt ttttggattt tagttaaaat gttttggtag 4200 

tgaaattttt ttgagaaggg agtggttttt aattttttaa gattagtttt ttggtttttt 4260 

tagttgttta aggagtagag gtgtttagtg gaattagttt gtgtttgttt gggttttgaq 4320 

agtttgtgtg tggttgttaa tatgtttttt gtgtagtgtg tggttgttat tggggttagg 4380 

tgaaatgtga tttgtttatt ttgttagagg ggaattttgg gttgttaaaa ataattgttt 4440 

gtattggttt attagttagt aggagggaaa tgtagttttt ttttatttgt tagggatgtg 4500 

atgttggaag tatttttggt ttttgggg M u 452 8 

<210> 341 
<211> 4528 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 341 

ttttgggggt tagggatgtt tttagtgtta tatttttggt aaatgaggaa aggttatgtt 60 

ttttttttgt tgattgataa gttggtgtaa atagttattt ttggtagttt agggtttttt 120 

tttgatagga taaataaatt atattttgtt tggttttggt aatggttata tattgtgtaq 180 

aaaatgtgtt ggtggttgta tataaatttt tggggtttaa ataaatatag gttgatttta 240 

ttgggtgttt ttgttttttg gatagttggg ggagttaagg ggttagtttt aggggattgg 300 

gggttgtttt ttttttagga gggttttgtt attagggtat tttaattgga gtttaagagt 360 

tta £ g i aaat gtatgttttg ttgtgataat tggttttgtt tgagtgaaag tttgtaatgt 420 

aggtttttaa ttatgttttt tttggtatgg aggatatagg tgttagatgg gtttagttta 480 

ggg I: gt \!: aag aggaagtaag ggagggaggg aggagagaag gaaggaagat agaattatta 540 

gtattgtgat tggtttggtt tggttttgat tttagtaagt tatatttatt tttttatgtt 600 

§££353 ff?2£2i££ ^£?H a Hi FSJFSI 8 aggattttga *gg««ttt eeo 

gtatagtgtt tttgagttta gtattttttt tattttgttg taagagtttt tttttagtag 720 

Sii tt i tatt aggtatttgt tgatgagttt ggtttgtttg tggtgtggtt ggtagatgat 780 

tttggtggag ttgatgtggt ggatgaatgt gtttatattt attgatatta gtttgttttt 840 

tgtgaatatg tttagttgtt gtgggtttat tatttttatg ttggatttag ggttttggag 900 

ttttgttgtt agttttgagt ttgtgggtga gtggtttttt tttgtgtttt gatttttttt 960 

ttttaaaatt ttataaagaa aaaagaaaaa aaaaaggtaa taaaaatttt aaaaggaagg 1020 

gaaaaatttt ttttttaata gaaatgattg ttttatatgt tagttttggt tgtgagttta 1080 

agtttggtgt agggtttgtg gattggtggg tattgggggt ttgggatggt ggtttagatg 1140 

*9agttttag tgsgatttgg gttttagaat tgggagatgg tttggttagg gttaattttg 1200 

tttttgggga ttttggggag gttggttggg gttttttagt gttaggttat tttttgggga 1260 

ttttaggttt atggataagt atgaatatgg ttgagtgtag ttttttttgg gttgtttatt 1320 

ttataggttg ggtttgtgat gttgttattt tttgggattt ttatatttgg agagtttaat 1380 

tttgttgggg gtgtttttgg tgatgttttg gttttattgt ttgttttagg agtaaatgtt 1440 

ttgttgtgga tgattgagtg gattttttgg ggggtgttga tttttgttgg tttgggtgag 1500 

23 tt: 2 9999 9 tt g99gg t g tttgggaggt tttttgttgt ggttaatagt tttgtttagg 1560 

9HSSi tgtt 9 t1:t ggtgtt tgtgagggtt tgtattgggt gttttttgag ttggtttgtg 1620 

a i t i t i aagg tflflttflttgg ggtttggtag gtgttttttt ttttgtaaat gtttgtttga 1680 

gtgtgtgttt tgttttgagt gtttttagtg tttggtttgt ggggtggtgt ggtggtgggg 1740 

g 5" tatgg g gtttttggtg gttgaggtgg tgtaggttgt aggttttggg ttggggggtt 1800 

tttgaaggtg ttgggttttg ggtggtgtgt tttttttgtt gtggttgttg tgtgtgtgga 1860 

? gagg i ggag . tggtgagggg ggtttagttt atttaaggtt gtggtagtaa gtgttgttat 1920 

3 2 tg H tt:t tttgttgttt tgtgttttat ttgtgggtgt tttgggttgt tttttttttt 1980 

tgtgtttgat atgttgttgt tattttgttt attttttttt ttttttttgt gttgttgtgq 2040 

a 3 attg J atg tttggttgta gggtatgttt ttttgtgagg atgatgtgtt tatttgttgg 2100 

ttgaatttgt tatttttggt tttttgtttg ttttgttgtt gttgatgatt ggagtgtttg 2160 

gatttggttg ggtgtttgag tggtgattgg ttggtgtttg gttttgtttg aggggtggat 2220 

ta H ggtttt tggagatgtt tgttatgggt agggggtggg aagtaggtat 2280 

ggggagatga ggttatggtt gtgattttgg tggggtttgg gaagggggtt gtaggttatg 2340 

gg B?!: taat9 gtttttttga atggttgtag ggtgggttga gtttggtggg tgggaatttg 2400 

ggtttgtgga tttggaagga gtggggttat ttttattaga gttttttttt tgtatttttt 2460 

_tttggggtga- ggttaagggt- taaaaagtaa tataaagtgg ttaatt-ttag ttaaaaattt -2520 — 
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3960 
4020 
4080 
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tttttttgag ttatttttgg attaggggaa agagaaatgt ttagggtggt ttttgtttta 2580 

ttggaattta tgggggaagt tttaggatag tgttgggata agtttagtgg gggttttttg 2640 

tgaaaaataa gagttgttaa aatatggttt gataaggttg aaaatattga atttttttgt 2700 

gttagttttg tgttttttat ttttttttgt attagtgtta ggttgtttag ttggaataaa 2760 

ttattttggt ataatttttg ttttttggtt ttaagtgatt tttttgtttt agttttttta 2820 

gtagttgaga ttataggtgt gtgttattat gtttggttaa tttttgtatt tttagtagaa 2880 

attataggtg tatgtttgta attttagtta tttagaaggt tgagggagga gaattgtttg 2940 

aatttgggag gaggaggttg tggtgagttg agattgtgtt attgtatttt agtttgggta 3000 

ataagagtga aattttgttt taaaaaaaaa aaaaagattt tttttagaaa attgaagttg 3060 

aaggaatatt ttttaattta aatttatttt ggtatttttt ttttgtattg attgttggtt 3120 

tatattttaa ttttttggaa tttggagata atgttggata attggtattg gtaggtttta 3180 

tgttttttta gagaattgta ggatatatgg aagtgattta ggaaatattt ggtgattatt 3240 

atagatagaa tttttgaaga tggatttttt gtttaaagat tatttttttt aagaattttt 3300 

tttagttatt tttttaagtg tttggatgga gatgtgttgt gagtgtgaat tatatatgga 3360 

atttttttgt ttttttgata tgtataattt tattataaaa tgttaaaaaa tgtagtattt 3420 

taaaaaatgt aagatgtatt gttgatatta atttttataa gtgtgttgtg ttgttatata 3480 

taataaattt tttaaaaatt attaaatatt tagggatatt aggtagggtg tggtggttta 3540 

tatttgtaat tttagtattt tgggaggttg aagtaggtgg attatgaggt taggagtttg 3600 

aaagtagttt ggttaatatg gtgaaatttt atttttatta aaaatataaa aattagttga 3660 

gtgtggtggt atgtgtttgt aattttagtt atttaggagg ttgaggtagg agaattgttt 3720 

gaattaggat ttgggaggtg gaggttgtgg tgagttgaga ttatgttatt gtattttagt 3780 

ttaggtaata agagtgaaat tttgttttaa aaaaaagaaa agaaaagaaa agtatttttt 3840 

gtatagagaa ttttatgagt tatatgaaaa tttgtatagt tttatattga agttaatatt 3900 
gagttgtatt agaatttata tttttagtaa aataagttta ttgtataggt attattaaag 
gatagttatt tattttttat ttttattttt ttttttttta tttttatttt tttttttttt 
tttttttttt ttattttttg gtttgttttt ttttgagttt tttttttttg ttttattttt 

gtttttagta tgatatgtag taatattttt tttatatttt ttttgtagtt taattgtttt 4140 

ttgtttatat ggttgtttat ttttggttga ttgtttagat tatattttat ttagtttttt 4200 

tgtagttttt atgataaata ggtttgggtg tttatttttg atttttgggt gatgtttaga 4260 

gtaaaatagg aagaaggtag atggtggttt tttaggagta ttgtgatttt tttttttttt 4320 

ttttttattt ttttttatat tttggaggaa tgtaattttt aatttttttg ttatagtgag 4380 

ttttgttatt ttttgttttt tttttaaatt ttattttttt tttgtagatg tggtttttta 4440 

tttttttaaa tattttttgg agaattttat gaaattgata gaagagtttg ggtgtttttt 4500 

tttgtgtttt ttttggtagg tttgtgtg 45Z8 

<210> 342 
<211> 4616 
<212> DNA 

<213> Artificial Sequence 

<220> . . 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 342 

tgaaatggtt ttattttttt ttagtttgtt taagttaatg gagttttatt agaatgtgag 60 

ttataaaata gtagagttat tttttattta atgttgtatt tgtagggttt agaatagtat 120 

ttgaaggatt ggtggaggtt taataaattg ttaaaagggt agattagtta ggtgtgatgg 180 

tgtgtgtttg tagttttagt tatttgggag gttgaggtag gaggattatt tgattttagg 240 

atttttgggt tgtagtgtag tatattagtt aggtgtttgt attaagtttt gtattaattt 300 

qqttgtttaa ggatggttga attgatttag gtggaaattg gagtaggtta aaatttaatg 360 

ttgattagta atagggttgt atttgtaaat agttattgtt ttttagtttg ggtaatatag 420 

tgagatttta tttttaaaat aaattttaaa aataattaat tagaaaaaaa aattagtttg 480 

taattttagt attttgggag gttaaggtgg gtagattatt tgaggttagg agtttaagat 540 

- - -■ - - ------ ---- ■* 600 



aagtttggtt aatatggtaa aattttaatt tttattaaaa atataaaaat tagttgggtg 
tggtggtagg tgtttgtagt tttagttatt taggaggttg aggtaagaga attgtttgaa 660 
tttgggaggt ggagattgta gtgggttgag attgtgttat tttattttag tttgggtagt 720 

840 
900 
960 
1020 
1080 



agagtgtgat tttgttttaa aaaaaaaaaa aaagaaagaa aagaaaagaa aaagaaaaga 
aaagaaaaaa attgggaggt ttaagtttat ttttggtgtt tttatatttt tttgttttat 
ttttttgtat ttagtttttt tttgtaattt gtgttttata ttagttttta agtttttaat 
gtgatttagt atgagaattg gattttgtta tttttttgtt tataatattt tatgtttttt 
ttttgtttag aatattattt tttttattgt ttttattaat ggaattggta tttttttaag 
gatatgatta aatttgttta tttttatatt attttttaaa gtagaattta tttttttttt 
tttaatatga tgatattgat agggtttgtt tttatttatt agattgtgag ttgtttaggg 1140 
taggtagtgt tttttgtttt tgtttttgtt tttttttttt gagatagggt tttgttttgt 1200 
tatttaggtt agagtgtaat ggtatagttt tagtttattg tagttttaat tgttttggtt 1260 
taaattatta ttttatttta gttttttgag tagttgggat tataggtata tgttattata 1320 
tttggttaat ttttttgtat ttttagtaga gatagggttt ggttatgttg tttgggttgg 1380 
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JJ?* 9 , 33 *" 3?gatttaa ?Jf a H3 a P ta "«agtt ttttaaaatg agggattgtg 
S"?"^ H^fHH* "agtttata gtttagtgtt ggatttatgg tagtattlai 
taaatatttg ttgaatgtaa tagtaaatag tattttaggg agtaagaatt aqattaataa 
aggtggtaaa aggtttggag aaaaaaataa tagtttaatt tggttagagt atgagggaga 
gtagtaggag ataagatgga aaggtttttt gggtaaggtt ttgaaggaag ttggalgttt 
J^n 9 ^* 3 ^ atgtgtatat tgtggtaggt agtggggagt taatgaaggt SttgfgSg 
9 aga ?* aa ^9 tgttgaaaaa taaatatagg ttaaatttat tagagttttt ttgatatatl 
^HSilii i ta J ttaa 9t ttaagtttgt tttttatata tttattattt aatttatttt 
tgggtttttt tagtagtttg ttttattttt ttatttgttt tttggtggag ttaggqatqt 

?SS*8£ SK222 ttta 9 tta 9 a 99 a tatgggg gttttagttt i?3g?S?t 
9 J gtttggagtt gggaagtagg ttagggttag ttgaggttgg ttggtaagta 
otSKSB 82?Ii?fS ?tttgtatag tflttaggtgg tgttttgggt tttaagttga 
2H*2J2S tt pgataatt* tttgtttgtg tatatatttg ttttttattt tatttttatt 
5£ a 2£H tgg tat 9999gag agggtatagg gttagataaa tttgtgagat tttggtttta 

SS32S J8S3IIS SES 8 ?*^ "gttttttt ttaggtttgt tttttSSS 
tttagttttt gtttttaatg tatgtatagt ttgtatatat tgtqtattqg gatattttat 
agttagttgt atggtttttt tgtgttttag tttttggttt tttttgttqi ttttaatttt 
tgttttaggt tttattgtgt aattgttgtt gttattgttg tttttggtg? SgtttSS 
" a ? a ?9"9 ttttggatgt aggaggattt ttttttggga ggaggttttt ttggggaaga 
tgatttattg ggtgaggagg atttgtttag tgaagaggat ttatttagag aggaggattt 

? a ?S atttat tt9gagagga ggatttattt ggagaggagg atttatttga 
agttaagttt aaattagaag aagagggttt tttgaagtta gaggatttat ttattgttaa 
ggtttttgga gatttttaag aattttagaa taatgtttat agggataaag aaggtlagta 
22?S?£ ^n aaattta 99 tttta 99a ggtttatgat ttttti???! XSttSgt? 
i a H?H J gt ttattta 999 aaggagggga gattgtattt tttatagaag tttttttaaa 
ggttttatat taatattttt atttttattt ttggaggtag aaagggitag atgtggaqaa 
a ^ aaaaa 999tgtaaaa ggagagaggt gagttggatg agatgggagf galggggSg 
gttggagaag agaaagggat gagaattgta gatgagagaa alaatgtqta oataoagaal 
aaaaataggt ggagaaggag agttagagag tttgagggga agagaaaagg Ila|tttg?q 

?ltal?ala§ S223E? ^ aa 9 taa 9 a a 9agt?gg?a gialtta??? XtSSg? 
? a J aa ^ a ?f aaa " gaga * ttaggaagaa gggatatagt aggtagagaa atgtggtttt 
ttgattttta agttaggaat ttggggaaag gggttggaga ttatataaqq taqaooaata 
agtggggaga agaaagaagg gagaaaggaa agltggtgta tttatttatt tgaqitttS 
gattgaagtg tttatttatt tttttttttt ttttttttga gataaatttt titttttatt 
gtttaggttg gagtgtaatg gtgtgatttt ggtttattgt iatttttatt t?ttgqqtt? 

SStSSS 232SH? llll^t tta a 9* a g«g°g a«aS| a S SS??S5 

*£2£££? 52£H8l at t £ tta 9 ta 9a gatggggttt tgttatgttg gttaggttgg 
ttttgaattt ttgattttag gtgatttaat tattttggtt ttttaaaotq ttaaaatta* 

tSt&££ 32128*2 ggtttgaagt agttatttat ttttatSgat StaagSS 
tgattgtaag ttggtaggat tgttgtttgg tttatttaqt tqtqatotta aatttoaotn 
tggttttttg tgttttgtat ttggtttgtt taaggtattt gttatttSa atgtttttqt 

SSfSttS t&ttggl att ? ttt? " «g??aggaa SggStgS ?8S3tS 
ga 9^99"^ a H at . ttatataggg gatgattaga gttattggtg ttatggiggt 

KB222 3 ? attt ^ a H ^ggaattta gttttgtggl ttttttttat 

t?S?S2f ??nnn^??^ ^H 9 ?-^ 9 -** "a*"3ttg tttattgttt tattttttta 
?S2S2» iS2f * aagtttttg atttaggtgt tagatttttt tattatattt 
„iH a 5 t1: 3 a ggtgatttgt tttggttttg ggtgttttta gtttgtgtgg gttqttttta 
9"H5 9gt 9 gatatttgtt tttagtttgt tgttttttgt ttggttttgt gttttttqqS 
atttttgggt ttttagtttt tgttgttttt agaattgtgt ttgtgtaata atggttataq 

tStSSSS! SSS? tt1: SS gg " g ggg?g gggtltaggg !3g£!?tg 

H?i g 2HS5 fltttgtttgg gggttgggtt ggttttattg ggtggggttg gtttatttqt 
ttttttttat gtagtgtaat tgattttgtt ttttgggtta gagatggttt tqqqttttqq 
gtgggagtat tgggttttgt agttgtattt gtattggggg gttgtlqqtt qtttaaatt? 
ggagtatatt gtggaaggtt attgtttttt tgttglggtg StpUg? ?g|??I 9tt1: 

<210> 343 
<211> 4616 

<212> DNA ' 
<213> Artificial sequence 

<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 343 



1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4616 



\ » 



i > 

J??,*,*??^*. ^H^HH "ggtaggga aatggtggtt ttttatagtg tgttttgagt 
Sif??SSJ J2S?S££ tagtgtagat gtagttgtag agtttggtat ttttgtttgg 
gatttagagt tatttttagt ttaggaggta gggttaqtto tatfgtgtag ggagaggtaa 
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gtgagttggt tttgtttggt agggttagtt taatttttgg gtaggtattg ttgtgatggt 240 

tttttttttt gtgttttgtt ttatttttaa gttttggtgg ggagattttt ttattattgt 300 

ggttattgtt gtgtaggtgt agttttggga gtggtgggag ttggaagttt aggagtttta 360 

gggggtgtag ggttgggtag aaggtggtga gttgggggtg gatatttatt ggggattgga 420 

agtggtttgt gtaggttggg gatatttggg gttagggtgg gttgtttggg gtgggagagt 480 

atagtgagga agtttgatgt ttaggttagg aatttaggga atttgggtag aaaaggtgag 540 

ggggtgggat ggtgggtggt gggtgggatg tttgggarta gtgtttaggg atggttgtag 600 

gggagattta tagagttggg tttttagatt gggtttgtgt agtgggtggg tgttttattt 660 

ttatagtgtt aatgattttg gttatttttt gtataaatga aaaggatgaa ttgtttaagt 720 

ttagagtttt aatttttttt tggtgattaa aatgatgtta taaatgtaga tgttttatag 780 

gagtattatg ggtaataaat gttttaagtg ggttaggtgt aaagtatagg agattgtatt 840 

taaatttaat attgtagttg ggtgggttaa atagtaattt tattagtttg taattattgt 900 

tttagggttt gtaaaagtga gtggttgttt taggttaggt gttgtggttt atgtttataa 960 

ttttagtatt ttgggaggtt agggtggttg gattatttga gattaggagt ttgagattag 1020 

tttgattaat atggtgaaat tttgttttta ttaaaaatat aaaaattagt tgggtgtggt 1080 

ggtgtatgtt tgtaattgta gttatttggt tagaggttga ggtaggagaa ttatttgaat 1140 

ttgggaggtg gaggttgtag tgagttgaga ttgtgttatt gtattttagt ttgggtaata 1200 

aaagtgaaag tttgttttaa aaaaaaaaaa aaaaaaagtg agtgggtatt ttagttttga 1260 

gttttaaatg agtgagtata ttattttttt tttttttttt tttttttttt tttatttatt 1320 

tttttgtttt gtatggtttt taattttttt ttttaaattt ttggtttggg agttaagaag 1380 

ttatgttttt ttatttgttg tgtttttttt ttttaggttt taattttttt attgtagttt 1440 

aagatgagat gatttttatt agtttttttt gtttgttttt ggtatttatt ttatttttta 1500 

agtttttttt tttttttttt tttaaatttt ttgatttttt ttttttattt attttttttt 1560 

tttgtttgta tatttttttt tttatttgta gtttttattt tttttttttt tttagttttt 1620 

tttttttttt ttattttatt tagtttattt tttttttttt gtattttttt tatttttttt 1680 

ttatatttgt ttttttttat ttttgagagt ggggatgggg atattggtat gggatttttg 1740 

gaagggtttt tgtggggagt atagtttttt tttttttttt gagtgaatag agtttaggtt 1800 

ggggtatggg aggggagtta tgaatttttt ggaatttgga tttggagatt gatgattatt 1860 

tatttttttt gtttttgtgg gtattatttt ggggtttttg aggattttta ggagttttaa 1920 

tagtaggtag attttttaat tttagggagt tttttttttt tgatttaggt ttaattttag 1980 

gtagattttt ttttttaggt agattttttt ttttaggtag attttttttt ttgggtggat 2040 

tttttttttt gggtgaattt tttttattgg gtagattttt tttgtttagt gggttatttt 2100 

ttttagaaga gttttttttt aagggggaat ttttttgtat ttggggtaat ttttggggat 2160 

ggataggtat tagaagtagt agtgatagta gtagttgtat agtgaggttt ggagtagggg 2220 

ttgggattaa tagagggagt taggggttgg ggtatagggg agttatgtgg ttgattgtgg 2280 

ggtgttttag tatatggtgt gtatgggttg tatgtgtatt ggaaatgaga gttgggtggg 2340 

ggaggagtaa gtttggaggg gagtaggttg atttatagag tgtttttttg tagagatgga 2400 

gttaaagttt tataggtttg tttggttttg tgtttttttt tttatattaa agttaggatg 2460 

ggggtggagt gaggggtagg tgtgtgtata ggtagaaggt tattggggtt atggatttag 2520 

tttggaattt aaggtattat ttggtattat gtaggttttt tttggtatta tttagttgtt 2580 

tgttagttag ttttagttaa ttttggtttg ttttttagtt ttaggtatag aaggggaaag 2640 

gtaggggagt tggggttttt atgttttttg gttgagaggg aaagtagttt atgtatatat 2700 

ttttgatttt attaggaagt aggtaaagag gtagggtagg ttgttagggg agtttgaggg 2760 

tgagttaagt aatgggtatg tgggagataa atttgagttt gaatgaaaag taagtgtatg 2820 

tgttagaggg gttttgatag gtttaattta tatttatttt ttaatatatt atttttttgt 2880 

ttaaaagttt ttattggttt tttattgttt gttatgatat gtatattgtg tatttttgat 2940 

ttttaatttt ttttaaaatt ttgtttaaga gattttttta ttttgttttt tattattttt 3000 

ttttatattt tagttaaatt aaattattat tttttttttt aaatttttta ttatttttgt 3060 

taatttagtt tttgtttttt gaaatgttat ttattattgt atttaataaa tatttattta 3120 

gtattattat aggtttagta ttgggttatg gattagggat atggaaatga ataagatatg 3180 

gttttttatt ttgggaggtt gaggtgggtg gattgtttga gtttaggagt ttgagattag 3240 

tttgggtaat atggttaaat tttgttttta ttagaaatat aaaaaaatta gttaggtgta 3300 

atggtatgtg tttgtagttt tagttattta ggaggttgaa atgggatgat ggtttgagtt 3360 

gaggtggttg aggttgtagt gagttgagat tgtattattg tattttggtt tgggtgatag 3420 

agtaagattt tgttttaaaa aagaaaaata aaaataaaaa taaaaaatgt tatttgtttt 3480 

gagtagttta taatttagtg agtgagggta aattttgtta atattattat attgagggaa 3540 

gagagatgaa ttttgtttta gaagataata tagaagtagg taaatttgat tatgtttttg 3600 

aagaaatatt agttttattg gtaaaaataa tagggagaat gatattttgg gtaaagggaa 3660 

ggtataaaat attatgagta ggagagtgat agaatttagt ttttatatta ggttatgttg 3720 

agggtttgag ggttagtatg gagtatagat tgtaggggaa gattgggtgt aaggagatgg 3780 

ggtagaagag tgtgaggata ttaagaatgg gtttaggttt tttagttttt tttttttttt 3840 

tttttttttt tttttttttt tttttttttt ttttttttga gatagagttg tattttgtta 3900 

tttaggttgg agtgaagtgg tgtgattttg gtttattgta atttttattt tttaggttta 3960 

agtgattttt ttgttttagt tttttgggta gttgggatta taggtatttg ttattatgtt 4020 

tagttaattt ttgtattttt ggtagagatt ggggttttgt tatgttggtt aggtttgttt 4080 

tgaatttttg attttaggtg atttgtttgt. tttggttttt taaagtgttg ggattatagg 4140 

ttggtttttt tttttaatta attattttta aaatttattt tagagatggg attttattgt 4200 

gttgtttagg ttggagggtg gtggttattt ataggtgtga ttttgttatt gattagtatt 4260 
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gagttttgat ttgttttaat ttttatttgg gttagtttaa ttatttttag gtaattaggt 4320 

tgatgtagaa tttagtgtag atatttgatt ggtatattat attatagttt agaaattttg 4380 

ggattaagtg atttttttgt tttagttttt taagtagttg ggattatagg tatatgttat 4440 

tatatttggt tagtttgttt ttttaataat ttattaagtt tttgttagtt ttttaggtgt 4500 

tgttttaggt tttatagatg tagtattgaa tagagaatag ttttattgtt titgtagttta 4560 

tattttagta aggttttatt ggtttaaata aattaaaaga gaataaggtt atttta 4616 

<2I0> 344 
<211> 4374 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 344 

ttggtgtagt agtgggggag gggattggtg agaggggagg aaggaagggg ggaggaaggg 60 

ggagatttgt ttgaatattg taataaaaat aaagtgagaa gaaagaagtg gatttatttt 120 

tatgaggtat tttttttggt tgttatagtt tttgttaaga gttttgtttg gttggggttt 180 

tttgtgttgt tgttgttgtt tgaagattat aatggtttga aatgatggtg ttttaaagga 240 

gt 3 g H ggtg a 9agtttttt ttatggatgt tgttgggttg gtgtgtgaga gtaattttta 300 

gatttttggg aaggagatag agattgataa taaaatgggt tgttagtggt tggagagtga 360 

gagataaaga gtgtgggtga gggaagtggt tgtagttagt atatttatgt tgattggtga 420 

tggtttaagt gtgttaatgt gtgtgtgttg gtgtttggtt ttgtttttat tgttttttat 480 

tggtttatta gtgaagtttg atttttgtta ttatttttta gtaaaatatt tttttttgtg 540 

tttgaattag agtgggaaat gaggttgagt tatggttttt tttttaaatt tattaattat 600 

tttgtaatat gtaaatgtat tgtttgtttt tatataaaat attgttttat gtaaagtagt 660 

gttggggttt tgtgttagtt gattggatgt tttttttttt tgtttggtgt aattggtttg 720 

tttgtttgaa tatgaatata tttgtttttg gttttgttgg tgggttttgg tttttttttt 780 

tggtgtaggg tttttgagtt tgggttttgg tgtttgtttg tatttttttg ttttgtgtat 840 

gttgagattt ggtgtggttt gttttgttgt tgtgaatttg agtggttaag tgaaggtttt 900 

tgggttgggg gttgtgtttt tatttttttt aagttggatt tgggattttt agtttttgga 960 

ggg 3 g B ag £ ttttgtgttg tttttttttg ttaattttag gatagtgtgg tttgttgttt 1020 

tg 3 t ^ t I gtt tgtttttatt ttattttagt tggggtttta agatatagag tatagtggtg 1080 

g H?I a "" * g "H£H 9 9 H t 3 ttgga agaagaggaa aggggtggga gtgttattgg 1140 

g H ttggtag ttttatttta gttgtggagg ttgtggtgaa gtttaggttg agaggaggtt 1200 

gttgggttga gaattaaatg aggttagagg tttttttagt ttaagttttt agggtttgtt 1260 

taaatttttt attttgtttt ttttgttttt tttttttttt tttttttttt tggaagttgt 1320 

9 §i g J g 5 agt ggagtagagg tatagttttg gttggagagg tttgagtaaa tatgttattt 1380 

attttgtgta ta gagggttt ttgtgaaaag ttttgaagag tttttattaa atatttatta 1440 

atttttttta tgtgttattt gttgattaag tgtggttgtt ttttgtagtg tttgagagga 1500 

ggggaagtta ggggagataa gaggggaggg ggagtggaag tttgggtggg ggggtggatg 1560 

attaatgttg ggagggattt tttttttttt tttgtgtgtg tgtgtgtgtg tgtgtgtgtg 1620 

tgtgtgtgtg ttttattggt gatgttggta tgttagtgtg tgttgtgtgt gttattggtt 1680 

atttgtgtgt ggtgtgtgtg tttttgtatt atgatttttt tttttgggtt tgtttagttt 1740 

a H?S2 gtag """ttttta gttgttgtta gaaaattggg gaggaaagta agtaaatggt 1800 

tttttttttt tgattatata ttagaagttt atttgttgaa tgttgtatga ttaggatata 1860 

ttttagtata ttttaaaggt gttttttgga gtttgtagtt aaggtgatag gagagtgagt 1920 

ggttgtttgg ttttgtgttg ttttgagttt ttttggtggt tttgggttat ggagttggga 1980 

aggagaagga aggtttgggt tgttgtatat attttaggat agggtgaggg ggaatagggt 2040 

tgaagagttg aggtagggag gttgaaatga agtagttgaa gggtatgtta gatttttttt 2100 

gatttaggga aagtgttgaa agttagagtt tattaataaa atttgtgtga tttgaatttg 2160 

ttgttgattg gttgtgttgt tgtagttttt ggtagttggg atagtgagga tgttgtaagt 2220 

tgttttgggt tgttgggtta tatgtgggtt ttattttttt gtttttgttt gggatttagt 2280 

agttttgtga ttaaggagtt tgggtagggt ttttgggaag taaaggtttt tttggtttta 2340 

g " agagg 5 g ggg«gtata attttttttt ttttttgggt tggggagggt tgttggtttt 2400 

S^HH 11 ^ HH gag H tg ta g t g tt ggg aggaaatttg ggttagaggt gaaggaggtg 2460 

gtgttgggtg tttaggtttg tggttttttg ggtgaggttt tattttgtlg gagtgtgitt 2520 

HiSS^SSSi g"ggtaagt ggggttgtgg ttttgagttt tagattagtg aaagttggtt 2580 

tttttttttt tttttttttt attttttaag tgagtattaa agaagtttgg atttaagatt 2640 

J aag " tg P gtgtatgagt gagggatttg tgggggggtg gggggggggg aagtgatttt 2700 

?^2™ 9 gagg J ga 2 ta ttatttttat aattttttaa ttagtgattg agtggattag 2760 

tattttttta agttttgttt tggaaagaag tagttgtaat tttgttgtta ttttttttta 2820 

gggtattggt tttttgaatt agtttttttt tttatttttt taatttttat tttgttgtgg 2880 

agaaaagttt taataaaaaa ttagaaaagg taaggagtga ggagtgaatg ttattgatgt 2940 

t^ttgggtgg ggaagggggt tttgggaaag atttttggaa atttttttgt ataaataaaa 3000 

aataaaaag g att tggttgt tt ttt; atgtg ttaatatttg gggaggqqaq tttaatttta iQfiO__ 

■ _ _ _ * i 
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gaatgaggtt gagtttttaa agtttaagga gagaggggtg agagtgagat ttttttgttt 3120 

tttttggatt ggttttaagg aaggtggggg tgttgtttta ttttgtgggg agttttattt 3180 

tttttgttaa gtattggttt tgggggtgaa gtggtttagg ggtgtggggg ttgagattga 3240 

ggtgtgggtg tggttgttag aggtgtgtat aaaattgaga taatttgtag gttgttattt 3300 

taattgtggt tattaatatg tttgaggttt atttgttgtt ttttaagatg tgggggggtg 3360 

gtggggattg tgttttaggt tttgtttttt ttttttgttt ttttataggt ttttgttttt 3420 

ttttattttt tgttttttgt ggttagaggg aggatttttt aggttggtgg gtgtgttttt 3480 

gtgggggttg ttggggttgt tttatttggt agtggggttg tttgtgtgga gggagttttg 3540 

ttgggggatt gtggtttgtt agagtttgtt tagagttttt agttagtatt gatgtgttaa 3600 

tatgaggtga ggtaagtatt ttatttgtat tgatttttgg tatttttttg tgtttatttt 3660 

attaggtttt tggggaggta aagaagtgtt tatttttggg gggaagtgtt tttagttttt 3720 

agttttagaa ttaagagttt tttatttgga gttttgatag ttttgatatt tttaaatttt 3780 

tttttttttt ggttaagtag aggtgtagtg ttggtgtgtt ggaaagggtg aagattggta 3840 

ttttttggag aaggaaagtt ttggtagata tttgtgttat tttgtagtgt tttaaatgtt 3900 

tttttttttg tagattttgg gaaggtggga ttttttgttt gtggagttgg taaatgggtt 3960 

tagtgagttt taaaggtaaa ggtttttatt ttggttgttt agtttttggt gtttttttta 4020 

tttttaggtt tttgtttttt tttttttttt agtaatttgt tggtttttgt taatttggta 4080 

gttttgttga gtttagagtt tagaaaagtt aagaagattt tagagttgtt ttgtggtatt 4140 

tttaggttgt ggttattgtt tggtttgggg ggaagttgtg gtgtttttgt ttttttttgt 4200 

tttttgttaa atggttagtg ttaggggttt tttgggttta gtttttagtt tttttttgtt 4260 

tgttttggga ggttagtttg tttttttttt tttttttttt tttttttttt tttttttttt 4320 

tttttttttg tttttttttt gtttattttt aaaatatttt taaaataata ttgt 4374 

<210> 345 
<211> 4374 
<212> DNA 

<213> Artificial Sequence 

<220> ^ 
<223> chemically treated genomic DNA (Homo sapiens) 

<400> 345 

ataatgttat tttaaaggta ttttgggggt gggtagggag aagatagaga gagaaagaga 60 

gagagagaga gagagagaga gagagagaga gagataggtt ggttttttag agtgaatgga 120 

agaggattga ggattgaatt tagggagttt ttagtattgg ttatttggta ggaggtgagg 180 

aagggtgagg gtattgtggt ttttttttaa gttgagtagt ggttatagtt tgaaggtgtt 240 

atggggtggt tttgggattt ttttggtttt tttaggtttt gggtttggtg aggttgttgg 300 

gttagtggga attggtaggt tgttggagaa aggagaggga taaaggtttg gaagtggggg 360 

aggtattagg gattgggtgg ttgggatggg agtttttgtt tttagggttt attggatttg 420 

tttgttagtt ttgtggataa gggattttat ttttttaggg tttgtaaaaa gagagatgtt 480 

tggggtatta taaagtgatg taggtgtttg ttaagatttt ttttttttaa aaaatgttag 540 

tttttatttt ttttagtgtg ttggtgttgt gtttttgttt gattggggaa agagagagtt 600 

tggaggtgtt agaattgtta gaattttgga tgggaggttt ttggttttga agttgggagt 660 

tggaggtatt tttttttaga agtgggtatt tttttgtttt tttgaaggtt tgatggggtg 720 

ggtatagggg aatgttaggg attggtgtgg gtgagatgtt tgttttattt tatgttgatg 780 

tattaatatt ggttagaagt tttgggtagg ttttggtgag ttatggtttt ttagtgggat 840 

tttttttatg tagatagttt tgttattggg tgggatgatt ttggtggttt ttgtgggaat 900 

gtatttgttg gtttgagagg tttttttttt ggttataggg ggtggagggt aggggggaat 960 

aggggtttgt ggggaggtgg ggaaaggaga tagaatttaa agtgtaattt ttgttgtttt 1020 

tttgtgtttt ggagggtagt aagtgggttt tgggtgtgtt ggtagttgta attaaaatgg 1080 

tgatttgtga gttgttttgg ttttgtatat atttttggta attgtgtttg tattttagtt 1140 

ttggttttta tatttttagg ttgttttgtt tttagagttg gtatttggta gagggagtga 1200 

ggttttttat gaggtagaat agtattttta ttttttttga gattagttta gagagaatga 1260 

ggaaatttta tttttatttt ttttttttta aattttgaaa atttggtttt attttggggt 1320 

tgaatttttt tttttaaata ttgatatatg ggaagtagtt aagttttttt tgttttttgt 1380 

ttgtgtggag gagtttttag gagttttttt tggagttttt ttttttattt aatgatgtta 1440 

gtggtattta ttttttgttt tttgtttttt ttgatttttt gttaaaattt ttttttgtag 1500 

tgagatgggg gttggagggg tgggagaagg agttgattta aagagttagt gttttggggg 1560 

gaaatagtaa taagattatg gttgtttttt tttgaggtaa agtttaagaa ggtgttaatt 1620 

tatttggttg ttaattgaga ggttgtaaaa atgatatttg tttttagttt ggatagaatt 1680 

attttttttt tttttgtttt tttgtaagtt ttttgtttat gtatagtgga tttgagtttt 1740 

gagtttaagt ttttttaatg tttgtttgga aagtaaaagg agagagaaag agaaagttga 1800 

tttttgttgg tttgggattt agggttgtgg, ttttatttgt tggtattttt tgaaagttgt 1860 

gtttttgtga aatgaaattt tgtttaggag gttgtggatt tggatatttg gtgttatttt 1920 

ttttattttt gatttaggtt tttttttggt gttgtgagtt. tttggggaag ggttagagtt 1980 

ggtagtttt* tttagtttgg ggaggggaga gggttatgtg atmatgt tggttagggt 2040 

tggggaggtt tttgtttttt gggagttttg tttgggtttt, ttggttgtag ggttgttggg 2100 
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ttttaggtag Qaatgagagg gtgaggttta tatgtggttt ggtggtttag ggtgqtttqt 2160 

agtgttttta ttgttttggt tgttaggggt tgtggtgatg tggttagtta gtagtgagtt 2220 

* agg 2 g I g t a ?. a IIII at I gatgagtttt gatttttagt atttttttta Igttaagaag 2280 

ag I2 ag I g t ^HIIII 99 ! tgttttattt tagttttttt gttttagttt tttagtttta 2340 

I222IIS "" g ttttg 99gtgtgtat agtagtttag gttttttttt ttttttttgg 2400 

""^tggtt tgaagttgtt gagagagttt gggatagtgt aggattaggt agttgtttgt 2460 

IH"I g " a IHI aa I tg I aggttttgag gggtgttttt ggagtgtatt gaggtgtgtt 2520 

22228S aaa I?- ga I tt: "Qgtgtgtg gttagaagag laalgttitt 2580 

tatttatttt tttttttggt tttttggtaa tagttgaagg ggagttgttt ttgtggattq 2640 

^^^attta ggagagggag ttgtggtgtg gagatatatg tattatitat agitgattgg 2700 

8E5S2 5 atatatt 9 a tattgttagt gggatatata tatatatati 2760 

H tatatatata tagagagaga gagagaattt tttttagtat tggttatttq 2820 

2IIIII at I taggtt ttta tttttttttt ttttttattt tttttggttt tttttttttt 2880 

}9SSSS tt 9 t gaaaagtagt tgtatttagt taataaatgg tatgtgggag aagttggtga 2940 

f* 9 "! 99 !?. aggatttttt agggtttttt ataagaattt tttgtatlta aagtalgtgg 3000 

tgtgtttatt tgggtttttt tagttagagt tgtgtttttg ttttgttgtg tattgtggtt 3060 

HI? aaagga ?. aaa9gagag aagaaagggt ggggagagtg gggtggagga tttggatagg 3120 

IIII ggagg I ttgggttggg gaggtttttg gttttgttta gtttttggtt tggtaatttt 3180 

JtSSSSt 8823S ?9?JS tt9 ta9 "99aat ggagttgtta ggStttagtS 3240 

a *?"tttgt tttttttttt tttttttaag gggttaggtg ggttggggtg tggttgttgt 3300 

8J&28J £ff352E I2gS ttggg at 9gggtggg ggtgggtggg ggtggggtgg 3360 

S??™n8? I?2222S 9 "ggtaagaa aggatagtat agaaatttgt attttttgag 3420 

? a "?ggagt "tgagttta gtttaggggg agtgggggtg tgatttttaa tttagaaatt 3480 

III2I 9 tfltttaagtt tgtggtagta gggtgggttg tgttgaattt tggtgtgtgt 3540 

ggagtgggga gatgtaggtg agtgttagag tttgggtttg ggggttttgt gttggggaga 3600 

ggag " ggga tttattggtg gagttgaaaa taagtgtatt tatatttaaa taaatggttt 3660 

aattgtatta ggtggggaga gggagtattt aattggttgg tgtgaggttt tggtgttgtt 3720 

* aatatttt9 tgtgagagtg agtggtgtat ttgtatgttg tggagtgltt 3780 

agtgggtttg aaaagggaat tgtggtttgg ttttattttt tgttttggtt taggtgtaqq 3840 

ag J aagtgtt "gttggagg atgatgatag aggttaggtt ttgttaatgg gttigtgagg 3900 

agtggtggag gtgaggttgg gtgttggtat atatatatta atatatttga gttattattl 3960 

attagtatag gtgtgttggt tgtagttatt ttttttattt atatttttta ttttttattt 4020 

25?2§2 9a ^ a9tt 3 a J tttattgtta atttttgttt tttttttagg aatttgagaa 4080 

2iI2II a I ata II aattt agtaatattt gtggagaaaa tttttattig taattttttt 4140 

a f aa ^ a H23 HIIIH 33 * tattgtggtt tttaagtaat aatagtagta taaaaaattt 4200 

* aa " aaa * a aaa "tttga tagaagttgt gataattaga aaggatgttt tataaaggtg 4260 

??2822 tttttttttg ttttattttt attgtaatat ttagataggt tttttttttt 4320 

tttttttttt tttttttttt tttttgttgg tttttttttt tattgttatg ttgg 4374 

<210> 346 
<211> 2534 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA CHomo sapiens) 
<400> 346 

??^?S^ aa 9 I 99 5 a 5 agt tggggagttg gattttattt tttgtgagtt ttgttggtat 60 

ttgatggtat gtggtttgga gagggtaggt gatttggtgt ggagggttag agggtaaatt 120 

5S???S?I 2EK22! SSSS**^ gaaagggaaa «tgtgtag? gatJggSS 180 

Si 9 . H a aatttattgg gtgattaatt ataaaggttg ggttggagtt ttaqagqttq 240 

SSnnnn^ 522^ ttaa gtggta tttt gaaagttgtt atttg?g?at tttgggl^? 300 

tagaggggat tttgaggggg aatgaggttt ggaggatgga attattttta gqtagattoa 360 

gaaggagttt ggattttatt tttaaatata gtttggagtt tataggttag aSgttttaat 420 

gggagaaaag ttaaaggaag agggtgtaga aaggagtttt agggaattgq tqqttatatq 480 

EES28S SEES? 1 !" """gaga tttagtgttt ttatttttS ggttttgtgt 540 

SffiSI??? i" a ^ gg 5 gg ggattaaatt tgattgtgat atgaaagtgt ttgggalatt 600 

ttatggtttt atttaaatat gagttatttt tatttgaatt aaggggggaa qttatttqqt 660 

aggattagga attttatttt tttgaatttt tatgggtttt gttgiggttg iaqtagttaq 720 

ggg " aaag I tagtttttag tttttggaag ggtattgtga aagtgglSt Satttgadaa 780 

£8222 gtta I gtgat Sttagttttg aafaagaggg ggtagtttgg 840 

rnllH ?f aa ; SSHIHSIS Hggtggggt ttatgtttag attttttttg ttgattgttt 900 

I2 a l§ «III a tttttatatt ttagtttttt tatttttatt gtagagttgg aaagggtgtg 960 

? ggaa ? agag 9agagggagg taggttttgg gttttggttt tgttttttgt ttttttttat 1020 

*22222 ?IIIf2lI a I ttagttaaaa ggtaggttaa gagtaggaga gatatagagt 1080 

ttggtattgg ttt^gtag i H4o 
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gagagagtta gtttgattta gaaggttaag ttggtagagt aggttgaatg ttatgaggat 1200 

atggtagttt ttatgaaagg tgttgtggag aagggtgagg agtttttttg tgaagagtga 1260 

aatttgtttt tagtagttta taagaatgtg gtgggtggtt agagggttgt ttggagggtg 1320 

ttgtttagta ttgagtagaa aagtaatgag gagggtttgg aggagaaggg gtttgaggtg 1380 

tgtgagtatt gggagaaggt ggagattgag ttttagggtg tgtgtgatat tgtgttgggt 1440 

ttgttggata gttattttat taaggaggtt ggggatgttg agagttgggt tttttatttg 1500 

aagatgaagg gtgattatta ttgttatttg gttgaggtgg ttattggtga tgataagaag 1560 

tgtattattg atttagtttg gttagtttat taggaggtta tggatattag taagaaggag 1620 

atgttgttta ttaattttat ttgtttgggt ttggttttga atttttttgt tttttattat 1680 

gagattgtta atagttttga ggaggttatt tttttggtta agattatttt tgatgaggtt 1740 

atggttgatt tgtatatttt tagtgaggat ttttataaag atagtatttt tattatgtag 1800 

ttgttgtgag ataatttgat attgtggatg gttgataatg ttggggaaga ggggggtgag 1860 

gttttttagg agttttagag ttgagtgttg tttgttattg ttttgttttg ttttttttag 1920 

ttttttattt tgttgagagg attagtatgg ggtgggaggt tttatttttt ttttttaggt 1980 

gttgtttttg ttttaaaggg ttttgtggag agggattggt agagttgagg ttatttgggg 2040 

ttggggattt tatttttttt gtagttgttg agtgtattta attattggtt atgtttttat 2100 

ttttgttttt tgtatttgtt tttttttgat tttaggatta ggttattttt tttttttttt 2160 

tgtttttttt ttgtttttgt tgtttttgat tgtaggaatt gaggagtgtt ttgttttgtg 2220 

gttgagaatt ggatagtggt aggggttgga gatgggtgtg tgtgtgtgtg tgtgtgtgtg 2280 

tgtgtgtgtg tgtgttagtg taagattgag attgagggaa agtatgtttg ttgggtgtga 2340 

ttatgttttt ttttaataaa gtttttttgt gatatttttt ttgttttttt tttagttttt 2400 

ggtgatgggt tgggagtggg attggaattt gatttagaga ttttgatttt ggatttttga 2460 

gttagggttt tgaatttttt aggtggttta gtggtttgta tgtaagattt tgagtttagg 2520 

tgaggttggg gttt 2534 

<210> 347 
<211> 2534 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 347 

ggattttggt tttatttgga tttaaagttt tgtgtgtggg ttattgagtt atttagggag 60 

tttagggttt taatttagag gtttaaagtt agggttttta agttagattt tagttttatt 120 

tttagtttat tgttaagaat tggaagagag ataggaggag tgttataggg gaattttatt 180 

gagaggaaat atggttatat ttagtagata tgtttttttt tagttttggt tttgtattgg 240 

tgtgtgtgta tatatatata tatatatata tatatatatt tatttttagt ttttgttatt 300 

gtttagtttt tagttataag gtgggatatt ttttaatttt tatgattaga ggtagtaggg 360 

gtaggaggga ggtaagagga ggggagaagt agtttggttt tggggttggg aggaagtggg 420 

tgtggagagt aggggtgggg gtatgattag tggttaggtg tgtttaatag ttgtaagaag 480 

agtgggattt ttagttttag gtggttttag ttttgttagt ttttttttat ggagtttttt 540 

ggagtaagaa tagtgtttag gggagaaggg tggggttttt tattttatat tagttttttt 600 

ggtagggtgg gggattggag ggggtagggt ggggtggtgg tgggtaatat ttagttttgg 660 

ggtttttggg gagttttgtt tttttttttt ttggtgttgt tggttgttta tagtgttagg 720 

ttgttttgta gtagttgtat gatgagggtg ttgtttttgt aggagttttt gttgagggtg 780 

tgtagattag ttatggtttt gttgaaagtg gttttggtta gagagatggt ttttttgggg 840 

ttgttggtga ttttgtagtg gaagatggaa aagtttaggg ttaggtttag gtggatgggg 900 

ttggtgggtg gtattttttt tttgttgatg tttatggttt tttggtaggt tgattgggtt 960 

gagttaatga tgtgtttttt gttgttattg gtggttattt tggttaggta gtggtagtag 1020 

ttatttttta tttttaggta gaagatttgg tttttggtgt ttttggtttt tttgatgagg 1080 

"tggttgttta gtaggtttag tatggtgttg tatatgtttt ggagtttagt ttttattttt 1140 

ttttggtatt tatgtatttt gggttttttt ttttttgagt tttttttgtt gtttttttgt 1200 

ttaatattgg atagtatttt ttaggtagtt ttttggttgt ttattatgtt tttataggtt 1260 

attgagagta ggttttgttt tttgtaggag agttttttgt ttttttttat ggtgtttttt 1320 

atgaaggttg ttangttttt atagtgtttg gtttgttttg ttagtttggt tttttggatt 1380 

agattggttt tttttatggt tttggggata tataggtggg tggtgggtta attgttgttt 1440 

gggattaatg ttggattttg tgtttttttt gtttttggtt tgttttttgg ttgggtggtt 1500 

aggtttagag aagggtgggg aggagtaggg ggtgggatta gggtttaaga tttgtttttt 1560 

tttttttttt tttttatatt ttttttggtt ttijtagtaaa ggtaaaaagg ttgggatgtg 1620 

ggggtgaatt attagaatag ttagtaggag aaatttgggt gtgggtttta tttgtattgg 1680 

tattttttta ggttttaggt tgtttttttt tgtttQgggt tggtattata tggttgttta 1740 

tattaggaaa tggtttttta aattag^ttt atttttatag tgttttttta ggg^ttaagg 1800 

attggtttta gtttttggtt gttttagttt tgatkgkgtt tataaaggtt taggai^gatg 1860 

gggtttttaa ttttgttagg taattttttt ttttggttta ggtgaggata atttatgttt 1920 

aggtagggtt atggagtttt ttaagtattt ttat&ttatg attgaattt^ dtttttatta 1980 
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tgtttttgtg atatatatag gattatagag atgggaatat taagttttag agaggggtga 2040 

gatttgttta aagttatata gttattaatt ttttgaaatt tttttttgta tttttttttt 2100 

ttagtttttt ttttattgag gtttttgatt tatgagtttt agattgtgtt tggaagtgag 2160 

atttaggttt ttttttagtt tatttgaaga tggttttatt ttttaggttt tatttttttt 2220 

tagggttttt tttgagtttt tagaatgtgt aggtggtagt ttttagagtg ttgtttaatg 2280 

aagtgtttaa taagtagttt ttgaagtttt agtttagttt ttgtaattag ttatttagta 2340 

ggtttgttgg atatattttt tattattata taattttttt ttttaagttg gtggtttgtt 2400 

gttatttgtt tgaggattta ttttttggtt ttttatgtta ggttatttgt tttttttaag 2460 

ttatatgtta ttaggtatta gtggggttta tagagggtgg ggtttagttt tttggttgtt 2520 

ttattttttt gttt ~ 2534 

<210> 348 
<211> 10001 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 348 

ttaagggatt tgtttgtttt ggttttttaa tgtgttggaa ttataggtgt gagttattgt 60 

gtttggttta ataaggaatt ttttaaataa agttttgtgg ggttgattag attaatttat 120 

attttttgag tagattttgg tataatttat tatttgtagt attgaatgtt aaggtttgtt 180 

ttttattttg aaattatatt tttttttttt tattggaatt gaaattttat ttttttatga 240 

aatgatagtg atggtggatg gtatttgttt tttttaatat ttttatttga taaaataaaa 300 

gttagtaatt tattttgatt tttaattttt tgttggtgtt tgaaatttta aaattgagat 360 

ttaaagtata gttttggttt tggagagatt tttaggagag ttagagttta gaagggagta 420 

ggatttagga ggtttttatt ttttagtatt ttagttgagt tagttgggtt atggaatatt 480 

attgagtaat taaaatatta ttaatagata aaaaaagttt attgaatata aaatttaaag 540 

gtattaatag ttttgggttt aagagattta tggtaggaag ttaagagttt tgttttaggg 600 

ttggtttggg tagttttgga agaagttatt gtatatgata gtgatgagtg ttaggaaaat 660 

agtatatttt tggaagttta tttgttggtt attgttttta tttaggttgt ttattagttt 720 

ttttagtttt tttttattta tttttttttg aaagtgatag gaaatatatt tattattaag 780 

ttagttttaa ttttagtttt attaaatatt tagattttta tatatttagg ttggttttta 840 

gttttttttt tttattgggt agttgggtat aaggtgggta ggaggttttg ggagttatta 900 

agagggttta gtgagtaagg agagagatag atatttattg gtgagtattt ttagtgtgtt 960 

ttttttttgg atatagggag gatattggat gtttttttga gtatgtatta gtttgaattt 1020 

tataggagtt tagtgaggta ggtattatta ttttattttg ttgatgagga aattgaaggt 1080 

tgttgttttt taatattagt taggtagatt tagaaaagaa tattttttat tttttaagat 1140 

aaggttttgt tttgttgttt aggttggagt gtagtggtat aattataatt tattgtagtt 1200 

ttaatttttt gggttttagt aattttttta ttttagtttt ttaagtagtt gggattatag 1260 

gtttatgtta ttatatttag ttaattgttt atttttagta gagataaggt tttgttatgt 1320 

tgtttaggtt ggttttggta attagagtga ttttattttg ttatttaggt tgtattatag 1380 

tttattgtaa ttttgaattt ttgggtttaa gtaatttatt taagtagttg ggattataag 1440 

tgtatgttat tatgtttggt taatttttat ttttattttt gtagagatgg agttttgtta 1500 

tgttgtttag gttggttgaa tttttgggtt taagagattt ttttgtttta gttttttata 1560 

ttgtggggat tataggtata ggttattgta tttagttgaa aaatattttt ttaatattag 1620 

ttgaaataat tagaaaaatt taattagtta taaaaataaa attaaaaaaa ggaaagaagg 1680 

atttgtttta tgtgtttaaa gtttgatatt taattttggg aattattaat gagttagtat 1740 

agggagggtg agggataaat tgaaggttga tgtatttttt tataaaatgt tttagattat 1800 

atattttagt tgggtttttt tatttttata tattttttta gtttggtata taaaaatgga 1860 

aagaggttaa gtatagtgtt ttatatttgt aattttaata ttttgggagg ttgaggtggg 1920 

aggattgttt gaatttagga gtttaagttt agtttggata atatagtaag attttatttt 1980 

tataaaaata aaaattggtt aggtatggtg ataatatttt tgtagtttta attatttggg 2040 

aggattgttt gagtttagga gtttgagatt atagtgagtt atgatttagt ttgggtgata 2100 

gagtgagatt ttgttttttt attataaaaa aaaaaaaaaa aaaaaggagt ttgggtatgg 2160 

tggtttatgt ttgtaagttt agtattttta gaggttgagg agggtagatt atttgaggtt 2220 

aggagtttga gattagtttg attaatatga tgaaattttg tttttattaa aaatataaaa 2280 

attatttggg aagttgaggt aggtagatta tgaggttagg agtttgagat tagtttggtt 2340 

aatatggtga aattttgttt ttattaaaaa tataaaaatt agttgggtgt agtgggatgt 2400 

gtttgtagtt ttagttattt aggaggttga gataggagaa ttgtttgatt tttggaggta 2460 

gaggttgtag tgagttgaga ttgtgttatt gtattttagt ttgggtaata agagggaaat 2520 

tttattttaa aaaaaaaaaa aagaaaagaa aagagagaga gagaatgtgt tagtgttgtg 2580 

■tgattttggg ataattattt tttttttttg ggtt*ttggtt ttttaagttt ttaatgatag 2640 

gattaaatag atggatttgg aaggttttta tagtatatgg ttttttggaa tgttgtagga 2700 

aatagttatt aattaagtat ttttatttgg gttttatttt tattttagtg gagttatgaa 2760 

gttaaagtgg ggagggggt^gggreraatfr -t-tgg^tatttg "tattgatragg taxt t ggaat 2820- 
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tgggggagag atttaatttg tatttttata tttatattag gatttttttt ataggtttgt 2880 

gttatttatt tttataaagt tgggtagttt tttgtgtaga agttttttta tttttttttt 2940 

atttagtttg aatttgttgt ttttttggta ggagtatttg tggaaggtag tgattagtat 3000 

agttagtgtt tgttttagag aattgtatat tatggatttg tggttgtaga ggtgttaggt 3060 

gatgggtata ttgttgaggt ttttggggtt ttagtttagt ttgtaggatt tatttttgtt 3120 

ttatgttttt aagttatttt ataaaatttt tttttgaatg ggaaggtagg ggttgtattt 3180 

tttttttaga ttgtggtttt ttgaggttgg aggtgatatt tttattttat tttgagggta 3240 

gatgttaaag ttttttttgt gtttgggagt tattaagttt tttttgatgg tatttgggat 3300 

atttttttga gggtagggtt ttttatttta gtgtttgtta gaagggaaat gatagggtga 3360 

ttagagtttt attttgatgg tatttgtgta ggttttgggg agtagggtta tatgtattgg 3420 

ttggtagatg ggggtttttg tgtagtttgg taggagttat ggttatagtt taggagtagg 3480 

aagttttttg aaggagttgt tttttttttt agatttggga tttttgaagt tagaggtatt 3540 

tttttttgtt atttaattgg aagttttttt gggtaggagt gttttttttt ttagattagg 3600 

agtttttaga gggtggggaa tgtgtttttt tttattttga aggtttttga gtttttaggt 3660 

agataggggt atggataggg tgggggtaat ttaatatttt ttttattagg aaggattttg 3720 

ttggaattag aattaaagat atgaaatgtt tattgttttg ggattaggat gggatagatt 3780 

ttagtggtaa ggtttttgag aatatttaga ttagtttgga aaggttaggt aggttttagg 3840 

gtgaggttag aggagttatg ttagaggtag ggtatgtttt gggtttatta tgttagtttt 3900 

gtttttattg aggtaggggg tagagttttg ttgtttgggg ttatgattta ggtagagttt 3960 

tgagattttt gttttttgta ttttagtttt gttagaattt aggattttgt aaaaaaaaaa 4020 

aaaaaaaaaa aaaaaaattt taggattttt tttgtagtaa tgtttatatg tatatttttt 4080 

gtttttaggt ttagattttt aaagaagggt ttttagattg agatatttag gggagggttt 4140 

tagagtaatt tttgtattga gattttatag aggaatttgg agaaaaattt tttagttggt 4200 

tgggtgtggt ggtttatttt tgtaatttta gtattttggg aggttgagat gggtggatta 4260 

tgagattagg agattgagat tattttggtt aatatggtga aattttattt ttattaaaaa 4320 

tataaaaaaa ttagttgggt atggtggtgg gtgtttgtag ttttagttat ttgggaggtt 4380 

gaggtaggag aatggtgtga atttgggagg gggagtttgt agtgagttga gattatatta 4440 

ttgtatttta gtttgggtga tagagtgaga ttttatttta aaaaaaaaaa aaaaaaaaaa 4500 

aaatttttta gttaatattt tagtatttta tagagagttt ttttaagggg gaaggtttag 4560 

agaggtttag gttgatttta ttagggaagt tggaatgaga gaattgtgga gataaagata 4620 

gagatttatg gggatttttt ttattttgtt tttagtagga tagaggtatt tattagatta 4680 

ggtggtagag atagatttag gaaggagagt aaggtagtta ggtttttagg gtgaggattt 4740 

atatgtgggt tttattggtt tttaatttgg tattttaagg aaattaaatt tgttttaatt 4800 

tgattttatt aaattttggt tttttgttat ttttatttat ttttttttag tttttttttt 4860 

tttagagagt gttagtttta ttattagttg ggggagggtt ttaggttagg ttaagggtag 4920 

tttttgagtt tatttaggtt atagtgggaa gtgggaggtg ttgtggggat tgggtatttt 4980 

ggattggggt gaggggaggt aatttttgtt ttggggtttg gagattggtg ttaggtttta 5040 

tttttatgat atgtttgtga gttatagttt atttttgtgt gagaggttgt gagtatatgt 5100 

tgagatttta tggtttagtg tttgtgtata tatatgtgtt tgtgtgtgtg agagagaagt 5160 

aatttggggt tatttttagg aagttggggg ttagaggagg tagttggggg atattaggag 5220 

gagaaggata ggattgattt tagggtatgg atgaagtgga gtgattttta atggttagta 5280 

ttagaatatt ttttaatgag gtaatgaggt aattaagtgt gtttagaaat atttgggaag 5340 

gtttttttgt gtaggaaggt gtttggaagg atggggaagt agtgagaaat ggggttgttt 5400 

tatatatagt tgtagttggg ttgggttata gtttgtgggg agtataagga agttttaagt 5460 

agttaaaggt tgaataggaa aattttggtt tttttttttt tagtttgagg gttgggtggg 5520 

tgggtagggg tatattttta tttaaggttg aaaatattag ttggatagtt ttttaagtta 5580 

gaatttaggt ttgaagtata gtttttagtt atagtttttt tttattgttt tttatatata 5640 

tgtttttttt agatttggag aaaggtaaat attgtggtta tttagagttt agtttaaggg 5700 

agggttgttt ttttttaggg tttttttttt tttggttttt tgtttatttt ggtttgtttg 5760 

ggtattaggg tattgttttt tttgtggttt tagttttttt ttttgtttta gttgttgagt 5820 

agtttttttt ttttttttag tttttttatt ttagttttaa gtgatttatt tgaggtttta 5880 

ggtttagatg tttaggtagt gatttagtgg tatatttttt ttttttaggg ttagggtagt 5940 

agtagtttta gagtagtagg tggaagtttt agtttagttt ggtagggaaa gtattggtat 6000 

agagatttta aataggtttt gttaggagga gttttggata atgatatggt aggaggatta 6060 

gattgagaat aggttggttt ttttttagtg gttattgtgt gatttggggg ttagtttgag 6120 

tagtgtatat gaagttaggt tttggggttt ttagtgtttt tttttatatt ttggtatagt 6180 

tattgagtgt tttttgtttt ttttagtatt tgaatttttt atgttatttg tttttttatt 6240 

gaggtttatt ttaagtagaa tttggaataa aaatttttta tattgagggt gagggataag 6300 

ggagtgtaat gtattgattt ttatagttat tttttttttt taattaaggg tftttatagg 6360 

gttttttttg tttagaaatt gtagtgtatt tttatggttg ttttttaaag ttaggtggta 6420 

tgtttttatt tataagtggg agttgaataa tgagaatata tggatatagg gaagggaata 6480 

ttatatatta gggtttgtgg gtagttgtgg ggagaaggat gggggtaagg ggagggagag 6540 

tattaggatt aatatttgat gtatgtgggg tttaaaattt agatgatggg ttgataggtg 6600 

tagtaaatta ttatggtata tgta.tattta tgtaataaat tatattttgt atatgtatgt 6660 

tagaatttaa agtaaaaata aaataaaatt aaattaaaat taaaaatttt taaaaagttt 6720 

ggtagtgagg tgttttagat ttggtttgtt gttgtgtgtt ttttaaatat attggta^at 6750 

gtaattagtg tggt-taagat ttttggtttg tgttgatttt tttgtttaga atgtgtttat 6 §40 

atatgtti;3t gttttttatt ttgttgaaat ^taajittttt ttttaaaggt tattgtaggg 6900 
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agttttttta 
tttttgtttt 
tgtttttttt 
aggtaagggt 
ggggttattg 
attgattatg 
tgggttggaa 
ggtttttagg 
gtaggtagat 
atagtagtag 
tttttgtttt 
ggtgataatt 
tatggtaggg 
tatatttttt 
tttaaagtag 
attttatttt 
ttagttgttt 
tttttttttt 



gtgttgattt 

ttttggatag 

gaagttattt 

ttgagtttag 

atgatagagt 

gagaaaatta 

ttttttttag 

tttaagtttt 

tatgaatttg 

aaaataagtt 

tgtattaatt 

tattatattg 

ttgttggggg 

tattatttta 

tagaggagaa 

ggtatgatga 

tgataatttt 

tatttttttt 

agaagtttat 

ttgtggattg 

ttgttaaggt 

tatattaatt 

attttttgtg 

aagtaatttt 

tagtttagtt 

aattttagta 

agtttgttta 

gtggtataag 

ttgggaggta 

gagtaagatt 

tattgttaaa 

atatttgtaa 

gggtatagtg 

ttaagtttag 



gggtttaatt 
ttttgtgtgt 
tagttttatt 
tttgtttata 
gaggagtgtt 
ttatgaaata 
aagagtttta 
atttagtgag 
gaagaagata 
agttgtgggt 
aggaattgag 
gtatttattt 
taggagagtt 
agttgtttat 
tgattttgtt 
tgggttttag 
tgtgtttttt 
tttgtttgtt 
atttttaagt 
tttttgttta 
taaaagttta 
gagtttgagg 
aagattttgt 
ggggatagaa 
gttgtttagg 
gatattgttt 
agtttaaatg 
ttgaggtatt 
ggatttataa 
ggtttaggtt 
gtgtagattg 
tttaatttat 
tattgagatt 
atataggtgg 
gagagtttta 
ttattttata 
ttttttttta 
gatttttttt 
tggagtatag 
tttttagtaa 
gtgatggagt 
tttgttttgg 
tggattaaat 
ttttgggagg 
atatggtaaa 
tttgtagttt 
gaggttgtag 
tttttttaaa 
tatttatttt 



ggatttagtg 
atttagagaa 
tttatatggt 
ttaggtgtta 
taggagtaag 
attatttttt 



gaaggaagaa 
gtttatgttt 
gagtttaaga 



gagagttata 
ggagttgagt 
aagagttagt 
tattagggtt 
gttgtgattg 
gttattaatt 
gtatggaatg 
aaaggtagag 
ttggtttagt 
gtgttagaaa 
ggtggttaag 
agattttttg 
ggttttttta 
gaagtttagg 
gggatatggg 
ttgtagtgag 
tttaattaat 
gtgagggatt 
ggaaagagaa 
ttggtttatt 
tttttagtag 
ttgttttagt 
gtgttttttg 
attttggatt 
aaattttgaa 
ataataattt 
taaagtttgt 
ttataatgtt 
atattttatt 
tttagtttat 
tttttgattt 
aatttttgtt 
tggtattatt 
ttggggttat 
tttgttattt 
ttttttaaag 
ataaatataa 
ttgaggtagg 
attttatttt 
tagttattta 
tgatttgaga 
aataaattaa 
atgttgttat 
atggatataa 
gtaattttaa 
ttagtatggg 



seq_prot . 
tttatttgtt 
ttttgatttt 
gagtttattg 
attagtgttt 
ggagtttttt 
ttttggattt 
aatttttgag 
aaatttttta 
tagatggaaa 
agtttttgaa 
tgtttattaa 
tgttgtgtaa 
gtttttagag 
tagttaagat 
tttaggaagt 
ttttttatta 
gtgtgttatt 
gtgttttgtt 
tatttatatt 
tggatgtttg 
tatttgggag 
ttatgattgt 
taattaatgt 
ggttattgtt 
atggttttat 
gtgtgattgt 
aaggattttt 
aaaatgtatt 
gtagtatgat 
tgttatttat 
gtatttgttg 
taggagttgg 
taaatttata 



txt 

ttgttttgta 
gattgaatta 
gttttattta 
atggttatta 
ttattttgtg 
taagagtgga 
gagtgatagt 
gtgggtagag 
agttagggag 
ggtttgtttt 
atggttatat 
ttttgggtaa 
tttgatttta 
ataagtaggt 

tgggttttta 

gtggttttgt 
gtagtgttgg 
tatgatttaa 
tgttttatat 
agatggtagt 
ataggaggtg 
gttattgtat 
aagtttagag 
aagttttgtt 
tgtgggggtt 
tgattttggt 
tttattatgt 
aaataattta 



taaagtgttt 
tttgtgtatt 
tttttgtatg 
tatggtaata 
aaattatatt 



aaattttttt 
tttatttaga 
ttgttttatt 
tggtattttt 
tatttattta 
atagtttatt 
aggtgtgtgt 
tgtttagatg 
tgttgggata 
aagagattgg 
tagattattt 
tattaaaaat 
gggaggttga 
ttatgttatt 
aaatgaatat 
ttttttatta 
gaaaaatttt 
tattttggga 
taatataatg 



tagtatgtga 
aatgggtgtg 
tgtgttaggt 
aataagatga 
taattagtaa 
tagaagtttt 
ttagttttag 
ttttaatttt 
tttttttgtt 
tttgagatat 
gtagttttat 
tattgtgtgt 
ggttttgaat 
ttaatgtgag 
gtatggtggt 
gaagttaaga 
ataaaaatta 
gttaggaaaa 
gtattttagt 
gttatttaat 
gtatgtaggt 
tattaataaa 
ggttagggta 



attgagataa 
aggaaattgt 
atggagatag 
aaaaatagag 
taattaggtg 
atttgagatt 
tttttttttt 
attttatttt 
tttgtggata 
tattttgttt 
gggattttgt 
ttttttgtta 
ataggagata 
agatgaggtt 
tagatatttg 
gaaagattat 
tttagtttgg 
ataagggtag 
tatttatttt 
ataggtttag 
tgagtggtgt 
tttgttttat 
ttttttttgt 
ggatttatgt 
tgtatatttt 
gttttttgta 
ttttatttta 
ggggtgggga 

aattattttt 
tttttttgtt 
agtaggtgtt 
tttaaatgtt 
agatttattg 
gtgatttttt 
atttttttgt 
ttttgagttt 
ttattatgtt 
ttatatttgt 
gtttaagatt 
gttaggtgtg 
ttgtttgaat 
ttgggtgaaa 
ttaatatttt 
atataaatag 
ggattgggtt 
ggaggattat 



<210> 349 
<211> 10001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic 
<400> 349 

ttattatatt gtttatgttg gttttgaatt 
tttttaaagt gttgggatta taggtatgag 
ggaaattttt tttatgttta tttttttttt 
t;Tggtggaaa aata^taata taaggtaaat 
tatgtttatt ttfraatttgt ttttgagaaa 



DNA (Homo sapiens) 



6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10001 



tttaggttta 
ttattatgtt 
tttgtaaata 
gtttggt^gt 
^gattttgttL 



agtgattttt 
tagtttaatt 
tttgtttgta 
iagaaatatta 

•fcttt-tatt:;ta- 
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ttgttttggt 
ttttattgat 
tatttatata 
ggttaagtgg 
gg^tggagtg 



60 

, 120 
180 
, 240 
,300 



seq_prot.txt 

tagtggtatg attttgggtt attgtaattt ttgtttttta ggtttaagtg atttttttgg 360 

tttagttttt ttgagtagtt gggattatag gtttgtgtta ttatgtttgg ttaatttttg 420 

tatttttagt agagatgggg ttttgttatg ttaggtaggt tggttttgaa tttttggttt 480 

taggtgattt gtttgtttta gttttttaaa atgttgggat tataggtgtg agttattatg 540 

tttagttttt tttatattta tatttaattt gggttaggtt gggtatggtg gtttatgttg 600 

gtattttagt attttgggag gttgaggtag gaggattgtt tgagtttaag agtttaagat 660 

ttgtttgggt aagatggtaa aattttatta ttataaaaga tgtaaaaaga tgtgtatagt 720 

ggtgtatatt tatagtttta gttattgagg aggttaatgt gggaggatta tatgaggttg 780 

tagtgagttg tgatggtgtt attgtatttt agttttggtg atagtgagtt tatgttttaa 840 

ataagtaagt aaataaaaat taaaaagaat ttagtttata gggtatttga aggtaagagg 900 

aaaagatgtt agaattagag atggggagaa gatgggtttt tggtgtttgt tgaggttgag 960 

aaatgagata gataggttga gtgtggggtg gagagaggat gggtagagag attgaggttg 1020 

gtttgaatgg aaatgaaatg ttagggtttt tagggttatt ggggaataat tggagttttt 1080 

aggaaaggtt taatgttgtg attatttgtg tttgttatgt ttttttattt tttattaatt 1140 

gtgtgaattt ggtagatttt gagttttagt gttttttttt gtgaagtggg gttattttat 1200 

tttaattttt gggattgttg tgtgaattaa atggggtaat gtatggagag tatttgatgt 1260 

atagtgagtg ttttaaaatt ttagtttgta ttttttagta aaggatatgt atatgtttat 1320 

tgtgagtgat aaatttagga tgatttgaat ttaatgtgat aatgtgggtt tttgtatgtt 1380 

ggttatgttg ttgggagata tttatggatt taattagtat aataggggaa ataaattatt 1440 

taatgtattt tgttaagata gaatatttta gaatttattt tgtggggtgg ggtataataa 1500 

agggggtttt tttgttgaaa atgtttaagt ttaggtttgt ggtattattt aattaaggtt 1560 

gatagttata tagtaagtta gaggtaatgt taggatttaa attaaatttg tggtttttat 1620 

aatgaggtta tttttttttt ttttgaatgg tttggggaaa gggggtgggt gggtagaatt 1680 

tggtagtggt taatttttta tttttgtttt ttggtttttt tttgttttta tttttaggtt 1740 

tgtattgatt gattgattga gatagggttt tgttttgttg tttaggttgg agtgtagtgg 1800 

tatgattatg gtttattgta gttttaaatt tttaggttta agtggttttt ttgtttttta 1860 

ttttttgagt atttatattt ttaggttttt aaaatggttt ttaggtattt ggttgttgtt 1920 

ttagatattt atttgggttt ttgggtaggg attgtttggg aaattttatt tatgtgaagt 1980 

aggtgtgggt gtaggaaggt tgtttggaaa tgaattagta ttgttttttg tttgagttgt 2040 

aagtagggtg ttagagggtt tggtggataa gaaagggagg atgataggag gttggtattg 2100 

taatgatatg ttttagttat tagagggtat gaagtagttg ggtaaaattt tgtggggttg 2160 

ttggtggaaa atttttggta tttggagttt ggagatgggg tggatggaat gtgaggattt 2220 

agttttttga ggttgggttg gggtagagtt attgttttgg atgtttgtag ggtttgtttg 2280 

tgttttgatt attttgtttt tgtagatagt tggagaatgt gagagtggga ttgggattgg 2340 

attttagggg ttattttgta taattttttt gttttgttgt gggggaggga gttgtttaag 2400 

gttatgtagt aagttagtgg taaatgaata tgattattat tagttttagg tatatggtta 2460 

tttgatgggt gtagttgtag ttttagtttt tgagatagag atatttgatt aaggataggt 2520 

ttttaggagt tgattttagt gatttgtggt tttgttgttg tttttgtttt tttttttggt 2580 

ttttttattt gattgatttt ttgttttttt tgtttgtttg tttgtttttg tttttgtttg 2640 

ttggggggtt tgtttaattt ttttattggg ttttgggagt tgtagttttt tgttgttatt 2700 

ttttagggat ttgtagtttt ttgaagtttt tttttgattt gttgttttgg ttttattttt 2760 

gggatttaga ggagaggtga ttattttgta gtatagttag tggtgtgatt ttatggggtg 2820 

agaaggattt ttttgttttt aagtattttt ttagtgattt ttgttgttat gtggtagttg 2880 

taagtattgg ttggtatttg gtgtgggtga gatttttatt ttatgtagaa atgagtaaga 2940 

ttggtgagtt tattatgtgg gggtgaggtt gagagaaaat aagtatatag gtgatttagt 3000 

taaaattaga attttttaag tatatatgaa aagggtaaaa ggggtgtttt gtataggata 3060 

gaataggtag atattgaatt tggttgggtt ttgggaaggt tttttgtagt ggtttttgaa 3120 

gggggggttg gattttagta ggatagaggg tatgggtatg tgtgggtatg ttttgaatag 3180 

aggggttagt gtaagttgag ggttttggtt atattagttg tatgtgttgg tgtgtttaag 3240 

ggatatgtag tagtaggttg agtttggagt gttttattgt taggtttttt aaaaattttt 3300 

aattttaatt taattttatt ttatttttat tttaagtttt ggtatatatg tgtagaatgt 3360 

ggtttgttat ataggtatat atgtgttatg gtggtttgtt gtatttatta atttattatt 3420 

taggttttaa gttttgtatg tattaggtat tagttttaat gttttttttt tttttgtttt 3480 

tatttttttt tttgtaattg tttataggtt ttggtatgtg gtgttttttt ttttgtgttt 3540 

atatgttttt attgtttaat ttttatttat gagtgagaat atattgtttg gttttaaggg 3600 

atagttatgg ggatgtattg tagtttttga gtagggaagg ttttgtggag gtttttagtt 3660 

aaaaggaaag aatggttgtg aaaattgatg tattgtgttt ttttgttttt tatttttagt 3720 

gtgaagggtt tttattttga gttttatttg aagtaggttt tgatgggaag ataagtagta 3780 

tgaggggttt aagtattgag gggagtaagg gatatttggt ggttgtgtta aggtgtagaa 3840 

gaggatattg ggggttttaa ggtttgattt tatgtatatt gtttaggttg gtttttaagt 3900 

tatatggtga ttgttaggaa gggattagtt tgtttttagt ttgatttttt tgttatgtta 3960 

ttatttaaag ttttttttgg tagggtttgt ttggggtttt tgtgttagtg ttttttttgt 4020 

taggttgggt tggggttttt atttattgtt ttgggattgt tgttgttttg gttttggggg 4080 

aggagggtgt gttgttgagt tattgtttgg gtatttgggt ttggaatttt gggtgagtta 4140 

tttagggttg aggtagaggg gttgggggag gggaagaagt tatttgatag ttggagtagg 4200 

gaggggagtt ggggttatag gaagggtggt gttttgatgt ttagatgggt tgggatagat 4260 

aaagggttaa ggaggaaggg gttttgggag ggggtagttt ttttttgggt tgggttttga 4320 

atggt^atag tgtt£gtttt tttttgggtt tggggaggat atgtgtgtgg ggggtagtga 4380 
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gagagggttg 
gttggtgttt 
ggagaagagg 
tttttataga 
tgttttttta 
tatatttagt 
ttttatttta 
gttttttttg 
aggtgtatgt 
tatataggag 
ttaggtttta 
atatttttta 
gagttttttt 
aatgagtggg 
tttttaaaat 
ttggttgttt 
gttttgttga 
tttttatttt 
agaatttttt 
ttgagatgga 
tgtaaatttt 
attgtaggtg 
tttattatgt 
ttttaaagtg 
tttaggtttt 
ttagtttagg 
gttgttgtaa 
tggattttag 
gttttaagta 
tagggtatgt 
ttttaggttg 
ttaaggtaat 
gagaatgtta 
tttagggtga 
tatttttgtt 
gttttgagtt 
tatggttttt 
tttttagagt 
ttggtagata 
gaggtttggt 
aggtattgtt 
ttatttagag 
ggttgagtta 
atagatttat 
ataagtattt 
ttttgtataa 
tttggtgtga 
gtaggtgtta 
ggaaatgagg 
attatgtgtt 
aaattagagt 
tttttttttt 
ggttggagtg 
atttttttgt 
taatttttgt 
ttttgatttt 
atggggtttt 
tttttggttt 
tttttttttt 
tjagtttattg 
aggagtattg 
ttgtttaggt 
gtattggaat 
ttaagaaagt 
ggagaaatgt 
ttttaaagtt 
ttttgttttt 
ttttagttgg 



tggttgaggg 
ttagttttgg 
aggttaaagt 
ttgtggttta 
tttttttaga 
tattttatta 
tttatgtttt 
gtttttagtt 
atgtatatga 
tgggttgtga 
gagtaggggt 
ttttttattg 
ttagttggtg 
aatggtaaga 
gttaagttgg 
tgtttttttt 
gggtagggtg 
agtttttttg 
gtgaagtgtt 
gttttgtttt 
tttttttagg 
tttattatta 
tagttaggat 
ttgggattat 
tttgtgaggt 
ggtttttttt 
aaagaatttt 
taggattaag 
gtaaagtttt 
tttgtttttg 
gtttgagtat 
gaatatttta 
agttgttttt 

ggggagatat 

taagggagtt 
taaggaggga 
gttagattgt 
ttgtgtagat 
ttaaaatggg 
gatttttaga 
tttggtttta 
aggggttttg 

aggttttaag 
gatgtgtagt 
ttgttaagag 
ggagttgttt 
gtgtgggggt 
gggtggggtt 
tttgagtggg 
gtgagggttt 
ttagaaggga 
tttttttttt 
taatggtatg 
tttagttttt 
atttttagta 
gtgatttatt 
attatattgg 
ttgaaagtgt 
ttttttgtgg 
taattttgaa 
ttattatgtt 
tgggtttgaa 
tatagatgtg 
atgtagggat 
attgattttt 
aggtgttagg 
gtggttggtt 



ttgtgtttta 
gtagggatgt 
ttttttgttt 
gtttaattgt 
tattttttta 
ttttattaag 
gaagttagtt 
ttttaagggt 
gtattggatt 
tttataggta 
tgtttttttt 
tggtttgggt 
gtggagttgg 
ggttagggtt 
gggttagtgg 
tttgggtttg 
gggaggattt 
gtgggattaa 
gaagtgttga 
gttgtttagg 
tttatgttat 
tgtttggtta 
ggttttgatt 
aggagtaagt 
tttagtgtag 

*gggggttta 

gagatttttt 
gtgtaagagg 
gttttttgtt 
gtatagtttt 
ttttagaggt 
tatttttaat 
attttattta 
attttttatt 
tttagttaga 
aatgattttt 
ataggagttt 
gttattaaaa 
gagttttgtt 
tataagggaa 
gggaattata 
tgaggtggtt 
agttttagta 
tttttggagt 
ggtgataagt 
agttttgtgg 
gtaggttaaa 
tagttttttt 
agtgtttaat 
tttgagttta 
aaagtgattg 
tttttttttt 
attttggttt 
tgagtagttg 
gagatagggt 
tgttttggtt 
ttaggttggt 
tgggtttata 
tggggggata 

tttttgggtt 
tggttaattt 
tttttgggtt 
aggtattgtg 
agaaaagttt 
aatttgtttt 
ttttgaatat 
aaattttttt 



seq_prot.txt 
ggtttggatt ttggtttggg 
atttttattt atttatttag 
agtttttaat tatttgggat 
ggttgtgtgt agagtaattt 
tatagaggga tttttttagg 
aggtattttg gtgttggtta 
ttgttttttt ttttttaata 
ggttttaggt tgtttttttt 
atgaagtttt agtgtgtgtt 
tgttatgaga atgaggtttg 
tattttggtt taggatgttt 
gggtttaggg gttgtttttg 
tattttttgg gagggagggg 
tggtgggatt aggttgaggt 
ggtttatata taaattttta 
tttttgttat ttggtttggt 
ttgtgggttt ttgtttttgt 
tttgggtttt tttgggtttt 
ttgaagggtt tttttttttt 
ttggagtata gtggtgtgat 
ttttttgttt tagttttttg 
atttttttgt atttttagta 
ttttgatttt gtgatttatt 
tattgtgttt ggttgattga 
gggttgtttt gaggtttttt 
ggtttaggag taggaggtgt 
tttttttttt tttttttttt 
taggggtttt aagattttgt 
ttggtgaagg tagggttggt 
tttggtttta ttttgaaatt 
tttgttgttg aggtttgttt 
tttaatttta ataggatttt 
tgtttttgtt tgtttagagg 
ttttgggagt ttttagtttg 
tggtagagag agatgttttt 
ttagggagtt ttttgttttt 
ttatttgtta gttggtgtat 
tgggattttg gttattttgt 
tttagggggg tgttttaagt 
gttttagtgt ttgtttttag 
gtttgagggg gagatgtagt 
tgggggtata gggtagaagt 
gtgtatttat tatttggtat 
aggtgttggt tgtgttggtt 
ttaagttgag taagggggaa 
gggtgagtgg tataggtttg 
tttttttttt agttttgggt 
tttattttag ttttatggtt 
taatggttgt tttttgtagt 
tttgtttaat tttgttattg 
ttttaagatt atatagtatt 
tttgagatgg agtttttttt 
attgtaattt ttgtttttag 
ggattatagg tgtattttat 
tttattatat tggttaggtt 
ttttaagtga tttttgtatt 
tttgaatttt tgattttagg 
ggtgtgagtt attgtgtttg 
agattttatt ttgttattta 
taagtaattt ttttaagtag 
ttatttttgt agagatggag 
taagtaattt ttttattttg 
tttgattttt ttttattttt 
agttaagata tatagtttgg 
ttattttttt tatattgatt 
atgaggtagg tttttttttt 
aattattttg gttagtatta 
_aatttttata_gtgtgggagg 
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aagttgttta 
tttttaagtt 
ttttttatgt 
tattttttat 
tatttttaag 
ttaaaagtta 
ttttttagtt 
ttatatatat 
tatagttttt 
gtattagttt 
agtttttatg 
atttggttta 
gttgggaggg 
aggtttggtt 
ttttgggagt 
gagtattttt 
ttttatagtt 
tttttttgga 
tttttttttt 
tttagtttat 
agtagttggg 
gagatggggt 
tattttggtt 
agggtttttt 
tttggatatt 
gtatgtgggt 
tgtaaagttt 
ttgggttatg 
atgatgggtt 
tgtttaattt 
tattttgatt 
ttttggtgga 
tttaggggtt 
agagaggaaa 
ggttttagga 
aggttgtagt 
gtggttttgt 
tatttttttt 
gttattagag 
ggtgagatgg 
ttttgttttt 
ggattttata 
ttttgtagtt 
attatttttt 
atgaaggaat 
tgggggaggt 
gtttgttgat 
ttattggagt 
attttagaga 
gagatttgag 
ggtatgtttt 
ttgttgttta 
gggttaagta 
tatgtttagt 
ggttttgaat 
tttagtagag 
tgatttgttt 
gatttttttt 
ggttggatta 
ttggaattat 
ttttgttatg 
gttttttaaa 
atatgttaaa 
gatattttgt 
tattggtgat 
ttttttttaa 
aaaaagtgtt 
ttaaggtagg — 



4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

5460 — 



seq_prot.txt 

aggatttttt aagtttagga gtttgattag tttgggtaat atagtaagat tttattttta 8520 

taaaaataaa aataaaaatt ggttaggtat ggtggtatat gtttgtagtt ttagttattt 8580 

gggtggattg tttgagttta ggagtttaag gttatagtga gttataatat agtttgggtg 8640 

ataaagtgag attattttgg ttgttaagat tagtttaggt aatatagtga ggttttgttt 8700 

ttattaaaaa taaataatta gttgggtgtg gtggtatgag tttgtggttt tagttatttg 8760 

ggaggttgag gtgggaggat tgttggagtt taggaggttg aggttgtagt gagttgtgat 8820 

tgtgttattg tattttagtt tgggtaatag agtaagattt tgttttaaaa aataaaaagt 8880 

gttttttttt gaatttattt ggttggtgtt ggggagtagt aatttttggt ttttttatta 8940 

gtagaatggg gtgatgatat ttattttgtt gggtttttgt gggatttgag ttgatgtatg 9000 

tttagaggag tatttagtgt tttttttgtg tttaggagga gggtatattg gagatgttta 9060 

ttaatgagta tttgtttttt tttttattta ttgggttttt ttggtagttt ttagggtttt 9120 

ttgtttattt tatatttagt tgtttagtgg ggagggagag ttgggaatta atttgaatgt 9180 

gtgagggttt gggtgtttgg tggagttggg gttggggttg gtttggtgat gagtgtattt 9240 

tttgttattt ttaggagaaa gtggatgagg aggggttgaa gaagttgatg ggtagtttgg 9300 

atgagaatag tgattagtag gtggattttt aggagtatgt tgtttttttg gtatttatta 9360 

ttgttatgtg taatgatttt ttttagggtt gtttagattg attttgaagt agaatttttg 9420 

attttttgtt atggattttt tgggtttagg attgttgatg tttttgagtt ttgtatttaa 9480 

taaatttttt ttgtttgttg ataatatttt aattgtttag tgatgtttta taatttggtt 9540 

ggtttagttg gagtgttggg agatgagggt tttttggatt ttgttttttt ttgggttttg 9600 

atttttttgg aaattttttt aaggttagag ttatgtttta ggttttaatt ttggaatttt 9660 

aaatattagt aaaaaattgg aaattgagat aggttgttga tttttatttt gttaaataaa 9720 

gatattaaaa aaggtaaata ttatttatta ttattattat tttataaaaa gataaaattt 9780 

taattttaat aaaggaggaa aggtataatt ttagaataaa ggatagattt taatatttaa 9840 

tattgtaagt aatgagttgt attaggattt atttaagaga tatgaattag tttaattgat 9900 

tttgtaaagt tttatttaaa agattttttg ttagattgag tatggtggtt tatatttgta 9960 

attttagtat gttgggaggt taaggtaggt agattttttg a 10001 

<210> 350 
<211> 4449 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 350 

ttttttgtat atttgagagg taagaggagg tgtttgtggt ttttttgata tatatgtgat 60 

atatgtatta ttatggttta tgtgtgtgtt gttgtgtgta gggggtaggg ggttttttgt 120 

ttttgatggt tttttttggt gggtgttttt tggtttttta ggttttttta tttgttgaat 180 

gtggtagtta agggtttggg aatgatttgt tagattggtt aatgtggaaa gagttggttt 240 

tgtttagttt tgtatttttt tttttttttt tggaattgtt tggttgtaga gagtaggagt 300 

tttttagagt ttttggggga ttttattttt agtataattg tgtaatgtgt gtttagtata 360 

agagtattag atggagggat aggtgggagg aggaggaatt tagggtgtat ttagtttggg 420 

gtttgttgag ttgaggtttt tagagtaagg atgattaggt tggggggata ttttttttta 480 

atttttgtta agttatagag gggttagtgg gaataaagtg atattttggt ttttgtattt 540 

tttttttgtt gttttatttt ttagtttgtt tagatttgag gttggggagg ggtttttttg 600 

gaggggttag tttatgttat tttttgtaat ttttattata tatgttttta gatattttgt 660 

tataggtttt attttttatg ttaattttta tttgggaaag taaataaatg gaaagttaat 720 

ttgtgttatt tggttgttgg ggtatttgta gtgagtagtt aagaaatgtt aggggagttg 780 

gtgtttattt ttatgtttgg ataagagtgt gttttggatt tgtgttttgt tgtatatttt 840 

attgttttgt tgtatatttt attgttttat tgtatatttt attgttttat tgtatatttt 900 

attgttttat tgtatatttt attgttttat tgtatatttt attgttttat tgtatatttt 960 

attatttttg ttttttgtat ttagttgttt atttgtattt tagtttagga agtttagaag 1020 

atgtagaatt tttgtgagag tttagggtga aatgttgtgt tttattttta aagaaaggaa 1080 

aattatttat attttttaaa agaatgaata gtatagatta atattgattt tttttaattt 1140 

ttaggttaat tttgagtagt taaagttaga gtagttaatt tgtgttgtga gttgaggtat 1200 

agttgtagaa gtgtgtttga ggtgtttggt ggaggtggta gttgagtttt gggattaatt 1260 

attgtgttgg ggatggtatt gtgttaggat gtaggtagat ttttgtagaa gtgtttaaaa 1320 

tttatatttt tttatagggg tgagggggag ggagaaagag atgttttagt gaggataaat 1380 

attttttttt atatttaaat aattatagag tttttatttt aaagtattta taggtatatt 1440 

ttttagaaaa tatgaattgt tagttgggta tggtggttta tgtttgtaat tttagttttt 1500 

tgggaggttg aggtgggtag attatttgag gttaagagtt taagattggt ttggttaata 1560 

tggtgaaatt ttgtttatat taaaaatata aaaalaaattt agttgggtgt agtggtatat 1620 

atttgtaatt ttagttatta ggaagttgag gttgaattta ggaggtagag attgtagtga 1680 

gttaagattg tattattgta ttttagttta ggggtaatgg agtgagattt tattttaaaa 1740 

aaaaaaaaaa aaiaaagaata tatgaaatgt ttttagattt tqttatgttt ttttttttta 1800 

ttttaggtaa gttagaaagt gttattaata gtggtttttt ttaggttttt ggttagagat 1860 
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gtgaagagaa gttgggggga aattaggttt ttttttaagt tttttagttt tgttttttta 1920 

tttttggatt tgaatgtagt ttgatttagg ttattttatt gtattattat tggtggttgt 1980 

gattttgtgt aaaggtatag ttggtgatgt tgattagagt ttttgtagtt ttaaatgatt 2040 

tttttaatta attttaaatt tttagaattt attgtataaa aaggttatat tttttggagg 2100 

gatgttgatg gtattaggat agaagtatta ggggatttta tgaatggtgt tgttgaaata 2160 

gtagttttta tttgtatatt gggagggtgt gatattagga aaattataat tttgtttttt 2220 

atggggggtt attgtatatg tttttgaaag tgtataggta agaagtaaag taagttgtgg 2280 

gttgaatttt ttgatgttat tatgtatata tttatttagt tttttttttt aatgatatta 2340 

gtaattgttt agtgaggtgg atataaaatt tttaggatat gagagggaga tgtggttttt 2400 

JH^SS* 9tgtaaatat tatgtttagg gaaaatgtaa ggtgttttag gtttgtggat 2460 

tttgtatttt tttaggtaat ttatttattt attttttaat tttaataaat gattattaaa 2520 

ttttatttaa tatataaata tttattgagt attatttgtg tgtatgagaa gtgggagtta 2580 

9J at 99taaa a 9ttaggtat tgtgttaggt gagagagatt tagaaattaa aattagagaa 2640 

gttattaata agagtttaaa tatttgggtt taggtttatg tttgtaattt tagtattttg 2700 

ggaggttgaa ggaggtgaat tatttgaggt taggagttta agattagttt gattaaaatg 2760 

gtgaagtttt atttttatta aaaatataaa aaattaggtg ggtattgtgg tatatgtttg 2820 

taattttagt tatttgggag gttgaggtag gagaattatt tgaatttagg aggtagaggt 2880 

tgtagtgagt taagattgta ttattgtatt ttagtttgag tgatagagta agattttatt 2940 

ttaaaaaaaa aaaaaaaaga gtttaaggat ttgatggagg agaaaggtaa gaatatgtgt 3000 

gagataatgt aaggttattg ttttagggtg tttagggtaa ttatgggggt agggtatttt 3060 

ttggagaggt taatgataag taggttgaat aaagtagggg ggttttttgt aggaggaggt 3120 

ttattaggtg aagatggagt tgtatgggta aaggttattt tagagattta ggtgtgttta 3180 

ggaggtggaa atttattgta ggtaaggtga gaggattggg tggggtggtt taggaggagt 3240 

tgatagaggg tataagttgt gaaatagttt gaagtagggt agtgaggaaa gggatttaga 3300 

ggaggaagat atgtggatag atggggttgg ttgggggttg ttgtaggatt ttatgtaaga 3360 

ggttttaata ttagagtttt aggttttgag ttttgtggaa ttaaaggttt tagaaagtaa 3420 

tttattagga tttggtggtt gatagtttgt agtagggggt gaaagaggag tttagagtat 3480 

tttttggttt ttgtttttgt tttggggtat aggaggggag gaatttagtt tggttatatt 3540 

ggtttaggtg agggtgtttt aggggaggtt gagaggggtt gtttttttgt ttgttttgtt 3600 

attagggatt gttttgagat gtttttggag aaagtgtttt tggtttgttg ggaaggattt 3660 

gtgtttagtt tttgttgttt agtagttttt atgggaatut tgtttttttg gggtttggat 3720 

ttattgtgat gtgaaggtga ttattgttta tttttgggat attgtggggg ttaagagagg 3780 

ttttgtggtg agggaggatt attattttgg gggttggggg ggtttttttt ttagggagga 3840 

agatttttag ttttggtgtt ttgtttttgg tagattttgt tttgtttatt tgttttgttt 3900 

ttgttgtatg atggaaataa atggaaatgg tttttatata tgaatgtatt aaattgtaat 3960 

tataattttt tagattttag ttgtaatagg attatggatt tgagtgattt gtaaagatgt 4020 

HSf 9tttt 9*gttagagg gtgtgtgggg tggggagggg agtttgttat ggtttttgtg 4080 

«HH atagt ata 9999gta gagttggggg ttggggtggg ggtaagggtg taggtagatg 4140 

ggtttggggg tggttagtat ggggatttat ttagtttgtt ttgtatattg aggttatttt 4200 

tttttttggg ttttaaagtt tttttgtttt ttgttatggg tttgggggtt tttttattgt 4260 

agtttaatgt tggttgtttt tttatgttgt ttaagtttat tttttaggtt tggtatttta 4320 

tagtagagat taagaggtgg ttggagggta gtgggggtat ggatagtatt attgggtgtt 4380 

ttttttttag gtttttttgg aaatttttgt tttggaaatg tagaaagttt ttttttttgt 4440 

ttttatttt ~~ 4449 

<210> 351 
<211> 4449 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 351 

aaggtggggg taggagaaag ggttttttat gtttttaaag taagggtttt tagagaggtt 60 

tgaagaggga gtgtttagtg gtgttgtttg tgtttttatt gttttttagt tattttttga 120 

tttttgttgt ggggtattgg gtttgagggg tgggtttggg tagtgtagaa gagtagttag 180 

tattgggttg tagtgggaag atttttaagt ttatggtagg gagtggggga gttttggaat 240 

ttgagagagg aagtggtttt ggtgtataga atgaattggg tgggtttttg tgttggttat 300 

ttttaggttt atttgtttgt gtttttgttt ttattttagt ttttagtttt gttttttgtg 360 

ttgtgggatt atagaggttg tggtaaattt ttttttttat tttatatatt ttttggttta 420 

aggtttagag tgtttttgtg ggttatttag gtttatgatt ttgttataat tgaaatttag 480 

aaaattgtga ttatagttta gtgtatttgt gtgtggaaa* tatttttatt tatttttatt 540 

atgtgataaa gataaagtgg gtgggtaaga tagagtttgt tggaggtaga gtattggggt 600 

tggaaatttt tttttttgag gaggaaattt ttttgatttt taggatgatg attttttttt 660 

SS^SS 99 * H5 t1:t H at «ttatagtg ttttgggggt gggtgatgat tatttttatg 720 

ttgtgatgga tttagatttt aggagggtaa ggtttttatg gaagttgttg ggtagtggga 7an 
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gttgaatatg gatttttttt agtaagttag gaatattttt tttaaagata ttttgaggta 840 

gtttttgata gtaaagtaga taagagaata gttttttttg gttttttttg gggtgttttt 900 

atttgagtta gtgtggttag attgagtttt ttttttttta tgttttaagg tagggatagg 960 

gattggaggg tgttttgggt ttttttttta ttttttgttg taggttgtta attattagat 1020 

tttaataggt tgttttttga gatttttgat tttgtggagt ttagagtttg aagttttggt 1080 

gttagaattt tttgtataag attttgtggt agtttttagt tagttttatt tgtttatgtg 1140 

tttttttttt ttagattttt ttttttattg ttttgtttta agttgtttta tagtttgtat 1200 

tttttgttgg ttttttttag attattttat ttggtttttt tattttattt gtaatgggtt 1260 

tttatttttt gaatatattt gggtttttgg aatggttttt gtttatgtgg ttttattttt 1320 

atttggtgaa tttttttttg tagggagttt ttttgttttg tttaatttgt ttgttattgg 1380 

tttttttggg gagtgtttta tttttgtggt tattttgggt attttgggat gatggttttg 1440 

tgttgttttg tatatgtttt tgtttttttt ttttattaga tttttagatt tttttttttt 1500 

tttttttgag atggagtttt gttttgttat ttaggttgga gtgtaatggt gtgattttgg 1560 

tttattataa tttttgtttt ttgggtttaa gtgatttttt tgttttagtt ttttaagtag 1620 

ttgggattat agatgtgtgt tataatgttt gtttaatttt ttgtattttt agtagagatg 1680 

gggttttatt attttggtta ggttggtttt gaatttttga ttttaagtga tttatttttt 1740 

ttagtttttt aaagtgttgg gattataggt atgagtttgg gtttagatat ttagattttt 1800 

attaatgatt tttttggttt taatttttgg gtttttttta tttggtatag tgtttggttt 1860 

ttgttatgtt agtttttatt ttttatgtat ataaatggtg tttagtaaat atttatgtat 1920 

tgagtaaaat ttaataatta tttgttgaaa ttaaaaagtg aataaataag ttatttagaa 1980 

agatgtaaag tttataaatt tggggtattt tgtatttttt ttgagtgtaa tgtttgtata 2040 

ttaggatgtg aggattatgt ttttttttta tgttttgagg gttttatatt tgttttattg 2100 

gatagttgtt gatgttattg gagaaggaag ttggatgggt gtgtgtatga taatattaag 2160 

gaatttagtt tataatttat tttgtttttt atttgtgtat ttttagagat gtgtatagtg 2220 

gttttttgtg aaagatagaa ttgtggtttt tttggtgtta tgttttttta gtgtgtaaat 2280 

aagggttgtt gttttgatga tattgtttgt ggggtttttt ggtgttttta ttttaatatt 2340 

attgatgttt ttttagaagg tatggttttt ttatatgatg ggttttgaag atttagaatt 2400 

agttagaaaa gttatttaag attatagagg ttttgattag tattattagt tatgttttta 2460 

tatagagtta tggttgttag tggtggtgta atggggtagt ttgagttagg ttgtatttag 2520 

gtttaggaat agaaaggtag ggttaaggga tttgggaaga aatttgattt ttttttggtt 2580 

tttttttata tttttaatta aaagtttggg aagagttatt gttggtaatg ttttttagtt 2640 

tgtttaggat agagggggaa ggtatgatga aatttgaaga tattttatgt attttttttt 2700 

tttttttttt tttgaaatgg agttttgttt tgttgttttt gagttggagt gtaatggtgt 2760 

gattttggtt tattgtaatt tttgtttttt gagtttaatt ttagtttttt agtagttgag 2820 

attataggtg tgtgttatta tgtttagtta aatttttttt gtatttttag tatagatggg 2880 

gttttattat gttggttaga ttggttttga atttttgatt ttaggtgatt tgtttgtttt 2940 

agttttttag agagttggga ttataggtgt gagttattgt gtttggttga tagtttatgt 3000 

tttttaaaga atgtgtttat ggatatttta aagtaaaaat tttgtaattg tttaaatgtg 3060 

aaagaaaatg tttattttta ttaaagtatt tttttttttt ttttttttat ttttgtagag 3120 

gagtgtgaat tttagatatt tttgtaggga tttgtttgta ttttgatgtg gtgttgtttt 3180 

tagtatggtg attagtttta gagtttggtt gttattttta ttggatattt tagatatgtt 3240 

tttgtagttg tgttttggtt tataatatag attgattgtt ttgattttga ttatttaaaa 3300 

ttggtttaaa aattaaaaga gattgatatt aatttgtgtt gtttattttt ttaaagaata 3360 

tgaatgattt tttttttttt gaaagtgaag tgtagtgttt tattttgggt ttttgtagag 3420 

gttttgtatt ttttgggttt tttgagttgg gatataagtg ggtagttgag tgtagaaagt 3480 

agggatggtg gggtgtatag taggatagtg gggtgtgtag taggatagtg gggtgtgtag 3540 

taggatggtg gggtgtatag taggatagtg gggtgtgtag taggatggtg gggtgtgtag 3600 

tgggatggtg gggtgtgtag taggatgtaa gtttaagatg tatttttgtt taggtatgaa 3660 

aatggatatt gatttttttg gtatttttta attatttatt gtggatgttt tagtgattaa 3720 

gtgatataag ttagtttttt gtttatttgt ttttttaaat agaaattggt gtaggagatg 3780 

aaatttgtag tagaatgttt gaaagtatgt gtaataaaaa ttgtagaggg tggtatggat 3840 

tgattttttt aggaaaattt ttttttaatt ttagatttga atgaattaga aaatagaata 3900 

gtagaggagg gatatagaag ttaaaatgtt attttatttt tattggtttt tttgtagttt 3960 

ggtaaggatt gggagagggt gtttttttaa tttagttgtt tttgttttgg ggattttagt 4020 

ttagtaagtt ttaagttaaa tgtattttgg gttttttttt tttttatttg tttttttatt 4080 

tggtgttttt gtgttgggta tatgttgtat agttatattg gagatgggat tttttgggag 4140 

ttttggaaag tttttgtttt ttgtagttag atggttttag ggaggaagag ggaggtgtag 4200 

aattgagtgg ggttagtttt ttttatatta attagtttgg taaattattt ttaaattttt 4260 

ggttattatg tttagtagat aaagaggttt gagaggttgg gagatattta ttaagaagaa 4320 

ttattaggaa tgggaagttt tttgtttttt gtatatagta gtatatatat gagttgtaat 4380 

aatatatatg ttatgtatgt gttagggggg ttatagatat ttttttttat tttttagatg 4440 

tgtaaggaa c ? 4449 

<210> 352 
<211> 4296 
<212> DNA 

<213> Artificial sequence 
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<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 352 



agaaatttat 
taaaatgtag 
agtaattaga 
tatttaataa 
agtggaggga 
aaatataaat 
tatggtgtta 
tattaaagaa 
ggttagagga 
agaggtatat 
gtttatggga 
tattttaaga 
tttgtatttt 
atggttgtgg 
gttggtggta 
agttatttta 
attttattgt 
gtagtttggg 
agaatgatag 
ggagattttg 
aatgaattgt 
ggaagtaggt 
gttttattaa 
tgttttagta 
tgtatatgag 
tatttaggtg 
tgaaaattat 
tgatttagtt 
aatgtagaat 
gatttttagg 
aagaaaaata 
tgaagtataa 
gtttgtgatt 
tatttttaaa 
atttattagg 
tatttttgag 
gtagttgaaa 
atgagagagt 
tttgtttagt 
tgagttattt 
gttggatttt 
taggtttttt 
tttggtgtgg 
ttgtttgttg 
gttgtttagg 
tttattgtat 
gggttttttt 
ttttgggtgg 
tgtagggatt 
ggtatgtttg 
agtttttatt 
agtggtttgg 
tatggtgatg 
tttagttttt 
gtttttggaa 
ttgatgtttt 
tattgaatgt 
ttttagggag 
ttaattaggt 
aaaaaatata 
atataatatt 
agtgtaaaat 
tttgttatta 



atattttttt 
tttgtttata 
ataattaaat 
atatttatta 



aagatatata 
taaagaatga 
agagaatgtg 
gttatgattt 
aggaagttta 
ttatgaaatt 
gagttggagg 
gtagtgggaa 
gaaaagttga 
ttgagggtga 
gaagagggaa 
gttttggtaa 
attttttgta 
tgttaggtgg 
gaaagatttg 
gtataaattt 
tagaagattt 
tgattatata 
tatttaaatt 
agaaaggaaa 
agggtaagta 
atttttaatt 
attttttatt 
ttgggaagag 
ttttgtttgt 
tgattaatat 
ttttataatt 
atttaatgaa 
taatagattg 
gtatttagga 
gaaaatgtgg 
taaatatttg 
gttggttaaa 
taggggttgg 
tgttggtgag 
gtgtgtatgg 
tgtgtagttt 
tttggttggt 
ggtttggaga 
tgttgggggt 
ggtttgtggt 
gtttagaagt 

ggggtttttg 

aaatgaggaa 
gtgagttttt 
gttttttttg 
tttggaaagg 
agtggtagtt 
ttgtgtgttt 
agttztttgga 
ataagtttta 
ggtatggatg 
ttattgggtg 
ttgatggttt 
tatttatttg 
tatatatatt 
aatatatata 
ttttgagatt 
ggttggagtg 



atttttaaga 
ttttaaatat 
ataatttttt 
aatattgatt 
atatttgttt 
tataaataat 
taatataaga 
aatttatgaa 
tattggggaa 
tggagaaggt 
tttgaagata 
gttattaaat 
atttagtttt 
tttgtgggag 
tagggagaag 
ggtaattaat 
tttttaatat 
agtatgagtt 
ggagttgggt 
ttgtttttgt 
attgattaga 
gtttattatt 
ataaaatatg 
tttgaaagtt 
tttttttttt 
tatttttaat 
attattattt 
gtggtgtaaa 
tttttagatt 
gtatattaaa 
tgatagtgta 
gtagaattta 
gagtatttat 
gtaatttgaa 
atgttttttg 
gagtgaggaa 
gtttttggag 
atgttgagga 
gagttttttg 
agtgatgata 
tttttggttg 
gagtgggtgt 
gggtgtggtg 
ttggtttatg 
tgttgtatat 
tgggtgagtg 
tttttttgtt 
agtttttttt 
tttgaaaaag 
tgtgagtagg 
gaagtttgag 
tgagttgtta 
atttagtttt 
gatggttatg 
agaaaataat 
gatagggagt 
ttagaggttg 
tatgtaatta 
attgttgttt 
atatatatat 
tgtatgtagg 
ttaaaagtta 
tagtggtgta 



aaaggaagta 
ttggtaataa 
tttaaatttt 
atgtgtttga 
ttaagaaatt 
tataaagtta 
tttatttatg 
gattaggagt 
ggttttaagt 
ttttagtaag 
tgtttaagga 
gattttaatt 
tgtttgagaa 
atttttatat 
atttggaatt 

tggtgggaaa 

gagtattggg 
aaaattttag 

"gggggagg 

aatttttttt 



gtggtgatat 
tgtttttttt 
tattgtttga 
tgtttaggga 
tggagttgag 
tagttgttaa 
gaggtgaggg 
tggttgggtg 
ataaagggta 
taaggagaag 
tttggttttt 
agagggttgg 
attgagtgaa 
aagttatggt 
taattttttt 



gggttgtata 
taattaggat 
tatattagga 
tatattgttt 
ttatttgtat 
tttatagttt 
tgtgataggt 
atgggatgtt 
tattgaatta 
atttgagaaa 
tgtatatata 
ttgttattat 
ttttggattt 
gaaagttgag 
ttgttatttt 
tttttgagtg 

gggttggttt 

gagggagaag 
ttttttttgt 
tttttgtgtg 
ttgggggttt 
tatatttggt 
tggaggtgta 
atgagtagtg 
tgttttgtgg 
gtttttagtt 
gtgtttgata 
gtgatattta 
agaaggaaag 
aaaggtgttg 
aagttggtta 
aagtagttag 
ttttgagtta 
ttttttttta 
tgatttttaa 
ggagatgttt 
tagattttgt 

ttgtttgttg 
ttgaaggttt 
aaatatatat 
aatgggggtt 
gatttttttt 
attttggttt 



gaagatgttt 
aagtggtaaa 
gatgaagtaa 
gttatttatt 
ttaggtaaag 
gaattatatt 
atatttgaaa 
atgttttggg 
tgtttttatg 
taagggaata 
ttgttatatt 
gattatttat 
ttatttaggt 
gaatttgtat 
aatttttaga 
tatatgtata 
tttatttggg 
taatttgtaa 
gggaggtggt 
gtgaattgtg 
gtgtgggagg 
aggaaatatg 
gtttttgggt 
agtgttgagt 
tgtatttgtt 
ttttgtgttt 
ttgtatattt 
gttaggggtt 
gggtttgtta 
atgtgtattg 
tgggtttggt 
gttgggtttg 
tgtagtttga 
tttagttgta 
tgttggttgt 
tttgaaggtg 
tgtatttggg 
tttgtttaaa 

ggttggtttt 

aagaaagtta 
aggtgaaatt 
ttggaataag 
gaaattgaag 
ttaaggaaaa 
ataattatat 
tttaattatt 
ttttggagat 
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tatttttggt 
agagttatga 
tttatttatt 
tataatagta 
tggaggatag 
aagaaaagta 
aaagtttgtt 
aagaaaaaaa 
ggaggattat 
ttaaatgaaa 
attttttttt 
tataattagt 
agagtttaga 
agtggtatgg 
ttattgataa 
agtttttttg 
atatttagag 
atgaatattt 
tttatttttt 
gaatttatta 
tttgagagtg 
gagaaagggg 
taaatttggt 
ttattttgtt 
aaagtataag 
ttatatattt 
aattatttat 
gtttattaga 
ttttaaataa 
ttttgattta 
tagatataat 
aaagaaatgt 
tttgaaaatg 
agatgttttg 
tgtatttagt 
gtggtgaggg 
attggtagtt 
ggagagaggt 
tgaagttgag 
tgggatagga 
tgttggtttt 
gtgttgttgg 
tgggaaggtg 
tgggttgggg 
ttagtatgat 
ggggtgttgg 
gtaatttggt 
tttttgtagt 
aggggtgtgg 
gtttgaggtg 
gttggggagt 
ttattttgtt 
aggttagttt 
ttggtttgga 
gttggtttta 
gagaattttt 
atagttttgt 
ggttaaaatt 
aataaaaata 
atataaatat 
tgttatggaa 
ggagttttgt 
tgtttrtggg-- 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
-3ZS0_ 



seq_prot.txt 

tttaagtgat ttttttgttt tagttttttg agtagttggg attataggtg tgtgttatta 3840 

tgtttagtta atttttgtat ttttagtaga gatggggttt tattatattg gttagggtgg 3900 

ttttgatttt ttgattttgt gatttgttta ttttggtttt ttaaagtgtt gggattatag 3960 

gtgtgagtta ttgtgtttgg ttaaaagtta gattttttgt agttatgatt taaataatag 4020 

tatttgtagt tgtggtttgg gtaattgagt atttgggtaa gttatagagt tttttttggg 4080 

ttttagtttt tttatttgta aaatggttag tttgaatttg ggtaatttta tattattttt 4140 

atgtttaata ttttatgtat tgtgtttttt tttttttttt ttttggtttt tttggtgatt 4200 

tggagatgat ttatgtagat tttggatgaa atgtagagga gttgatagtt tatattattt 4260 

ggaggtattt attttattta tttagggata tgggta 4296 

<210> 353 
<211> 4296 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 353 

tgtttatgtt tttggataaa taaaatgagt gtttttgggt gatgtggatt gttaattttt 60 

ttatatttta tttaaagttt atatgggtta tttttaagtt attagaagag ttaggggaaa 120 

ggaggggagg ggtatgatat gtaagatgtt aaatatgaag atgatgtaaa attatttaga 180 

tttaagttgg ttattttata gatgaggaaa ttaaggttta gaaaaagttt tatgatttgt 240 

ttaaatattt ggttatttag attatagttg tagatattgt tatttggatt atagttgtaa 300 

aaagtttaat ttttggttgg gtgtggtggt ttatgtttgt aattttagta ttttgggagg 360 

ttaaggtggg tggattatga ggttaagaga ttgagattat tttggttaat atggtgaaat 420 

tttgttttta ttaaaaatat aaaaattagt tgggtgtggt ggtatgtgtt tgtaatttta 480 

gttatttggg aggttaaggt aggagaattg tttgaattta gaggtaaagt tgtagtgagt 540 

tgagattgtg ttattgtatt ttagtttggt gatagagtga gattttgttt ttaaaaaaaa 600 

aaaatttaat ttttgaagtt ttagaggttt tatatttttt ataataaata gttaagaatt 660 

tttatttttg tatgtatata tatattaata ttatatatat ttatatatat aattatatat 720 

atatttatat atatataata tatatatata tttttttatt tttatttttt ttttaaaaat 780 

ttttagagat agtgattagg tgggtagttt agttaggatt ttaatttttt ggtttttaat 840 

aaatgataat tgtatagaat tattaatttt ttgaagatag gattattttg ttttaggtaa 900 

aatttatagt ttttagtatt tagtaggtat ttagtgaggg atttttggtt ttatttgagt 960 

atttttattt tttatttatt tatgttaagg tgttagtgga gttagttaat ttttttttgg 1020 

aaattgatta tttttttgaa atttgttttt aaaaattttg ggttaagaag ttggtttgag 1080 

ggaaagtgtg gttgttttta ggagttaagg attgaggagt tggttttttg aatgggtggt 1140 

ttagaaagag ttgggtgggt atgtggtatt gttatgggtg gagtggttta ggtgtgttgg 1200 

ttgttttggt agtttaggtt gttgttttgg gttgttgttt tttggttgtt tttagataat 1260 

taatttttta aatttttttt tttgggggtg ggggtttgtt ttgaatgtgg ttaatataat 1320 

gttttttttg tttgtataaa ggggattaaa tgtgttttgt gttttttgta attgaatttt 1380 

tttttttttt tttaagaaaa atttataatt tttgtagtta tgggagttgg gttgtgtgag 1440 

tgttgtgggg gaaatttttt ttgtttttgt ttgggggttg ggttgttggg tttgattgtt 1500 

aagtgtagtg gagaggtggg gattttagga agatttttgg tgttttgttg agtttgggtt 1560 

ggggattatt tatttgattt ttgaatgtgt ggtgggattg tgttggtgat atgtgtttat 1620 

aggatgatgt gtagtggtta taggtttttg agtagttttt gatttatggt agattttgtt 1680 

gtttgttata gattgagttt ttagtgtagt ggatggtgtt tttttggatt tttggttgtg 1740 

t:ttttgtgtt gtgttttttt tggattttgt: gttgggttgg tagtgtagat gtgtgggtta 1800 

gatgtggtgt ttgtttgtta gttaggaggg ggtttggagg ttggtgaggt gtggggaggt 1860 

ttttggtggt tgagggaagt tgtataggag tttggttttt gttttgagtg ggtgtatgtg 1920 

tgggggtgtt gttgttttgt gtgtgtgagt gatttattta attttaattt agtgttgtgg 1980 

gggaaatagg aaattttttg ttaatagttg ggtaggattt ttttttgttt gagagttttt 2040 

tttttttttt tgatgtttag tttttagttt ttttgtagtt gttaattatg ttttttagat 2100 

tagttttttt gagagttttg gttgattttt agttgttttt tattgttttt ttatattatt 2160 

taggagtttt ttgttttaag tatttattta agaatgatta agtgtatata gtttataaag 2220 

taataataga aaatgtttat gtttttttta gtagattaga atatttgttg ttttttttta 2280 

gtttttttta gattgttttt aggtgtttta gagatgtgtt tttaaattgt aagttgagat 2340 

ttaggagtga atattttaat ttattgagtt gtgagtatat ttttttttta aataaaatag 2400 

taatggtaaa ttttatttta ttaaatttgt gttttagttg tgtttatatg tatgtatgta 2460 

tgtgtatata ttgttaagtt gtaaaatgtt tttttttagg ttgaagttta gaggtttttt 2520 

ttaagtttta atgtatatat tgattatttg gaaattttat ttaaaaatgt agattttaat 2580 

ttagtaggtt taggaggtag gtagagattt tgtattttta atgagtattt ggatagagtg 2640 

ttttattttg tattgttttt ttttgggatt gagttagtga gtaattgtaa atgattattt 2700 

attatgaagt gatagtggtg ggaaatgtaa tttttagaat gtatagagta tagtagaaat 2760 

tgtaaaatta aaagtgggtt gggagttatt tgaatgtttg tgtttttatt tttttaatgt 2820 

aggtgaagaa agagaatatt tattttttta tgtgtaaatg gggtaatatg ggagtagaat 2880 
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agtataaatt 

ggtgtatatg 

tgattggata 

tagttttttg 

aagattgtag 

ttttaaattt 

aattttaatt 

gtatttatgt 

ttattaatta 

ttaaattttt 

aggagttttt 

taaatgagag 

aaaattattt 

tgaatgtgtt 

ttgaaagttt 

agagtttttt 

ttggtttttg 

agtagatttt 

tttataatta 

ttttgagggt 

atatataatt 

tttagaagaa 

tattgaatat 

tttttttttt 



tttaaatttt 
ttttgtagtt 
ataaattatg 
gttagtagat 
aggtaagggt 
aatttttaaa 
tatgttttat 
tgaagatgta 
attgttttgt 
tttttatttt 
tatgagttat 
ttgaatttag 
aatggttttt 
tttaagtttt 
tttttaaatt 
ttagtgtgga 
tggattgaat 
gtattatatg 
tttgtgttat 
aggtattatg 
gatgtttaat 
aattgtattt 
ttgagatata 
aagaatggag 



tttttttgtt 

taaatattaa 

tgattaattt 

tttttagtaa 

ttgtgttagg 

ttttttttgt 

ttagtattta 

gaaagtgtag 

taaagttgga 

tttttttgtt 

ttttagttat 

ttttttaaaa 

tattgttttt 

taattttttt 

ttgtgaatat 

tttttttttt 

tataattttt 

tttttttggt 

tttttaattt 

tatttttttt 

aaatatttgt 

gattgtttta 

aatagattat 

aagtatgtgg 



seq_prot- 
agggtaatta 
tagaattttt 
attttttatt 
tttatttaat 
gtttttaggg 
tattttaagt 
agttgttttt 
tgaaattagg 
gtgattttgt 
attagtttat 
ggttgttttg 
tataaaatta 
gagataaaga 
gtaaattttt 
gtttttgtaa 
ttgatttttt 
ttgatgggta 
attatatgtt 
atatttttat 



txt 

gatttgttta 

tttttttttt 

tttaaagata 

gaattttttt 

aataaaggta 

gtttatttgt 

ggatgttttg 

agggttaggt 

taatggagga 

gttattgtta 

ggttttttta 

gttatattaa 

gaagataaaa 

atttggtttt 

tttttttatt 

tttttttatt 

agtttttttt 

ttttttttaa 

tttttattta 



tttatttatt attgtgtttt 
tggatgaatg aatgaattag 
gttgttttat aattttggag 
attttagttg agaatgatgt 
gttttt 



agatattttt 

agatatgttt 

tgattttgta 

atttgagagg 

aatagttttt 

tttattttgg 

ttatttttta 

attttttttt 

agattgagtt 

tggtttgtgt 

tttagttttt 

ttttttgatt 

attagatttt 

tttttgttta 

ttttatgttt 

tagttaattt 

tatttttatg 

tagttgtaat 

attttaaatt 

taagtattaa 

gtaataaaga 

aaaatattgt 

tattattatt 



<210> 354 
<211> 4001 
<212> DNA 
<213> Artificial 



sequence 



<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 354 

atgtttagtt 
ggtttggaat 
agttatagtg 
tggagattaa 
ttgggatgat 
tttatttttt 
aggaaaaaga 
aaattgtgga 
gttttgtttt 
gatgtttgtg 
ggtttagttt 
attattgtat 
gggaagatga 
tttttttttt 
gggtatatta 
tttgtaaagg 
ttttttgtta 
tattgggtga 
agaaatgtaa 
ttttttttgg 
tttttaattg 
gaaggataaa 
aaatgattat 
atagttatga 
ttttttattt 
taggttggtt 
tggatggttt 
atgatttttt 
tttattttta 
tttagagagg 
gttgtggaaa 
gttttttgga 
gtatatggga 



aatttttgta 

ttttattttt 

tttagtttga 

aataaaggtg 

gaaaagggtt 

tgtgtatttg 

tgatgtaagt 

atgaaataat 

ttattaggaa 

ggtttgtttt 

aaagttagat 

ttttttaaaa 

gatatattta 

gttttatgtt 

tttaaagaag 

ttataattaa 

agtaagagga 

atagtgagtt 

atgtggagtt 

tttggtgggt 

gatttttttt 

ggtttggaag 

tttggaatgg 

gtttttaatg 

tagggatttt 

taaggttttt 

taatattttt 

tagttaagtt 

gttttagtga 

atttattaag 

gtatgttttt 

tggtggtttt 

ggggaaaatt 



ttttttagta 

aagtaatttg 

ttttatttta 

gggttgggaa 

tgtgatttgg 

ttgtgtgtat 

atggtataaa 

tgtgagttaa 

gaattggttt 

ttttgtttta 

aataggtaga 

tttgattttt 

tttttgaaaa 

atggtagtgt 

gggagggatt 

gtaatttaga 

attggtttaa 

gttttggtaa 

aaataataat 

aaatttttgg 

atttagtgga 

ttgtgggatt 

agttttagtt 

ttttgagatt 

ttatgtttag 

taaagtttta 

attatttata 

tttattttat 

tttttttgag 

tttagttagg 

agtttgaatg 

agatttttga 

ttaattttat 



gagatgggga 
tttgttttgg 
tatgaagttt 
ttgattggga 
taggtattat 
tggtgttgtg 
gtggttggta 
gagatggggt 
gggttggagg 
tgatgtttag 
ttgttttttt 
agatgtattt 
tatttttggg 
ttgtttaggt 
gaggtttgta 
aaaagaaatg 
aggatatttt 
aaagaaattg 
agggttgttg 
tttagtattt 
gttgggggtt 
ttagtagttt 
tttattagga 
taaaagtttt 
tattgggtgg 
tttttttgtt 
tttaggtttt 
ttatttttaa 
ttgaaaatat 
agtttattta 
taaagtgttt 
gaagtttaaa 
taatttttgt 



tttattatgt 
ttttttaaag 
tttaataggt 
agagatgtga 
ggagtggtta 
tgtaaatgtt 
tgtggtaggt 
agtgggagaa 
gttgtatgga 
tttgtgtttg 
gataaattat 
attttttttt 
tttgtttttt 
tgtaggtgat 
ttaaaataaa 
taggtggaga 
tttttttttt 
gaaatgttgt 
ggttttttag 
tttattttta 
gtttggaaag 
gggttttttg 
tgttatgggt 
agattttaat 
atgtgtaaag 
taggtgttta 
ttaataatgt 
atttgttaag 
atggagttga 
atttagggaa 
ttggagttta 
atttttagtg 
gaggtttttg 



tggttatgtt 

tgtaggtgtg 

aaaatggtta 

tgaaatgttt 

ggggttaaaa 

attttgattt 

gtatgggaag 

atgaatttga 

ggattatatg 

gaattggaat 

taaatgattt 

tttttttttt 

gtatattttt 

ttgtggggtg 

tatttttgtt 

atagtagttt 

tttttttttt 

tgtaagaggt 

attgtgatgg 

tgattgatag 

attgttttag 

gattattttt 

tttaaaatat 

gttttgtgta 

aagtatgttt 

attttgagtt 

aatttttatg 

tttttattgt 

gagtttgtga 

gtgtgttatt 

gtagttattt 

ttagttttga 



2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4296 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
19„80_ 
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seq_prot.txt 

ggatagttgt tatgataagt aagggtaagt aatttgtttg ttggaggaag taaaggaaat 2040 

ggagttgggg aggagggtgt agagttagga tttttgttga tttggtgttg tagatattaa 2100 

tattttgggg tggaaaattt tgtaagttag agttgtgagg gtagaattgg tggaaattat 2160 

tttggaggaa ttttgtattg tgttaaatat gaagggtgga aggaagaaag tttttgtgtt 2220 

tgtttttagt tggatttttt ttttttatta gttaaaatgt tattttttag gaaggttttt 2280 

tgtaatatta tattttaatg ttttttttta gatattttat attatattat tttatttaat 2340 

tttttttata atttttatta ttttgataag atttatttgt ttattgtttt tagtatatgg 2400 

aaatgtaagt tttatgagga tatagaattt ttttattatt ttatttattg ttgtattttt 2460 

gagtgtttat attagtgttg ggtagtaagt aagagtttga taataaatat tttttgaatg 2520 

agggagatag gtttgaagtt tggagaatga gatgtagaag aggtgtaaga tttgttgtgt 2580 

tttttgtagg tggtgggggg gtggtgtagg tgttttaaga attattgtgg gatttggtag 2640 

ggggagtgta ggtgtttttt gttaagatag aagtgtttag attataattt ttagtagtta 2700 

tqaggagttt tagggtttga tgggaatggg aaatttttta attttttatg ttttggtttt 2760 

gtgggttttg tgggttgttt gtgaaatttg atttgggatg tggtggttta attggaaggt 2820 

ggattgaaat tttgtgatag taagaggttt gtagtgattt gtggtgttaa ggaatatagt 2880 

gtttttaaaa gaattggtgt ttgttgtttg tttttttttt tgggagtttt ttgtttattt 2940 

ttagaagagg agggaagtat aggtgggttt ttttagtttt gtgttggatt tttgagaatt 3000 

ttgaagttat tttggttgag gttaattttt gttgtgtttt ttttgtagta tgaagatttt 3060 

ggagatttaa ttgttagttt tggattgttg ttttttagat taggatttag ttttagttta 3120 

tttttttttt tatgtttttt tgatgaataa aaatgtggat tttgaattga tgttattgtt 3180 

ttttgaaagg ggggatttgt tttggttgtt tttagatttt gtggttggtt tagttgtgtt 3240 

ttaggagtta tgggaggggg atttagtttt tttttttatt tttttggaaa tagagtttgg 3300 

tttttttagt gagttgagtt tttgaattga ggagtaagaa ttttttgaaa atataagttt 3360 

ttttgtagaa gaagtaaatg ggagtttttt tgaagaagaa gtgaatgggt tagagttggg 3420 

gtttggaaaa gttatggaag atatttttgg ggaatttgtt gtagaggatg agggagatat 3480 

gtaagtggtg atggtagtgg agtgtggagt ttggggagat gaagtgtgag gttgatttgt 3540 

tttttggttt tgagatttat ttttttaggt ttttgttttt tttgtttggt gatttaggtt 3600 

tattgttttt tatttttttt ttagtgtttg gaattatagt tttttttagt tgttttgatt 3660 

ttttagtggt ttttggttag agtttagtat ttaatttgag aattttttga aaggttgtaa 3720 

gtggtaagga taataatggg gtagggagtt gattattttt gagattttta ttgtaaaata 3780 

gtttagtttt ttaggagagg gatgttttag agttgggaga agtggtatgt agttttttta 3840 

gattgagttt atattttatt tagtagtttg tgttttttat tttttttaag gatttagggg 3900 

ggtttattta ttttagaggt aggtttagtt ttagttttat atttgaaaag tataggtttg 3960 

gttagttttt taattttttt ttgtttttag ggtttttgat g 4001 

<210> 355 
<211> 4001 
<212> DNA 

<213> Artificial Sequence 

<220> , > 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 355 

tgttaggagt tttagaaata ggggagagtt agaaagttgg ttagatttat gttttttaag 60 

tgtagggtta gggttgagtt tgtttttggg gtaggtaagt ttttttgaat ttttgaggga 120 

agtagaagat ataaattgtt agataaaatg taagtttagt ttaaaagggt tatgtgttgt 180 

tttttttagt tttggggtat ttttttttta gaaaattgga ttgttttata gtgaaaattt 240 

tgggggtggt tagttttttg ttttgttgtt atttttatta tttatagttt tttaagaagt 300 

ttttaggttg ggtgttgaat tttgattagg aattattgag aaattgaggt agttgggaga 360 

agttgtagtt ttaagtgttg aaaggaagat gggggataat aaatttgggt tgttaagtaa 420 

agggggtaga ggtttggaga agtgggtttt aggattagag gatagattga ttttatattt 480 
ttttttttta gattttatat tttattgtta ttittattta tgtgtttttt ttgttttttg 
tagtgggttt tttagaggta ttttttatgg tttttttaga ttttaatttt ggtttgtttg 
tttttttttt agaaaggttt ttgtttgttt tttttgtagg aaggtttgta tttttagaaa 

gtttttgttt tttgatttga ggatttaatt tattagggga attaaatttt gtttttaggg 720 

gagtggagag agaaattggg tttttttttt gtagtttttg ggatatagtt gagttagtta 780 

taggatttgg ggataattgg ggtggatttt tttttttggg aggtggtggt attagtttag 840 

agtttgtatt tttatttatt ggggaagtgt ggggagaagg atgggttgga gttgggtttt 900 

ggtttgaagg atagtagttt ggagttaatg gttgagtttt taaagttttt atattgtaga 960 

ggaagtatag tggagattag ttttagttag gatggttttg aagtttttag ggatttgatg 1020 

tagagttaaa gaaatttatt tgtgtttttt tttttttttg ggagtaggta gaagattttt 1080 

gggaggagag gtgaatagtg gatgttaatt tttttgaaag tattgtgttt tttagtattg 1140 

t^ggttgtta tgggtttttt gttgttgtgg gattttggtt tattttttga ttgggttgtt 1200 

gtattttgga ttagattttg tgggtgattt atggaatttg tggagttggg atgtgaaagg 1260 

ttagaaggtt ttttgttttt attaagtttt agggtttttt gtggttgttg ggagttgtag 1320 

ttt'gaatgtt tttattttgg- tgagaagtgt ttatgttttt tttattgagt tttgtggtaa 1380 
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540 
600 
660 



tttttaaagt 
ttttttgtat 
attgagtttt 
agatagtaga 
aataaataaa 
tataaagtat 
gatattttaa 
ttttattttt 
tttttatagt 
gttggtgaga 
gtaggtggat 
tgtaggggtt 
ttttgagttt 
gggatatttt 
ttgggtaagt 
tgtattttta 
tttgggggtg 
ggagatttgg 
aggtagagga 
tttatttggt 
tgagattttt 
attttagtga 
taagttgtta 
tagtttttgg 
gaagtgttaa 
ttggtagttt 
ttggtttttt 
gaaaatgttt 
gtattttttt 
atgtaaattt 
aatttaggtg 
gtttggaggt 
tgggtatgtt 
tggaaaaata 
gttgaatgtt 
ttttttggtt 
tgttttattt 
gtattggtta 
atatagtgtt 
tgtgatgttt 
tttttagttg 
agaattttat 
gttgaggtgg 
atttttgttt 



atttgtattg 
tttatttttt 
tatttgttat 
aaaattttat 
ttttattaga 
ttgggagaaa 
ttgatgagaa 
tatatttgat 
tttggtttgt 
attttgattt 
tatttgtttt 
gatgggattg 
tttaaaagtt 
gtgtttgggt 
ttttgattga 
gtttgggaaa 
agtgggatgg 
gtgtagatga 
gtggagtttt 
gttgggtgta 
ggattttgaa 
aaattggggt 
aggttttata 
ttttgttaga 
attaggggtt 
tgttattgtt 
tttgttggag 
tttaggttgg 
ttttggatta 
taattttttt 
gatgttatta 
atttttaaga 
tgagaattaa 
atttatttgt 
gtgaagtgga 
tgagttggtt 
tttagtttgt 
ttttgtgttg 
agtgtatata 
attaagttat 
atttttgatt 
atagaatgga 
gtggattatt 
ttattaaaaa 



tttttttgtt 
aggttttaga 
ttagtattga 
atttttataa 
gtgataaggg 
atgttagggt 
gaaaggattt 
ataatgtagg 
agaatttttt 
tgtatttttt 
tatttgttat 
gggttttttt 
tagagttatt 
tgggagtgtg 
atttgatgag 
attgttgggg 
aagtttggtt 
tggggatgtt 
ggggaatttt 
gggaattttt 
atattgagaa 
tttattttga 
atttttggat 
tggagaaaat 
tgtttgttag 
tggttttata 
tagtttatta 
ttttttttat 
tttagttatg 
ttttttttga 
tggtgtgaga 
atgagtatat 
attttgaaag 
tatttagttt 
aggggtgggt 
ttttttggta 
ggttgtttta 
tatttatgtt 
gtaagtgtat 
agattttttt 
ttatttttat 
attaggttgg 
tgaggatagg 
atataaaaat 



seq_prot . 
gtttgtagag 
tttgtttttt 
tataggtatt 
ggtttatgtt 
ttgtgaagga 
gtgatattat 
agttgagagt 
atttttttaa 
attttaaaat 
tttttaattt 



ggtgattgtt 
tttttatgtg 
gtttagggag 
ttttttatga 
ttttttttga 
ttgggggtgg 
agagggatta 
aggattattt 
gagttggttt 
gaaataaaag 
tttatagttg 
aatgattatt 
ttttgttttt 
ttaattgaag 
gttgaggagg 
tttatatttt 
tttatttgat 
ttggtagagg 
gtttttgtaa 
atggtgtgtt 
tagggaggga 
tttatttttt 
agtgtaatga 
tgggttaggt 
ttgtaggtgt 
ggaggtggaa 
ttttgtagtt 
attttttttt 



aggaagatga 
tattgtttta 
tttgattttt 
gtgttgtggt 
agttttagat 
tagttgggtg 



txt 
ggtgtagtag 
ttatttaaaa 
taggaatata 
tttatgtatt 
gattaaataa 
ggaaagtttt 
aaatgtaaaa 
aatgattttt 
gttagtattt 
tatttttttt 
tagttttgtg 
tttaagattg 
taggtagttg 
tggtgatatg 
gttatgggtt 
ggtagtgggg 
ttataggagt 
gaatttaaag 
aaagtgtatt 
atgtataaag 
tatattttag 
tgggggtgat 
tttggagtga 
gttgttagtt 
attgttgtaa 
tgttttttgt 
gagaggggag 
gaggttgtta 
aggtaggggt 
ttattttgtg 
aagaagtgtg 
tggaggaaaa 
tgggttgttt 
tattttagtt 
ttgtgtggtt 
tttgaattta 
tttttttatg 
taaattgagg 
gttttggttt 
gaaatgtttt 
ataattattt 
ttatgtttgt 
tagtgtggtt 
t 



gttttgtatt 
aatatttatt 
ataatgaata 
gaaagtaatg 
gatggtgtga 
tttaaaaaat 
gttttttttt 
attaattttg 
atggtattag 
gttttttttg 
ttaggagttt 
gtgttaaaag 
ttgggttttg 
tttttttgga 
tttggttttg 
atttagtgag 
tgtattgttg 
ttgaatgttt 
tttttgtata 
tattgaggtt 
agtttatggt 
ttggggagtt 
tttttttagg 
gtggaagtga 
tttgagaggt 
agtagtattt 
gagagagaga 
ttttttgttt 
atttgttttg 
ggttgtttgt 
tagaaggtaa 
aaaaaaagaa 
gataatttgt 
ttagatgtag 
ttttgtgtag 
ttttttttgt 
tatttgttgt 
tggtatttat 
ttaattgttt 
attatgtttt 
tgtttgttgg 
attttgggag 
aatgtggtga 



Sequence 



<210> 356 
<211> 4607 
<212> DMA 
<213> Artificial 

<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 356 



tgttatttat 
gggatagaat 
gaggagaagt 
attaatttgt 
tgtgtgtagt 
ttgtttatgg 
tgtttggttt 
tgtttgggtt 
tttttgggtt 
tgtttgggtt 
tatgtggttg 
ggtttaggtt 
tggagagttt 



ggaatgaaga 
attagtattt 
agagatgata 
tttttgtgat 
tggtattgtt 
tatttttttg 
ttatatttgt 
ttgtatttgt 
ttgtatttgt 
ttgtatttgt 
tttttggttt 
tttgttgttt 
gaatattttg 



ttttaaattt 
aaattattta 
aagtagttat 
aaagtaataa 
gatatttatt 
ggttttgtat 
ttggggtatt 
ttggggtatt 
"ttggggtatt 
ttgggtattg 
tgtttgttgg 
attaggtatt 



tagttatgag gataattatt 
tttggtatgt ggtagaggag 
attatttatt agtttatagg 
taaagagttg atatttttta 
tttttaaggg tattttttta 
ttgtttgggt atttgttggg 
tgtttgggtt ttgtatttgt 
tttttgggtt ttttatttgt 
tgtttagata ttgtatttgt 
tatttgtagg atttttaagt 
ggatgttggt attgaggatg 
tgttgtttgg ggagaatttg 
JtSflttg ttgtL_tHgt:tgtggg 
Seite 308 



tttttatttg 
aagagaatta 
taatagaatt 
tatttttatt 
gattttttat 
gttttgtatt 
ttggggtatt 
ttagggtatt 
tttgggtatt 
tgtttttatt 
ttttgtttgt 
tagagtaagt 

tgttattttg- 



1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4001 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
^780 



seq_prot.txt 

tgtttgaaaa tttgtgggat ttttgggtgt gtagtaggtt gtaatagttg atgttggttt 840 

ttgaggttgg aagttagaag gtggaagtga attgtagttt attagtgttg ttggtttttg 900 

tgtggtattg tggggtttgt agtttttgtg ttgtagggtt ttaaaggaaa tgtttatgtt 960 

ttttttgatt agggattttt gatttgagaa ttttatttta aaggttggga ggtttttgag 1020 

tatttttagt tagggttgtt gataaaaatg tagaaagtat agtaaaattt gaattttaga 1080 

tttataataa atttagttat aagtatgttt ttaaatattg tatgggatat gttaatatgg 1140 

aaaaattatt tgttagtttg aaatttaaat ttaattgagt gatttgtgtg tttgtgtgtg 1200 

tgtatatatg tatatatata tatttatatt tatatgtaaa tgtatgttta tatgtaaata 1260 

tatgtttatt tataaatata tttttaataa gtaatatggt gtttgttgta tatatattat 1320 

attgtgtatg taatgtataa gtatttattt tgtttgtttg gggttttgtt tgtttttgtt 1380 

gagtttgatt tttttatttg ttgtttggtt tttgttttat gttttagtgt tattgagatt 1440 

aaggagagaa tgaatttgtt gttgattggg tagagtgagt gtgtggattg tggttattgt 1500 

ttgtttatta tttgtgtgta tttgggttgg tattgggtga agaattgtgt gggtttggga 1560 

tttgggggtt tagagggagt gagtttttgt gtgggtgttt ggtttgtagg ttttgtaggt 1620 

ttaggggtgt gttttgtttt tatttttatt ttggattttg gttttttttt ttagatagtg 1680 

gtttttttta tttttggttt ttgtaggttg ttagtagttt gtgttaggtt ttgttggtgt 1740 

ttttagggtt ttttagattg tgtagatttt gatatttttg tttggttttg ggttttggga 1800 

gttgagagtt ggttagggtt ttgtttgtat ttttgggtgt ttagttttgg gtttgttttt 1860 

tgtggatgtt ttaatttttt tggttgaatg gatggtggtg tgtgtgtgtt ttattttggt 1920 

ggtgttggtt ttttttgttg ttaaaattag atttaaattt ttgtatggga tttgtttttg 1980 

ggtttttatt ttgtgtgttt agtaaatagt gggtgagtta tgaagatgtg tgagttagtt 2040 

ggattttttt tgttaggtgt ggatttgttg tggttagaga atttagtttg tgttagtttg 2100 

gtttgtttgt gaagttatgg gttttattga tgtgattttt taagatgtgg gggttattat 2160 

gggtagagga tattggtttg gagttagatt atgggtttta taagtattag attataagta 2220 

gtgttgttat tgagagttgt ttggaatttg tttagtatgt tgggtttttt agttagggtt 2280 

tggtgtatgt ggttgagggt tttggaagtt ttgatggttt aggaggagta ggtgggtggg 2340 

gtggtgggtg ttgttggttg gtagagagtt ttggtttgat ttagtgtagg tttggtgtgt 2400 

gtagagaata attttaagtg tattgatgtt tgtgagtttt ttttaaatat tgaatgggat 2460 

ttagagtttg agtttatagg tggtggttgg gggagggagt agggtgttgg ttgttgtttg 2520 

ggagtgtttg tgttttgggt gatttttgga aggatgtggg gtttaaattt tggttggggt 2580 

tgggagagta gtttttagag gttttttgtg ggattttttg ttgggtggga ttgtggtttt 2640 

ataggagaag tgggtggtaa gttttgtttg gtggaaagta gttgtttttt tttttttggg 2700 

tttggggtgg tgttttttat ttttgttttt tgttttttat ttttgttttt tgttggttat 2760 

attttttgtt ttttggattt taagtgtttt gtgtgttgag gagtttagtg ttagtggtgg 2820 

tggttaggag agatttgggt gttaggaaag atgggttgtt tgggggatag tagggagttt 2880 

gggggaaatg taggtgttgg gtatagagtt ggtattggtg tttttagttt tgttgaagat 2940 

tgtggttggg tttggtttgt gggaggggtt ttggtgttgg atttgttttg gttttgtgtg 3000 

ggtggttttg ttgggttttg taggagtgat gtgtgttaaa aggtggtggg aaggaggtgg 3060 

ggtagagtgt gtttgggatt ttgatttgga tgtggttagt tggagaggtg gagtgttggg 3120 

aggagatttt ggttttgttg tgatttggtg gtttgtgttg tttttttgtg tgttgggtta 3180 

aaaaggtgtt aatgtttgtg gttgtttatt ttttgtggtg tttttttttt ttgtgtttat 3240 

ataatttgtt taggggttgg gtagtttgtt ttgttttttt gtttgtgtat ttgtttggag 3300 

gtttgtgtgt ttgtgaaggg gatgtagtga aattggggtt tgtgttaggt tagttgggat 3360 

ggatgttgat gtttggggtt gtgatggttg taggtaggag gtttagggtt ggggggtggt 3420 

ttggttttgt gggtgggggt tggagtgtag tgttgggtag gtatttgggt ttgtagtttt 3480 

gaagttggga ggtgagggga gagtgattgg ggatgagttg ggataaggtg atataggggt 3540 

ttttttggag ttggattggt ttttgggatt tggtgtttgt gagaggttgg agtggttaga 3600 

gtttagtttg tgaggagatg tgggttttgt ttttagtgtt 99ttgttttt g^gtaaag 3660 

atagttttgt aggggttttg ggagggtttt ttttttgttg tttttttttt attttgggtt 37£U 

ttgagggttg ttgggagggt aattttggga agaggttggg gtgtggggtg tgggtgtagg 3780 

tggaaattgt tagtaagttt ttttttgttt gtgtgttttt tttgatttgt agggttgtgt 3840 

taattttgag gttttagttt ttttgaggag ttagggttag gttttttttt ggatagggag 3900 

aaggatttgg gtgggggttt tgatttatgg agttggttat taagtggttt tgatggtttt 3960 

ttgagggata gttttttgtg gttttgagtt tgtttgttga gggtttttgg tttgtttttg 4020 

gagtggtttt aggtagagaa agtttgtgaa gaaatggttt gggttggttt ggagggagat 4080 

attttatgtt tttttagttt ttgggttgtt tttttttgta gtttttgttt tttttggggt 4140 

ttggatttgg ggagtgatga ttatttttgt ttagtttgta ttttggtttg gatgttagga 4200 

aataagttta tgtagtatgt atatgtttgt tatataaata ggggatagat agatgatttt 4260 

taattagtaa gggtgtaggg aaaagtaatg tattttaaat ttttgattag aggttatttg 4320 

tttttaaaga tgttgttatt tgtttattta ttgtttggat atttggaaat ggtttaggtt 4380 

tattaatata atgttttggt ttttgttgtt ttgttttttg ttgttgttat tgttgtttat 4440 

ttgtttagtt ttagttttgg gggaggagta aataaagtgt gtggtttttg gtatttattg 4500 

agtgttgagt tatttttttt tggatttatt tggggaaaga ttaaaaagta ttttattaag 4560 

aataggatat ggtgtttgaa atgttgttat atatgaatgt atgtatt ; 4607 

<210> 357 .. r > " -; " ' 

<211> 4607 •<* • • •■ • ■ • ■ ••' • » 

<212> DNA .'I '>:■». ■ S y . -pr v 
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' r » * t 

I, 



JO 



<213> Artificial Sequence 



seq_prot.txt 



<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 357 



aatgtatata 
tttttaattt 
aggttatgtg 
atagtaataa 
tttaaatgtt 
gttagaagtt 
tgttttttgt 
gttaaaatat 
taggggttgt 
ttggtttggg 
ggagtttttg 
ttgtttagta 

ggggggtttg 

ggttggaggg 
tttgtatttt 
gaggggatag 
ggtggttggt 
gttttttgtg 
tttgttttag 
taggtgtttg 
tttgggtttt 
tggtgtgggt 
tgtgggtggg 

ggggaggtgt 
gggaaggtag 
ttttttagtt 
gttgtttttt 
agtaggtttg 
ttggggatgt 
ttttttagat 
tgggtttttt 
aataggggtg 
gaatggttgt 
tgtttggtag 
tttgggtttt 
gtattttgtt 
tttggaagga 
gtgttaggtt 
tttttttagg 
ggatttgata 
atgtttatgg 
tgttttggaa 
attgggtttt 
atttttatgg 
tatgtagagg 
gtgtgtgtat 
aggttgggtg 
ggttaggtgg 
ttggtgtgga 
agaggattgg 
gtggattgag 
tgattttttg 
tttatgtgtt 
ttggagtgtg 
aaaattttaa 
gatagatatt 
atatatattt 
ataggttgtt 
ttttgtgtaa 
ttttattgtg 
tggtttttga 



tttatatatg 
ttttttgaat 
ttttgtttat 
aaaataaaat 
tagatagtga 
tggggtgtat 
ttatgtagta 
aagttgagta 
aggaggaggt 
ttattttttt 
atagataaat 
attaatttta 
gttttggttt 
agtgtgtggg 
ggtttttttt 
taggaggagg 
gttgagggta 
agtgttaagt 
tttgtttttg 
tttagtgttg 
ttatttgtag 
tttggttttg 
gaggtagggt 
tgtggggagt 
tgtgggttat 

ggttgtgttt 

ggtgtgtgtt 
gtgttagggt 
tggtgttgat 
ggtttatttt 
ggtgtgtggg 
aggggtgggg 
tttttgttaa 
aggattttgt 
atgttttttt 
tttttttttg 
gtttgtgggt 
aggttgaagt 
ttattggggt 
tgttgggtga 
ggtttgtgat 
agttgtgtta 
ttggttgtag 
tttatttatt 
tttgggttta 
tattatttat 
tttggaggta 
ggatgttagg 
ttattagtgg 
ggtttggagt 
tgtttgtgta 
tttggtgtta 
tgttttgttt 
aggtaagggt 
gtaaatgaaa 
gtattattta 
atatataaat 
taattaaatt 
tatttgggaa 
ttttttatat 
ggtaaggttt 



gtaatatttt 
aaatttaaag 
ttttttttta 
aataaaaatt 
ataaatagat 
tgtttttttt 
gatgtgtgta 
aaggtaatta 
ggtttaggag 
atgggttttt 
ttagagttat 
tgggttaagg 
tttagggaag 
tggggaggag 

tggggttatt 

gtttttttgg 
ggatttgtgt 
tttaggggtt 
attgtttttt 
tgttttagtt 
ttgttgtagt 
ttgtgttttt 

gggttgtttg 

aggtggttgt 
tgagttgtgg 
aagttggggt 
gtttttgtag 
tttttttgtg 
tttgtgtttg 
ttttgatatt 
gtgtttggaa 
aataggggtg 
gtagggtttg 
ggagagtttt 
aggggttatt 
gttgttgttt 
gttggtgtgt 
tttttattgg 
ttttagggtt 
gttttgagtg 
ttggttttga 
gtgaagtttg 
tgggtttgtg 
gtttgttggg 
gttttggtaa 
ttggttgggg 
tgagtaggat 
gtttgtgtga 
tttgtgggag 

gggggtgaga 

ggagtttgtt 
gtttagatgt 
agttagtggt 
taggtggtag 
taaatattta 
ttaaagatat 
ataaatatat 
tgaattttag 
tatatttata 
ttttattagt 
ttgggttgfla 



aaatattgtg ttttgttttt 
aggggtggtt tagtgtttag 
gagttaaggt tgaataaata 
aaagtattgt gttaatgagt 
ggtagtattt ttggaagtaa 
tgtatttttg ttggttgaag 
tattatatgg gtttattttt 
ttgtttttta agtttaagtt 
ttaagggggt gtgaggtgtt 
tttatttggg attgttttgg 
agggagttgt tttttgggaa 
tttttgttta gatttttttt 
ttgaggtttt gggattggta 
tttgttggtg atttttattt 
tttttaatgg tttttggagt 
aatttttgtg gggttgtttt 
ttttttgtag gttagatttt 
gatttaattt tgagggagtt 
ttttattttt tagttttgga 
tttgtttgtg gagttgaatt 
tttgggtatt ggtgtttgtt 
tttgtgggtg tgtgagtttt 
gtttttaggt gggttatatg 
gggtgttagt gtttttttag 
tggggttaag gttttttttt 
tttgggtgtg ttttgttttg 
agtttggtga ggttgtttat 
ggttagattt gattgtgatt 
atgtttgtgt tttttttgga 
tgggtttttt ttggttgttg 
tttaaggggt aggggatgtg 
aggggtgttg ttttaggttt 
ttatttattt tttttgtgga 
tgggggttgt ttttttaatt 
taggatgtga atatttttgg 
gtgggtttgg gttttggatt 
ttggagttgt tttttgtgtg 
ttagtgatat ttgttgtttt 
tttgattatg tatattaggt 
gtttttagtg gtggtgttgt 
attgatgttt tttgtttatg 
tggttttgtg agtgagttgg 
tttgatgggg agggtttggt 
tgtatggggt ggggatttaa 
tagaaaaggt tggtattgtt 
aggttggggt gtttgtgggg 
tttggttggt ttttagtttt 
tttggggggt tttagaggtg 
ttaggggtgg agggggttgt 
atgaggtatg tttttgagtt 
ttttttgggt ttttaggttt 
gtgtgggtga tgaatgggtg 
aaatttgttt tttttttgat 
gtagaggggt tggatttagt 
tatattatat atatgatata 
atttgtaggt aaatatatat 
atatatgtat gtgtatatat 
attagtgagt aatttttttg 
attagatttg ttgtgaattt 
agttttagtt ggaggtgttt 
.aatttttggt-Jtggaagaaa^ 
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aatgaaatgt 
taagtgttag 
aatagtaatg 
ttgagttatt 
atgatttttg 
attgtttatt 
tagtgtttag 
ttgggaaagg 
tttttttagg 
agataggtta 
attattgaaa 
tttgtttaga 
tagttttgta 
gtatttgtgt 
ttggggtgga 
tggtgttaaa 
ggttgtttta 
tttgtgttgt 
gttgtgagtt 
gttttttggt 
ttggttggtt 
tgggtgggtg 
ggtgtgggga 
tttggtgtgt 
ggtgttttgt 
tttttttttt 
gtagggttga 
tttggtagag 
ttttttgttg 
ttattagtgt 
gttggtgggg 
aggaggaggg 
gttatggttt 
ttagttggag 

gtggtggtta 

ttgtttggtg 
tattagattt 
gtttgtttgt 
tttggttagg 
ttatggtttg 
gtgattttta 
gttggtgtag 
tgatttgtat 
agatgaattt 
ggagtaggat 
agtagatttg 
tagaatttag 
ttggtggggt 
tgtttaggga 
tgtgaaattt 
tagatttgta 
gtggttgtga 
tttagtggta 
aaaagtaaat 
atatatgtgt 
ttatatgtaa 
atgtagatat 
tattagtatg 
gaaatttaaa 
aaaggttttt 
gtgggtgttt- 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 — 



tttttaaagt 
tgttgatagg 
ttgttgtagt 
gtggtaggtg 
gttttttttg 
ttttggtgtt 
tggggatttt 
ggtgtagtgt 
ggtggagggt 
ggtgtggggt 
ggtaggtgtt 
ggaggtgttt 
gagatgttgg 
gtaaattggt 
ttttattata 
agttgttttt 



tttgtggtat 
ttgtagttta 
ttgttgtgtg 
gtggttatga 
gatggtaagt 
agtgtttttg 
gtaggtgtgg 
ttgagtaagt 
ttggggaggt 
ttaggtaggt 
tgggtaggtg 
ttggaaaggt 
ttttttgttg 
aagtggtgtg 
tgttgaataa 
atggttggga 



aagaattata 
tttttgtttt 
tttaggggtt 
atttttttga 
atttggtgag 
gtgggtaggg 
tgtttggata 
gttttagata 
gttttaagta 
attttaagta 
tgggatttag 
gggtattggt 
ttattttgtt 
gttattttgt 
atgatttaaa 
tttgaggttt 



seq_prot . 
agttttataa 
ttgatttttg 
ttgtgggttt 
ggtatttaag 
tggtggggat 
ttgggggtgg 
ggtgtggggt 
ggtgtgggat 
ggtgtggggt 
gatatggggg 
gaagatattg 
agtgttgatt 
ataaaaggta 
tatttttatt 



txt 

tgttgtgtgg 
gttttggagg 
ttgggtgtag 
ttttttagtt 
ttgggttatg 
ttgtgtgggt 
ttgggtaggt 
ttaggaagat 
ttaggtaggt 
ttggataggt 
tagataggtg 
gtatataggt 
aattgatgat 
ttttttttaa 



tattagtatt ttatttttag 
ttgttttatg ggtggta 



aagttggtgg 
ttggtgttgg 
ggtggtgttt 
tattttgtaa 
ggtaggatat 
ggaggtagtt 
atttggggta 
attttagata 
attttaagta 
gtaggatttt 
gggggtttgg 
gagggtatag 
tttgttgttt 
tttttttttt 
gtaaggaagt 



<210> 358 
<211> 4453 
<212> DNA 

<213> Artificial sequence 

<220> . , . 

<223> chemically treated genomic DNA CHomo sapiens; 



<400> 358 

ttggttttga 

gtaagtatag 

taatttgtat 

aggagttttt 

ggtggattta 

ggaaagggtt 

gagtgagttt 

agatgggggt 

tgtgtgtgtg 

gtagttattt 

tttatttgaa 

taaaaaatgt 

taagagtgta 

ttttaatttt 

ttagaattgt 

aattgtttat 

agtttatgtt 

taatttttat 

ttggttttgt 

aataagtttg 

aatagatgta 

gtatgttttg 

gggggtttga 

gattagagga 

tttttttttt 

ggaggaagta 

tttttgttgg 

ttgggttgtt 

ttggggtggg 

gttttggagt 

gtagttattg 

tttttgagtg 

tgtatggagt 

ggagtttgga 

ttttgtgttg 

tttttttttt 

tggtgtgttt 

tttgattgait 

ggagttgggg 

tagggtggta 

aggttgttgg 



agtttatagt 

aagttttttt 

tttttttagt 

ttttttttta 

gatttaaggt 

taaagttgtt 

tattgggttt 

gatagaatta 

tgtgtgtgta 

atatatggtt 

tagaataaat 

ataattattt 

aaaggataaa 

tttatttaga 

ttatatgttt 

tatggaaatt 

ttaggatttt 

taagttttgt 

tttttttttt 

gttttgtttt 

tagagggagt 

tatatttgtt 

ggtagtttta 

ttttaggagg 

tgtaatttgg 

tggagaattt 

agattgtgtt 

gagtgttgta 

ttttgatata 

ttgggttagg 

gtgagtgttg 

tttgtttttt 

taaggtgttt 

gtttagagag 

tagttgtttg 

ttgtagagtt 

gtttttttgt 

gttgtttaag 

ttttttggat 

gtggaggttt 

§attgtttta 



attgttgatt 
agaagatagt 
aggtagttgg 
tttttttggt 
ttagatttgt 
agtggattgt 
aaaatgattt 
tagaattttt 
tatataaatg 
tgtttttttg 
aaggttttat 
ttttggaatg 
tagtgaaaaa 
ggtgagaata 
ttattgtttt 
ttttaaaaga 
gtttttataa 
taaaaatgga 
ttgatagtat 
atttattggt 
ttttataggt 
gtgttgatga 
ggtaagttgg 
ttttagagtt 
ggattttttg 
ataagttttt 
ttttttaaat 
agatagggga 
gagtaggtgt 
gtttatttgt 
tggttttgag 
ttgttttggg 
tgttttgggt 
gagagagata 
tttggtttgt 
gttgtttagt 
gttttggttg 
gatttttgat 
tttatttata 
gagaattatg 
gtttgtgggt 



tagtttgttt 

gggttatttg 

tattttgagt 

attttttttg 

aaggtaggaa 

tgggatttag 

taagttttgg 

gttatgtttt 

tttgtttatt 

gaggataatt 

gatgaagtaa 

ggttatatag 

taaatttttt 

gaatttttat 

tatttttatt 

tttgttaata 

ttttgtaata 

ttttaaatta 

tattaaataa 

tgtgttggtg 

aggggttttt 

agattttata 

ggtttagtgg 

gggtgaggta 

tattggggta 

tgatttttta 

ttttgtgagt 

gggagttggg 

tgtgggttgt 

ttttgtagtg 

atttttgggt 

ttgtttgggt 

gttttttgtg 

gttggggagt 

ttttttgttt 

gttttgattt 

tgagtgattt 

tagtattagg 

gtagggttag 

ggggttggta 

atttggtaga 



tttggaaggt 
ttttttaaaa 
ttttggttgg 
tttttttttt 
aatgttgtag 
tttttttttt 
tttttgatat 
tttaagtgtg 
tttaggtagg 
ttatttgata 
aatattaaat 
agatgtgttt 
ttttattttg 
attttttaga 
tttgtttgta 
ttttaatagg 
ttatattatg 
agttgtaaat 
atttttttat 

atatttgggg 

ttgtttgtgt 

gttttattag 

ggagaagttg 

gagagttttt 

ggtttttggt 

gtttagatgt 

gttgtggaag 

tgggagaggg 

agtatagtgt 

ttggtttgtg 

tggatgtgtg 

tttttgggtt 

ggtgttgatt 

ttggttattg 

ttttgttttt 

tgttattatg 

taaagtgagt 

ggagaggagg 

atttttttta 

ttggttggta 

tgaagtttgt 



tggtagttta 

gttgaaaggt 

ggtagagtaa 

gttattttta 

gtttaggttg 

tttattaaga 

taggggaaag 

tttagagatg 

aagggtggat 

aataattgtt 

atatatgtat 

tttaaaatgt 

ttttttagtt 

atttttatag 

taaataaatg 

aagtattaat 

atatttaatt 

ttttagtaat 

tgttgaaagt 

attgttattg 

ttttgggaga 

ttgtgggtaa 

tagaagaatt 

tgtgtgtttt 

taggtgtatg 

tgttgggttt 

tatgtggggt 

aggggtggtg 

ggagattgta 

tttttttgtt 

gtggttttag 

ttttggtggt 

taggttgttt 

tgggtatttt 

tgttttgatt 

agagttttgt 

gtgtttttgt 

ggttgtttag 

ggaaatggga 

agggaggaag 

ttgggttaat 



3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4607 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
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ttattttttt 
aggtgagagg 
agtttttgga 
tgaattttat 
ttgggaaggt 
tattattttt 
ttaaatggag 
ttaaagaaat 
agagaaattt 
tggaggtagg 
ttatgaaata 
ttggtaggtt 
gtttgtaatt 
aagattagtt 
ggtatggtag 
tgaatttggg 
gatagagtaa 
tttttatatt 
gaatggttat 
ttggaatttt 
gttgggtttg 
ttttttatag 
tataagagaa 
tttgggttgg 
tgattttggt 
gattgtgtag 
tttatatgtt 
atgttttttt 
tttttttttt 
taagattatt 
taggaggtat 
ttgggtgatt 
ttttgtttta 
ttttttattt 



tggttggaaa 
gagagggaag 
gtatgggatt 
taagttttat 
tttaggggat 
attttttttt 
gaatatgtgt 
ttggagggta 
ggggataggg 
gtgggtgata 
gtggttatat 
tgaaatgata 
ttagtatttt 
tggataatat 
tggatgtttg 
aggtagagat 
gattttattt 
tttttatttt 
ttttattgag 
gttattgttt 
gggaaatata 
ttttttagaa 
Qtgtggtttt 
aatgatattt 
gttatgtgta 
atggtgagta 
tttttatttt 
ttatttgttt 
ttagaagaat 

gggggagaat 
tgggggggtt 

agtgttatat 
ttttaagtat 
gtt 



tttatggttt 
agtgggtttt 
tttgatgaag 
gtgagtattt 
attttttttt 
gtttatttta 
gtttaataag 
gtattgtgaa 
agggatgggt 
tttttatttt 
aaatgtgagg 
tttttaaaat 
gagaggttaa 
ggtgtaattt 
tagttttagt 
tttagtgagt 
taaaaaaaaa 
tttgtttttt 
gaaaggttag 
tttagtaaat 
attattgtag 
attttgttat 
tggttgagtt 
tttatttttt 
ggtgggttta 
ttattgattt 
attataggag 
ttttaagata 
taaaatttta 
ttattattat 
ttgttattta 
attgttttat 
attttttttt 



seq_prot . 
tttatttgag 
gggattgggg 
tttttttttg 
atttttataa 
tgttttttgt 
tttttgtttt 
tattttttta 
ataggtatgg 
gggaggtaag 
tatgtgataa 
tggggttgga 
gtttgttggt 
ggtgagtgga 
tgtttttatt 
tatttgggag 
tgagattata 
aaataaaagt 
tgtagataag 
tattgatatt 
gtattatgtt 
gtgaggtggg 
tatttttttt 
ttttttgagg 
tgtgttgtta 
aagttgtttg 
gttgatgata 
gattgaggag 
tttttttgtt 
gtgtggttaa 
tgagaattag 
tgtgtgtgga 
gtatggtttt 
tttttttagt 



txt 
aattagatat 
ttagtttatt 
aatttttttt 
tagttggttg 
tgtagggttg 
ttttagtgaa 
atatttattg 
ggatttttat 
agtaggtagg 
gtataaatat 
aggagatttt 
agttgggtat 
ttatttgagg 
aaaaatgtaa 
gttgaggtag 
ttattgtatt 
tagttggaat 
ttaaaaattt 
atgggttggt 
tatagatttg 
ggtaataagg 
ttttagaggg 
ggaggaggta 
ggaatttaga 
tttaagagtg 

gtggggtgga 

gtggggggtg 
tgttttttag 
aagattttga 
ttttggtttg 
ggtagtttta 
gggttttttt 
aaagtgtttt 



gaatagggtg 
tttattttgg 
agggtagtaa 
tatagataag 
tgttattttt 
ttgtgattgt 
gtgtaattgt 
tgtaattggg 
agttaggagt 
atatatatgt 
gtttagtttt 
ggtggtttat 
ttaggagttt 
aaattagttt 
gagaattgtt 
ttaattgggt 
gttttttttt 
gttatgaggg 
tttgtttgtt 
atgtttttta 
attaaaagtt 
ttggttatag 
gggaaggttt 
taattggagg 
tatggtgtat 
aggggataaa 
tttgagaggg 
gaaaaaagtt 
ggttttgttt 
tggttattta 
ttagtttttg 
tttttgattt 
gttttatttt 



<210> 359 
<211> 4453 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 359 

ggtagatgag 
gggtggggta 
gttgattatt 
ttggtgtttt 
tttaataatt 
tggaggagag 
taggagaaag 
ggggatatgt 

tatttgtgta 
agtattaggg 
attttaattt 
tatttttttt 
aagttgtagg 
ttttaggttt 
ggtagagttt 
aaagtgatta 
gagaatatga 
gttttgtttt 
ttttttaggt 
ttattattat 
ttaggttggt 
tgggattata 
ttagatttgt 



ggagaaatga 
gaagagttga 
tagtaagggt 
ttgtagatgg 
ttaaagtggg 
gagggttttt 
tatttttttt 
aagtttgttt 
gttatgtatt 
ttgtttttgg 
agagggtttt 
gtgttatggt 
gagggttttt 
agttgaagag 
tagggtaggt 
ttttttttat 
gaaagagaag 
gttgtttagt 
ttaagtaatt 
gttaggttga 
tttgaatttt 
agtgtgagtt 
tagaagattg 



ggtggaatat 
agaggagaaa 
tgatgaggtt 
ttgtaaatta 
gttttagagt 
tttttggagg 

gggtattttt 
ttttttattt 
atgtattttt 
ttgtttgggt 
ttttgttttt 
tagttttttg 
ggtttttgtt 
tattagattt 
agggttggtt 
agtaggtttt 
aatattttaa 
tggagtgtag 
tttttgtttt 
tttttgtatt 
tgattttaaa 
attatgtttg 



tttgttggga 
tttagggttg 
gtttttatat 

gggttggttt 

tttttggtta 
ataaatagag 
tattttttta 
tattgttatt 
ggataagtag 
ttttggtaat 
tttttttagg 
ggagaagggg 
gtttttattt 
gtgggtatgg 
tatggtgtta 
tgatttattt 
ttaattttta 
tggtgtgatt 
agttttttaa 
tttagtagag 
tgatttattt 
gttgttaatg 
ttttttaatt.- 



aggagaaagg 
tatatgaagt 
atgtaggtga 
ttgatggtgg 
tattgaaatt 
ggatgtttta 
gtttttttgt 
agtaggttag 
ttttaggttt 
atagaaagat 
gaagatttaa 
atggtaataa 
tatttgtagt 
tatgtttgtt 
gtgttggttt 
ataaggggat 
tttttttttt 
ttagtttatt 
gtagttggga 
gtagggttat 
attttgattt 
gatattttaa 
ttattttat-a 
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gatgtgtttg 

agtgtgtggt 

tagagttttt 

tgaatttttt 

ttaatttttt 

gggaggtagg 

gatggaataa 

tgatgtttat 

atttgtatat 

aggggatgtt 

ttagaggttg 

ggtttttggg 

aattatgttt 

gaaggatagt 

ttttttggta 

aggaggatga 

ttgagatgga 

gagatttttg 

ttataggtat 

attatgttgt 

tttaaaatgt 

agatgttgtt 

tttgtgtggt- 



2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4453 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



seq_prot. txt 

tattgttttg tgagtgtgtg tgtgtgttta tgtttgttat atagggatga agatgttatt 1440 

tattttattt ttagttttta atttttgttt gtttttgttt tttatttatt tttttttgtt 1500 

tttaaatttt ttttttagtt gtagtggaga tttttatatt tattttatag tgttgttttt 1560 

tgaatttttt tgggtagttg tattagtgaa tgttggagaa gtatttgttg gatatatatg 1620 

tttttttatt tagatagtta tagtttgttg gagagaataa aggtggggta agtgaggggg 1680 

agtggaagtg gtaaggggtg gtgtagtttt gtagtagagg gtagggaggg gatgtttttt 1740 

gaagtttttt taatttgttt gtgtagttaa ttgttgtagg ggtggatatt tatatggaat 1800 

ttgatgaagt ttattgttgt tttggaagag atttgggagg aggttttatt aaaggtttta 1860 

tgttttaggg attttagggt gagggtaaat tggttttaat tttaaaattt attttttttt 1920 

tttttttttg ttttattttg tttgtattta gtttttaaat ggaagattat gggtttttag 1980 

ttaggagaaa tggattgatt taagtaagtt ttatttatta gatgtttgta ggttggggta 2040 

gttttggtgg tttttttttt ttttgttagt tagtgttaat ttttgtggtt tttaagtttt 2100 

tgttgttatt ttgttttatt ttttggggag agtttggttt tgttgtggat ggaatttgga 2160 

ggattttagt tttttgagta gttttttttt ttttttggtg ttgattagag gtttttgggt 2220 

agtattagtt aaagtaagag tgtatttatt ttggagttgt ttatgattag gatgtagaga 2280 

agtaggtgtg ttagtagggt ttttatggtg gtgaggttgg ggtgttagat ggtggttttg 2340 

taaaggaagg agaagttagg gtaagaggtg gaggaatggg aaggtaggtt aggtgggtga 2400 

ttgtagtgta ggggagatgt ttgtggtgat taggtttttt agttgttttt tttttttttg 2460 

ggttttggat tttgggtagt ttggattggt atttgtgggg gatgtttggg atggggtgtt 2520 

ttgattttgt gtagttgttg gggagtttag ggagtttggg tagtttaggg tgggggaggt 2580 

agatgtttgg gagttggggt tgttgtgtat ttggtttggg gattttagga ttgtggtatt 2640 

tattggtggt tgtggtagga gggtgtgagt tggtgttgtg gggataggtg gattttggtt 2700 

tgggttttgg ggttgtggtt tttgtattgt gttgtgattt gtggtgtttg ttttatatta 2760 

gggtttgttt tggtgttgtt tttttttttt ttgtttggtt tttttttttg ttttgtagtg 2820 

tttagtgatt tggattttgt gtgtttttgt aatgtttata aagatttggg ggaagtgtga 2880 

tttttagtgg aggggattta atagtgtttg gattgaggaa ttgagaggtt tgtaaatttt 2940 

ttgtgttttt ttttatgtat ttggttgggg gtttgtttta gtgtaaggag tttttgaatt 3000 

gtagagagga gagaaggtgt ataggagatt ttttattttg tttagttttg aagttttttg 3060 

gggtttttta attagttttt ttgtaatttt tttttgttgg gttttaattt gtttaagatt 3120 

gttttagatt tttttgtttg tagttgatgg agttgtgaag tttttattaa tgtgataaat 3180 

gtatgagata tattttttta gaagtataga tagaaaaatt tttgtttgta ggggtttttt 3240 

ttgtgtgttt gtttagtggt agtttttaga tattattaat ataattagtg gatggaataa 3300 

agttgggttt attgtttttg gtagtaaggg ggtttgtttg atggtgttat tagaggggga 3360 

aaggtaaggt tagattattg aaaatttgta gtttggttta aagtttgttt ttgatagggt 3420 

ttgataagga ttgggttagg tgttgtgata tgatgttata ggattgtggg aataaagttt 3480 

tagggtataa attgttggtg ttttttattg aagtgttaat gggttttttg ggaagttttt 3540 

ataatgagta atttatttat ttgtgtaggt aagaataaaa gtaaagataa tggaaatatg 3600 

tagatagttt taattgtgga ggttttggag ggtgtggaag ttttgttttt atttttgagt 3660 

agaggaattg ggagattgga ggataaaata agaggaagat ttatttttta ttgtttgttt 3720 

ttttatattt ttaatatttt aaaaagtata tttttgtata gtttatttta aaaagataat 3780 
tatgtatttt ttaatgtatg tgtatttagt gttttatttt attatagagt tttgtttatt 
ttatttagat agaaataatt gtttattaaa taaaattgtt ttttagaaaa atagattatg 
tgtaaatgat tgtatttatt ttttttgttt gaggataagt agatatttgt gtatatatat 

atatatatat atgtattttt gggtatattt ggaggaatat agtagggatt ttgtgatttt 4020 

gttattttta tttttttttt ttagtgttag gaattagggt ttggggttat tttgaattta 4080 

gtaggatttg tttttttagt gggaaggagg aggttgagtt ttagtaattt attagtggtt 4140 

ttgggttttt ttttagttta ggtttgtagt atttttttgt tttgtaaatt tggattttgg 4200 

gtttgggttt atttgagagt gatagaagga aggtagggag agtgttagga aggtaggaag 4260 

gaggaaggtt tttttgtttt gttttagttg agggtttagg gtgttagttg tttgttgggg 4320 

aaagtataag ttagtttttt agtttttggg aggtaggtga tttattgttt tttggagaga 4380 

tttttgtgtt tgttgagttg ttagtttttt aggggataga ttgagttagt gatgttatag 4440 

gttttagagt taa 4453 

<210> 360 
<211> 6001 
<212> DNA 

<213> Artificial Sequence t 

* ■ . ' -. 

•<220> £ ' s 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 360 ■■ t ' L ■ l ■ 

.•.*-" ' 

aaatttatag gtgttggtgt tatagaggaa gagattattt tttttgttat tataatttta ^60 

aatttaaaga gaatatttta gttattaaaa tattttatag atttttaata aaaaaaagta 120 

ttattttgaa gttttaaaat atttgtttta aattttaaat ttaataatta tagttgtatt 1°0 

qtaaggttat tgtttattga taattttaaa attrtagttaa gtgattaata tttgttttta 240 

ttttttataa ttttttttta gttaattttt ttttagtatt ! ttttttrtgat agtgttattt 300 
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3840 
3900 
3960 



tttaaagttt 
tttttttatt 
attttggaaa 
ttgttttttt 
gaaataaagt 
gtttaaatga 
tgataattta 
tttggtaaat 
tttaaaataa 
ttaatatttt 
aaagtgatta 
atgaattaag 
ttaaaaaaga 
ttgtaaagga 
taattaggaa 
agttatatat 
aattatagta 
gaatagatta 
aaattatatt 
ttaggagtgt 
aagtttgtta 
tgatttattt 
ttgtttttgg 
gagatttgta 
aaagagaggg 
aatgagagag 
agagagagag 
tttaatggga 
agttattatt 
attttagtaa 
gattggtgtt 
tttttgtagt 
tgtttatttt 
atagaggtgg 
ttagggtagg 
atgggggttt 
gaggtaggga 
agtagtagtt 
tatagtatta 
gattgtgggg 
tggtggtggt 
gttttagtaa 
gttgtgttgg 
taggaggaga 
gagttaaggt 
ggtgtttggg 
gttttgattg 
ggatgtaagg 
tgagtggggt 
gtatatttat 
gtttggatta 
ggtggtgttt 
agggtagaga 
tgtgtgggag 
aggttaggga 
attgtttttg 
gttgtttgtt 
tttttttttt 
tttttttttt 
Qtgtggttag 
aaaggttttt 
ttgggaaggg 
tgggtttttg 
gtttttggtt 
gatttgtttt 
attttagttg 
tttttttatg 
gttttttttt 



gtgttaatat 
tttatttttt 



agagttagtg 
gttttaatga 
tttaataata 
agattgatta 
tttgtaaaat 
atgtaatttt 
tgtaagggtt 
taaaaattgt 
ggtaatagta 
aagttataga 
gaataattta 
taaattagaa 
ttatataggt 
attttagttt 
aaattatatt 
tatttagggg 
ataattttgt 
agtaattttt 
agtgtttgtt 
tatagaagtt 
tttgtttata 
aagtgttttt 
gtatatttgg 
agagagagat 
agagaaattt 
gattagtgtg 
tgatttattg 
gatttttttt 
tttttaaaat 
tttaatgtaa 
agaattattt 
gtggtattag 
ttggtttgtg 
tggatttatt 
aagtagtgag 
agtaaaagtt 
tgattagttt 
gttgggtagt 
agtggtattt 
tatttaggga 
agatgtggga 
tgggagttta 
gttttgtagt 
aagtagtagg 
gtggagtagg 
aggaggtggt 
taggtgggga 
atattatttg 
gtgttttttt 
ttttgagttg 

ggggtgagtt 

tgtgttgttt 
tgttttttta 
ttgatttgga 
gaggtagttt 
tttaggtttt 
attatttgtt 
atgtgtttgg 
ttgtatatgt 
gagagtgggt 
ttggatggtt 
ttgtgtgttt 
ttgtttgttg 
ttagtaatgg 
attggggtta 
tggtgtgtga 



tgatagtggt 
gtttattagg 
atgatttttg 
ttgaaataat 
aagataaata 
tttttaagtt 
tagtttattt 
ttgattgtaa 
aggaatgaag 
ttttgtaggt 
aagtgaaatt 
aatatgttga 
ttgatttaaa 
tgatgtataa 
ttttattttg 
atatatattt 
tatagaagat 
ataatattta 
ttagtatatt 
attttttggt 
gttagaattt 
tgtttagaaa 
tggagtttga 
ttttttaatt 
attgtaaaat 
agatagatag 
tattgaaatt 
attttatggt 
tttttttgtt 
tttaatgatg 
agtgaattta 
gtgtaagtat 
tttatttttg 
gggtttaggg 
ttggatatgt 
agagtaattt 
ttttattggg 
tttgtaaagt 
gtgtaatttt 
gggttattga 
ggtggggaag 
ttgggtttag 
agttgttgtt 

ggggtagatg 

ttggtagttg 
tttttagttt 
atgtaggttt 
tgtggaagtt 
tggtttaaat 
ttttattttt 
tttgtgtgtg 
gagatggtgg 
ggtgtgggag 
aggttgtagg 
gggatttata 
agtaatagta 
agttattttg 
ttgtagtttt 
tttttttagt 
aaggttagat 
tgtgggtggg 
tttatttgtt 
ggtgatttgg 
tttttatttt 
tattttttag 
gattaagagt 
ttattgtagt 



seq_prot.txt 
gaatgaaagt ttaatatttg 
tggataatat ttatgattat 
aatatttttt tattattttt 
tttttatttg aaatgtattt 
ggtatagtgt tttttgtgat 
aataggggtg gaagtggggt 
tttttagttt atgtaatttt 
atgttaattt tatatttaag 
taaattagtt tgtgttggat 
ttataattat tattataata 
tattttttta aaaataatat 
gttttattaa aatgttaaat 
tttaataggg ttgtatattt 
taattttttt tttggtattt 
agttatagtt ggttattttt 
ttgaaagata ttttatttag 
gaaaaattat atatatattt 
ggtatgttaa tggagtttaa 
ataggttgtt aaattgaaat 
tttatttaat taggaagttt 
gaaggaattt gagtgagtaa 
tggaggagtt agtgtttatt 
ttagttttag ttatgtttat 
tttttgtatt attttgaagt 
taatgttttt tgttgtttag 
agagagagag agagagagag 
tagttttttt agaatttgtg 
atttttgtta ggaattagtg 
tatttttttt tataaagtta 
ataaagtttt tgttttagtg 
gaaaattatt ttgtttaata 
gtaaaggttt tttgttatat 
ggtttgtaat agtttttttt 
tagtaggagg tgaggggttg 
gtgttttttt gtggagttaa 
tagttggtgg gtgtttggta 
tgggttatga tgagtagtat 
gtttagttgt tgtattgttg 
gtagtagtaa atggtttttg 
gtattttgtg gatggtggta 
tagtagttaa atttgtgtat 
ttttggagtg agagggttgt 
ataaggaggg agttttggga 
agtggagttt gaggaggtag 
ttttttgagt tttgttgttt 
gtttggggtt atgtgggaag 
tgggagggag gggttgatga 
atagatgggt ttgtttgtta 
gagaagtttg ggttttaggg 
tgttttagga tgttttttat 
attttgggtt gtgagtgtgg 
gggtggaggt gttagaggag 
agggtgtttt gttggtgatt 
gggatgtagg ttgggaatgt 
ggaaagaggg tgagaggtga 
gtatttttta taagagtgtg 
gtaggtgttt tttttttttt 
gatttagttt aagtgtttgt 
ttgtagttta ttagtgtgtt 
tggttgggat aagtggtttg 
ttgttgggag tattggttgg 
ggtttaggta gtgattttgt 
ggtgatgaga gaaggtttaa 
ttttagtggg aagggtaaat 
gtagttagat atatttttta 
ttttggggat aagggtggag 
tttagtgttt tggatgtttt 
atttt tg t t*_ ttgtttttgt 
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ttttttgttt 
aaaattttat 
ttatttttaa 
agataaagag 
ggtttgtttg 
gttaagtttt 
ttttaaaata 
ttagttattt 
tataaagtta 
aagtatttaa 
atatgtatgt 
ttagaaattg 
taatttgttt 
atattagtaa 
ttttttttaa 
agttagattt 
gttaaaatta 
aatgttaagg 
gttatgttag 
tagtagagtg 
gaagagtgtt 
gaagttggtt 
tttggtttgg 
ttagggaagt 
gagagaaggg 
agagagagag 
tgatttggtt 
atttttttgt 
tttttttttt 
ttttttttag 
ttttttaaaa 
ttgtattttt 
gtttttttgg 
aggaggtgtg 
agggttgggg 
gttatttaag 
gatgggtagt 

tggggatttt 
aggaatatag 
gtagaggtgg 
gattttgaga 
ttgttgagaa 
agttggagga 
ggtggaggga 
gtattttttt 
aggtagttgg 
ggaggtgtaa 
ggtgttggtt 
tgggttattt 
tgaaggtggg 
gttgtggttg 
tagtagtagt 
ggtgttttag 
tgtggtggag 
tggtgttaga 
taattttaag 
tttttttttt 
aggtttgaat 
tatttgggag 
agagaaagag 
gtagtggtgt 
gttttttatt 
tttggtagga 
ggtatagtgg 
gtttaatgga 
aggaatattt 
atagggaaga 
ttattt-tttt — 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
43m— 



seq_prot.txt 

tttgtttttt ttttttattt ttttttgtta tttttttttt tttttttttt tttgttttta 

aaagtttttg gatttttttt ttttttattt aaattttttt tttgtgtttt tttttttgtg 

ttttttgaat ttaggagagt atttgataat atttaatagg taattagtgt ttatttttaa 4560 

ttatttaaaa gaggtattta tatattttga aaatgggatt atttattttt tgtagatatt 4620 

agtagaaaaa taaattgtat ttgagtaatt tttttaagta ttttaatttt taattttttt 4680 

ttattttttt gttttttaat tttttttttg agagatgtga ttgtgtagta ttttagtgtt 4740 

ttaatgaaat tttttttttt tttttgtgtg aaatttattt ttttatttta tatttttgtt 4800 

tttgtttgag attgtttttt ttttttttta tttttaaaga tttttgaatt ttagtgtttt 4860 

ttatttttgg taattaagta gtagatttta gtattttagt tggtggtatt ttgtttttta 4920 

5040 
5100 



<400> 361 



4440 
4500 



ttgatgaaga ttttattaaa atagattaat tagattagat gttggaggta ttagaaaatt 
ggtttttaga tagagtagtt aaatttttta aggaaataga atatttatta gatagagttg 
ttaattaata ttgtaaaata aggaattaga aatttttttt gttataggtt tttagtagag 

aaggtaatat aaatatagat taagatttaa taattttata gtagagaatg agaatatgtt 5160 

attttttata gtaaggttgg tgtggtaatt aattaggttt atgaaaataa gttatgtttg 5220 

aaattaaagg taaagttttt aaaagtgttt atgtagtaat tatgataatg aaataggatt 5280 

tgttaggatt ttagagtttg gttatgtaag tagaatttta gagaattttt tagtagagga 5340 

aaattgtttt tgaatttttt gttaagtaaa tttttggtat attttttaat aatatatgtt 5400 

ttttttaaga tgttttgtta aaagtaagtt aaaattttaa aggagttaat tattggttgt 5460 

aattggttaa taaatgtggt tgtttttata gaggtttttt aaattattaa atagtttgaa 5520 

gtaaagtttt tttaatggga atgttgtaat tttgttgtat ttattttgta tttagtgtta 5580 

tagtgttatt aagaaataaa ttttgaaatt ggtaagtatt attaagtggt agaagaatat 5640 

tatttattga gtagagaatt gtattattga atatgtaaat aaaaatatat atattattta 5700 

gatttgttat taggtattaa agaagtagat aagattgtat tagtaattgg attagtgttt 5760 

taattttttt ttagtaaggt aaaattagtt tatttattag aattaaattt aagtttatga 5820 

attgtatttt gtattgtgta ttatatgatt gttagtaata tgatataatt atattatgta 5880 

tttgtaaaat ttttatttta aaatattata ttatatttat ttttaatttt tttgagttag 5940 

aatattttat ttgtggtata tatattttag aattgatgta gaggagtaga gtttagttgt 6000 

-£ 6001 

<210> 361 
<211> 6001 
<212> DNA 

<213> Artificial Sequence 

<220> . „ . 

<223> chemically treated genomic DNA (Homo sapiens) 



60 



aataattgga ttttgttttt ttgtattaat tttagagtgt atatgttata aataaagtgt 

tttagtttaa gaagattgaa agtaaatatg gtatagtatt ttaaaataag aattttgtaa 120 

atatatggta tgattgtgtt atattattag taattatatg atatgtaatg taaagtatag 180 

tttatagatt taaatttaat tttaataagt aaattgattt tgttttgttg gggaaaagtt 240 

aaagtattaa tttaattgtt aatgtagttt tgtttatttt tttggtattt agtgataagt 300 

ttaaataatg tatatatttt tatttatata tttagtaata taattttttg tttaatgagt 360 

gatgtttttt tgttatttgg tggtgtttgt tagttttaga atttgttttt tggtggtatt 420 

ataatattaa gtatagagta agtgtaataa aattgtagta tttttattga aaaggttttg 480 

ttttaaattg tttaataatt taaaggattt ttgtggaagt aattgtattt gttaattagt 540 

tataattagt aattaatttt tttggagttt taatttattt ttggtaaaat gttttaggaa 600 

gagtatatat tattagaaag tatgttaaaa atttatttag tagaaaattt aaaaatagtt 660 

tttttttgtt aagaggtttt ttaaaatttt atttatatag ttaaattttg aaattttagt 720 

aggttttgtt ttattattat aattattgta taaatatttt taaggatttt gtttttagtt 780 

ttaagtatga tttattttta taagtttgat tagttattat attagttttg ttatggaaaa 840 

tgatatgttt ttattttttg ttgtagagtt gttaaatttt gatttatatt tatgttgttt 900 

tttttgttga aagtttgtag tgaaagaaat ttttaatttt ttgttttgta atattagttg 960 

gtagttttat ttaatgggta ttttgttttt ttaaagaatt tagttgtttt gtttagaagt 1020 

tgattttttg atgtttttaa tgtttggttt aattgatttg ttttaatgga gtttttgttg 1080 

gtgaggagtg agatgttatt gattagaatg ttgggatttg ttgtttaatt gttaggagtg 1140 

agagatattg agatttagaa atttttggag gtgggagggg agagggatag ttttggatgg 1200 

aggtggagat gtaagataaa gggatggatt ttatatagga aaaaaaaaaa gattttgttg 1260 

aggtattgag gtgttgtatg attatatttt ttaaaggaga agttaaaaag taaggaagtg 1320 

qgaggaggtt ggaggttaaa gtatttaaaa ggattatttg ggtataattt gtttttttgt 1380 

tggtgtttgt aaaggataga tagttttgtt tttaaagtat atgaatgttt tttttaagtg 1440 

attgggaatg gatattaatt gtttgttaaa tgttattaaa tgttttttta aatttagggg 1500 

atatagaaag aggggtataa aaggagaatt taaatagaaa aagggaggat ttggaggttt 1560 

ttgaaagtgg ggggagaaga aggaggaggg ataatagaga ggaatagaga aggagagtgg 1620 

agagaagata aataaaaata aaaataggaa ttattgaata attatatatt aaaaagaaag 1680 

ttttttttta tggggtattt aaaatattga gattgtaata gtgattttgg ttatggaaga 1740 
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aagatgtttt 
ttttattagg 
tttgttatgt 
tttttgttga 
gagtaaggaa 
gatgttgttg 
tttttttttt 
ttttttaggt 
gatttaaatt 
gggagaagag 
ttttggggtt 
ttttaatatt 
tttttgttgt 
gttggagtgt 
tgttgttgtt 
ttagttgtgg 
ttttgttttt 
tgggtagttt 
gggttagtgt 
tttgtatttt 
tttgattgtt 
tagagggatg 
ttttttttat 
gttttttggt 
ttttttagtg 
tttttgagat 
gttgtttttg 
tttgtgtttt 
gggagttttt 
tgttgtttgt 
ttttaagtga 
tttttgattt 
gtgtgttttt 
tttagagaaa 
agaaagtgta 
attttgagaa 
tttaggggaa 
tgaggaagaa 
tgtaggggaa 
agattaggtt 
tttttttttt 
tttttttttt 
tgttttttta 
tttaggttgg 
gaattgattt 
aggtattttt 
ttgttttgtt 
gttggtatag 
tttttggtat 
ttagttttag 
taaatttggt 
tttagagagg 
attattgatg 
aagataagtt 
gtagttttta 
tgtgtatatg 
tttagatgtt 
atggttttgt 
gaaataatta 
atattttaaa 



tttttatttt 
ttaaagagtg 
tgtttgtttt 
gttagatttt 
tgtagggtta 
tttggttgat 
ttagattatt 
ggatatatta 
tgtgaatgtt 
agaaggaagg 
gtatgttttt 
attgtttttt 
gatattttta 
tggttgttag 
gtttttttga 
tttgtatttg 
ggtagggggt 
attttgaagt 
ttggtgagtg 
tttgttgatt 
tttttttatg 
tgggtggtag 
tttgtttttt 
tttttagagt 
gatgattttt 
tatgtgtggg 
ttgttgttgt 
ttggaagttg 
gtggtagtgt 
tatgttattt 
ttgttaaatg 
tttaattttg 
ttagtttttt 
taaggggaat 
ggtgtagtag 
atattaaatg 
atattgaaat 
ataattttat 



ttgttgattt 
atatagattt 
tttttttttt 
tttaggtggt 
agttttaagg 
agtgggtatg 
tagtgggtgt 
tttgtttgtt 
gaagtttttt 
tattttggtt 
tttaaatttt 



eiagaatttgg 
ttaaatagat 
tttttttgtt 
attaaagatg 
tgtggagttt 
aaaataggaa 
eaaatagtat 
atggaaatga 



taagtgtgta 
tttgaataaa 
agggaatgat 
taaatgttaa 
gagatatgta 
aatttggtat 
tgtgttgttt 
ttattatggt 
agtttaatat 
atttaaatgt 
ggggattgta 
tattttgttt 
tattatagaa 
tggatatatt 
gggaaatggt 
tgtggttata 
gtaaatgtta 
tattagaaaa 
atattgatta 



tgtttttgaa 
tgtttaattg 
ttttgttgga 
tttttgttgt 
ttgtttgggt 
gtttttggta 
tgttttgatt 
ataggttatg 
tgggttgggt 
agagtgtttg 
gtgggagatg 
attttttttt 
gtttgtattt 
taggatgttt 
tatttttgtt 
tggtttgggg 
gttttggagt 
ttgagttttt 
agtttatttg 
tttttttttt 
tgattttggg 
agtttgagag 
tgggttttat 
ttttttttta 
ttgttttggg 
tttggttgtt 
ttgtgggatg 
tttgttgttg 
agtagttgga 
attgtagttt 
tttattggtt 
tagaagaata 
attttttatt 
tgttgtaggt 
gaaatttttg 
ggatggtttt 
agaagttttg 
gagaaagaat 
ttggtaaagg 
tagaggagtt 
tttttttttt 
aaaagatatt 
tagtataggg 
attgaggttg 
tgattttttt 

*gggtttttt 

aattaaatag 
tgatagtttg 
gttaatatgt 
tgtaattttt 
gtgtttttta 
taattgtaat 
aaggaaaatt 
attttattag 
tttaataaaa 
ttgaaagaat 
aataattatg 
agattggttt 
gaagttgata 
taaattaaag 
ttatttttgt 
aatattgtgt 
ttaaataagg 
aaaagggtat 
aatattgttt 
agtttttatt 
ggatattaaa 
tttaattaga 



seq_prot.txt 
agtttttggt tttgttattg 
tttgaagaat gtagtagatg 
gagaatgaaa gaaatgtgta 
tttaggttat tggttatttg 
taataaatgg agtttgtttt 
atttatttgt ggtgtatgta 
aatttgattt tttaaatata 
ggttggagag gagtgggtga 
tggagttgtg gggggtttgg 
ttgggatggt tgagttgttt 
ttgttgttgt ttttaggttg 
ttgtaaattt ttagagaaat 
ttttatagtt taggtggtgt 
ttttttgtgt tgatttgttt 
tttattattt ttagtttgga 
ttatgtgtgg aagaggggtg 
ggagagtgag gtgaatggta 
tatttgagtt atttttgttt 
tggtttttgt ggttgttttt 
gggatttgta ttttgtttta 
tgggttgagg atttgttgtt 
gtggttgttg ggttgtgggg 
ttgtttgttt ttggattttt 
tggtagtagt tttttgtgtt 
gttgagttta gtttttggat 
gtttttttgt tgggtgttat 
tttagtagtt tgttgtttgg 
tagagttgta tgaattagtt 
tattttgtga gggtttttgt 
gtttggtgaa gtttgttgtt 
ggaattgttt tggtaagttt 
tgtgtattta gtatagatta 
gttttagatt tttaatatta 
ttgggggtga ggggtggttt 
tatgtttgtg tttatattgg 
ttgggtttat tgttttgaaa 
ttattattaa agaaaaaagt 
gagtgagaaa gtaataaatt 
tgttatgagg ttgtattggt 
gggttttaat agaatttttt 
tttatttatt tatttttttt 
ggttttgtag tttagatatg 
gagttgagaa aaagaatatt 
gttaggtttt atgtaggtga 
atttttggat aggtttttgt 
tagattttga tggtgaatgt 
ggttagagga tgggagttgt 
tagtatattg ggtagaattg 
ttgggtattg ttttttaggt 
tattttttgt gaatataatt 
aagatgtata taagttgaag 
ttagggtgaa agtttgtata 
attatgtatt attttaattt 
atttgggtta atagattgtt 
tttaatatgt ttttataatt 
aagttttatt ttgttattgt 
agtttgtaaa aataattttt 
gttttatttt tagtttttgt 
tttgtaatta agaaattata 
agaataaatt ggttttgtag 
taatttagag gtgattaatt 
ttgtttattt ttattattga 
ggttgtttta gttgttgaag 
tftagagatta ttattagttt 
atttaatgag taaaaaataa 
tattattgtt agtattaatg 
ggagaattga ttagaaaaga 
ttttgaggtt attagtagat 
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gtgattaaaa 
gaaggtgggt 
gagttagaga 
gtaaagattt 
tttttttttg 
gaggagtttt 
tttgattgta 
tgaggagagg 
gaggagagag 
tggtgagtag 
gtaagagtgg 
gtttttggtt 
gtttttgtat 
ttttttgttt 
gagatgttat 
ttagtttgga 
tatgagtgtg 
agttttattt 
tttttgtatt 
ttaattagag 
ttttaaatgt 
tgttttgatt 
gttttttttt 
tttggtgtag 
gttgttgaaa 
tgttattgtt 
tttttgtgat 
atggtgttgt 
tggttgttgt 
ttttttattt 
agaatttttg 
gtttatttta 
tttattttta 
tgggatgggt 
agttgtgagg 
gagtattaat 
tttattagga 
aaatggtgat 
tttttgttga 
tttttttttt 
ttttttttat 
tttttttttt 
ttgtgggttt 
gttgagggtg 
ggagtgggtt 
ttggtaggtt 
tgtattttta 
tggtgtaatt 
gtggtttgtt 
ttgttgtagt 
tgtatgtaat 
gtttttagtt 
attttttata 
tttttttttg 
ttttgattta 
ttaattattt 
ggaaatgttg 
attgttttag 
tatttattag 
ataggttgtt 
tttatttgag 
ggttttgttt 
taaaagaata 
ttttttaaaa 
aaataaaaaa 
taagttttaa 
attgtggaaa 



1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 



seq_prot.txt 

tagtatagtt atagttgttg gatttaaaat ttaggataag tattttaaag ttttaaagta 5880 

gtgttttttt ttgttaaaaa tttgtaagat gttttaatga ttggagtgtt ttttttgaat 5940 

ttgaggttat gatgatagag aaaatgattt ttttttttgt gatattaata tttgtaaatt 6000 

6001 

<210> 362 
<211> 7001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 362 

aatgtaatgg aaaaagagag attgtaaagt tagaaggttt aggaattgtt ttttgattag 60 

gtgtggaagg taagggaaaa ttagtttttg aagaagatag tgagatttta atttgggtgg 120 

ttggagagat agtgatgttg ggtatagata tggggaagtt gagaggaata ttatgtttga 180 

gaatggtgat ttatatttga ataagtttgt aatgtttagt agattgttgg aaaagtgggg 240 

ttggagatat atttaatgga ggagttagat taatttttat ttttttttat ttgagagagt 300 

tagtaagtta tggttggaat gtgtgtgttt agtaggagag ggtagggagg gaagttaaga 360 

gagttgggag tttgagtgaa gtttttgtta aaggtagaag aggaaagttg gtgtagtata 420 

gtatattttt ttatttatgt ttattaagtt tagggataag gtttattaag atgagtttgg 480 

aagagaatgt tggagagaaa gtggttaaga aaattgtttt tattgaattt tttgggttaa 540 

ttttgattgt aagtttttga ataattaaag tttgtgagga gatagttaat ttttttattt 600 

tttttatgtt aatagtgaat aattgtagat tttttttttt tttttttttt ttttttttgt 660 

tttttttttt tttttttttg aatatttttg tttttttttg ggattggttt agagtatggg 720 

tggttattgt tgatttatag gaggtattat tgttattaat aaagggtaat agtttttttt 780 

tttaatattt atttatattt agtatttatt tttaatattg attatggaga gagttttttt 840 

gtgtttaaat attgtaatat tgggggtttt ttaaagtata aaaatatata tttgtatgat 900 

ggtattatta atatttttat ggttttttat tttttttttg tattggtttt aagagttatt 960 

tataaatttt ttagtaattg tatagtgttt tagggttaga gattggttat ttttggtatt 1020 

gtgattagag ttatttaata tttaaggtgg tgattaatgt ttggtaataa agtttttatt 1080 

gggtgttatg tgttttggga ttttgagtgt gggtatttta ggagtatttt agtattgtgt 1140 

gttagtatta tggttgagag aatagttgag aaagtggtta agaggtggat ttatgtgaat 1200 

gttattggga aatgagagat tttgttttta attatggtta gtgtaatttg aaagtttaaa 1260 

attagtttaa aataaaggta tttattttta ttttatgttt atattttagg tttttaataa 1320 

tatgtatttt ttatatgttt atagaaagta gttaattgag ttatttatgg aaaggtttgt 1380 

gggtttggtt aatgaagtgg aggagtatta tattttagtt ggaaatatat ttttagaatg 1440 

ttaaaatatt tattttaaag tttggttttt tggtgtaatt ggaggtatgg taatgttttt 1500 

gtttagagat tgggggttag ggttagtaag gtatttgatt tatatgtatt ttagaaggtt 1560 

tttattgtta aattatattt ttttggaaaa attatttatg ttttattttg taaatttgat 1620 

atttatatat ttttgattgg tattttattt tagttgtaag attatgattt atagtaagtt 1680 

tgtttttttt tttgtttggg gtggtagtag aaagtatagg gtatttttta gtttttaagg 1740 

gtaggggtaa aggggttggg gttttttttt tttagtatag ttttttttgg ttgtgttata 1800 

ttgttttttg tgagtagata gtaagttttt ttttattttt tattgttatt tatttagtgt 1860 

tgtgtagtag tttagttgtg tgtttgttgg gaggggttgt taagtgtttt gtttattggt 1920 

tgttttttga atttttgtta ttttatgtat aaatatattt atatattttt tttgtttagt 1980 

ttatatattg agttatttgt atatgtgagt atattttttt tttttttttt atttttttgg 2040 

tttttgattt ttataagttt atggaatatt tttggaaaga tgtttttgat ttagtagggt 2100 

aggtttgttt tgattttttt ttttgtagtt ttagtatttt gagaaagtaa tttatttttt 2160 

tggttagtgt ttgtatttta gtagggagat gaggattgtt gttttttatg ggggtatgtg 2220 

tgtgtttttt ttttttttta ggatttgtag gattttttgt gttatttgta tataatttgg 2280 

taggtttata ttttttaaga gttttatgaa gtgttttttg tatgtgtttt aaaaaggtat 2340 

ttgaaaattg aaagtgtgat ttatggaaat taaattattt gtaaaaaatt gttttggaaa 2400 

gtaatgattg ttggttataa agggaaatat ttgtgatgta tttaatgtgt ttttaatttt 2460 

ttatttgttg ataatttata gttattaatg ttaaatttga ttttggtttt agttatattt 2520 

gtatattgtt taataatggt ttatttttgt aagaattaga taaaatgtat atttgatata 2580 

aaatagttaa aaatgtaatt tttagtaata gtaagtttgg tatttagata gattatgaat 2640 

attttgttag atattttgtt gggtgtttgg gatagtaatt aaaataaagt attgatagtt 2700 

gtattagagt ttattaggtt gtagtaaagg aagtttattt aaaagtataa attatttaag 2760 

^attatagatg tatgatatat tttatttatt ttttgttttt ttaatatgta tatatatata 2820 

itatatatata tatatatata tatatgtgtg tgtgtatgtg tgtgtgtatg tttaattttt 2880 

aatttagtta aaaatttttt tttatttgtt ttttatttgg atatttgatt ttgtatattt 2940 

tagtttaagt gaattgagaa gattgagttg taggattaaa ggatagatat gtagaaatgt 3000 

attttaaaaa xttgttagtt ggattagatt gataatgtaa tataattgtt aaagttttgg 3060 

tttgtgattt gaggttatgt ttggtatgaa aaggttatat tttatattta gttttttgaa 3120 

gttttggttg .tataattaat ttgtggaagg ta£gaatatt tatgtgtgtt ttaattaaag 3180 
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gtttttttga 
ttaatataaa 
attttgggag 
ggtaaaatgt 
ttttttgtta 
tgtatatttt 
tatgtatata 
aatagagata 
tttttggatt 
gatttttgtg 
aagtaaagta 
gagtgatgtt 
ttgtaatgta 
gttggtggag 
tggatttgta 
gttgtgttgt 
ttttttgttt 
ttttttatat 
agtttttgaa 
tggatagtag 
ttgtggttgt 
ttgaggttgt 
ttttgagttt 
atggttagta 
ttggtttgtt 
gtttggtagg 
tgtgggatgt 
ggttttgtgt 
aaatgtattt 
ttaataaaat 
tttaggtaaa 
aaagttgggg 
ttttgtttaa 
agttatgatg 
tttatgtttt 
ttggttgagt 
taggtgtttt 
ttgttagttt 
ttagttaggg 
gagattattt 
ataagaattg 
tttgtttttt 
gtgaatattt 

a gggtgttat 

ttggtgtttt 
tttttaggga 
attgaagggt 
gagtatttta 
aatttttaaa 



agagaaaaag 
gattaagttt 
gatgtgaatt 
ttagtttttt 
tttggtagat 
gggattgtat 
tgttttattt 
agagataagg 
tttagtttta 
agaattttgt 
gatagatatt 
aaaggaaagg 
agttttgggg 
ttagtttagt 



attatttttt 
tatataagtt 
tattttgtag 
agtttttttt 
gtttttttag 
attttatttg 
ttatgtattt 
tattggagtt 
tgttttttgt 
tgtttttgtt 
aagtttaggg 
taagttaatg 
tatgagtttg 
ggtgtttgtt 
ggtttttggg 
ttttaatttt 
tgtggggata 
taaagtattt 
ttgtttgtag 
taagtttgtt 
tgttaatgtg 
ggtgtttggt 
gttgatgtta 
ggtgttttat 
ggtattttat 
agggagggag 
gtgatttgag 
gtagtttgtg 
tttatttatt 
atatgttgga 
taaatatggg 
gtgtttggag 
tgtgtttttt 
atgatttttt 

ggggttttag 

ttgtgttgga 
gtgagtttaa 
agtgataagt 
tttgtgtttt 
tgtgatttta 
tttttttttt 
gttttttgtt 
ttttaggtgg 
atggttgggt 
tatatttttt 
tatttatagt 
agattttggg 
tagttatgta 
ggttggtata 
gatttttttt 
aattttatat 
ttataatttt 
tattgtaaat 
tatttattgt 
tttatttaaa 



atatgagaat 
aggttgagag 
aggaagaaat 
atatgtatat 
ttgttaaata 
ttttattttg 
ttagttaatg 
tggtatgggg 
gtttatttta 
ttttggttgt 
aagttgtttt 
ttagggtaag 
ggagattagt 
ttgggattgt 
gtagggttgg 
gggttgtgtt 
tggtttgtat 
gggatggttt 
tttaagattt 
gtgtataatt 
taggtttatg 
tttaatggtt 
ttgtatttgt 
tatttggaga 
aggtatttgt 
ggagggaggg 
ggtgtgtgta 
ttgtgtttag 
tattgtttgt 
aaattagata 
gtgttttgag 
ttagtagagt 
aaattttttt 



tatatttttt 
aatttaaaaa 
gaatgatttt 
ttgttattat 
atttagtatt 
attgattggg 
ttgaaagtag 
gagtttagaa 
gtgtatggta 



gggaagtgat 
agagtgattg 
gtggattgtt 
aataagttat 
ttgttttatt 
gtagttattg 
atggtgaagg 
tttttttttt 
tttttgatgg 
ttataggata 
tttgaattag 
tgttttgtaa 
agaaggaagg 
ggtatttagt 
gatttaagtt 
ttttaaatta 
gtttattttt 
aatatttaaa 
atgatggatt 
taatgttgtt 
tttgttaata 
attgtttatt 
gtattatatg 
ttaaaaaaaa 
ttttgtgttt 
ttagtaagga 
tttttttttt 
tttttttagg 
tgtttttaaa 
tattttttaa 
ttttgtaagg 



seq_prot . 
ttttaatggg 
aattttagaa 
tgaggttttg 
tttgaatttt 
tagttgtttg 
gttatagtgt 
aggaggggga 
tatataaggt 
agtttagttt 
gaaatttagt 
ttgggattgt 
gtaatagttt 
atttaaagtt 
atttgttttt 
ggttagagtt 
ttttttttag 
tttgtttgtg 
tattgtatta 
ttttggagtg 
attttgaggg 
gttagattgg 

tggggggttt 

tgttgtagtt 
atgagtttag 
gtttgtgttg 
agaagggaga 
gggagtttgg 
agttaagttt 
gtgagaggta 
aagtagtagt 
ttatttggga 
ttagtagagt 
ttatttagat 
ttttgttatt 
ggagttgaat 
ggtatgtgat 
atggaagtat 

tggggagaat 

tggaaggagt 
ttgtgtgttt 
ttttttattt 
gtttgtagag 
ggtgtatttt 
gtatttttta 
gttgtggatt 
ggattaaagt 
gtgtattttt 
gtttttattt 
attttatttg 
ttttttttaa 
attgtttgat 
ttagaaatta 
gaattttaat 
attataaagt 
attttagtga 
atatttaaat 
aatttgtata 
tattttttgg 
aaagaagtat 
tagttttttt 



ttgtttgtta 
aaggattaaa 
ataagttttg 



txt 
attaagtata 
ggttgtggaa 
gtaggttgta 
tgtttttttt 
tggttggttg 
agtttttttt 
attaaataga 
agtatattag 
tttttgggtt 
ttttatttag 
tttaaattga 
ttggttgttt 
ggaggtttgg 
gttgggttgt 
tgtgtgttgg 
gtggtttgtt 
gttatggatt 
gatttaaggg 
gtttttgggt 
tgttgtttat 
ttttttttat 
ttttttattt 
gttgtttttt 
tggttatatg 
tttgttgggg 
gtttagggag 
ggtgtgtggt 
ttttgttggg 
gatttgaaag 
tatttgtggg 
aggttttgtt 
tttatttatt 
tatttgattg 
tgtttttttt 
gggtttgatt 
ttttgatagt 
aagtgtttaa 
gttttggagt 
gtggttggtt 
ttattttaat 
ttttttgttt 
ggattaggtg 
ggattgggtt 
attgtatatt 
agtttttgtt 
gtagtttggt 
agttgttttg 
aaaagataaa 
ttttaattta 
atggaagaat 
gtgaaggaag 
tttttttttt 
gggtattaat 
gaatttggtt 
taagtggtat 
atttttagaa 
aatgggattt 
attatttttt 
ttatgtaagt 
gtttaaatga 
ttttttaagt 
tgatttagag 
atggaaagga 



gtattgtggt 
gggtttattt 
tttttttgat 
tttttagatg 
tgtatgtaat 
agggttattt 
aagagagata 
agaaagttgg 
atttttagta 
tagtgatgat 
gttgtgtttg 
tttagtattt 
gagtttagga 
ttggttttat 
tgggatatgt 
ggtttttgag 
atgattatga 
aatgagttgg 
gaggtgtatt 
gagtttaatg 
ggttttgggt 
aatagtgtgt 
ttgtagtttt 
gtgtgtgagg 
tggttgttgt 
ttgtgggagt 
ttagtttggg 
tagttgaaaa 
tttgggtttt 
ggaaaatatt 
tttggtattt 
tttttaatgt 
gaaatatgtt 
ttttttttat 
ttggagttag 
tggaaatttg 
aaataatttt 
ggtgtgtggg 
taggatatag 
tttttttttt 
agtttttttt 
ggtgtttttg 
tggaagtttt 
ggtatttgga 
tagtattttg 
tttagaggat 
gagaaattta 
taatgaataa 
gggttaaaat 
aaagtatagt 
gtattggtat 
tatttaattt 
gagattgttt 
aaatatagag 
tagtgtaata 
taataaaaaa 
tgtgtggaaa 
tttttttgtt 
tttttatatg 
ttttgggtat 
agtttttaaa 
ggttatgaat 
atataataga 



3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7001 



<210> 363 
<211> 7001 
<212> « DNA 



■ ■ * 
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seq_prot.txt 

<213> Artificial sequence 

< 220> . ^ 

<223> chemically treated genomic DNA CHomo sapiens) 

<400> 363 

ataaaattta tttttatgaa atattatgta ttttgatatt ttttattata ttttttttta 

ttttagtttt tttaaaaaat attttagatt tattaaattg agtttatgat tttttgggtt 120 

atgataagta gtttgaaaat attgttttta gttttaaaat ttttgagaat tatttaagaa 180 

atttaaagtt ttttaaaaga gtttaattaa tttttttttt tatatttaga gttatttaag 240 

tagaaaaatt gagaggggaa aaatattaaa taatatttgt ttatatgaag aatttgtata 300 

gatgtttttt ttttttgttg gataataata ggtagaattt taataaaaga ggaaagataa 360 

tttgggaaat aaaatatagg aaaaattatt ttaaaattga atttttatat agagttttat 420 

ttgtgtaagt ttttttttta atttttaagt tttttgtttt tttttttatt attttaagag 480 

tgtttgaata ttatgtaatg tagaaagtgt aagatagggt atattatatt gatattattt 540 

attattggga tgatgaataa ttttgaataa gatgtgattt tttttgtatt tgattaggtt 600 

tattttgtaa ttattagtaa ggtagtaaat aatttattaa ggagtagttt tattagtgtt 560 

tattgaaatt tagtagtatt aatttgtaat aaaagaattg aaaattaaat agggaagaaa 720 

atggtttttg gagtttatta taaggttatg gaatttatat tatattagtg ttttttttta 780 

tattaagtag ttttaaatgt tgtgtggaat tagatttaat tgttgtattt tgttttttta 840 

tttaaaaaaa aaaaggtggg tagaagaaat tttttttttt tgttttaatt ttaaattaaa 900 

ataagtaaaa ttaatttgaa atatgttaat ttttaaaagt tttgtttatt gtttgttttt 960 

tgagtaaaga tagtttggat ttgtgtggtt gtgggatgtt ttaaattttt ttaaggtggt 1020 

tgaagatgta tattgaatat ttttagaatt tgttttttag tattttttgg ggttaaattg 1080 

tattttagtt tttttttttt tgttataaat atttttggaa atagaatatt gaataaaaat 1140 

tggtttatgg tttataaggt agaaagatat agggatatta gtttggatat tagtgtatag 1200 

ttgggaaatg tttaatttag gatttagtta tgtggtgttt tgaagttttt aaatttagtt 1260 

tggggtatat ttgttttgtg gttatttagg aaggtgttta ttagaagtgt ttatttaatt 1320 

tttttgtagg tttattagga aaataaaaaa taaaaaataa aaggagaaat tgggtaagag 1380 

aaaatgggag ggagaggaga gggagaaaga ataatttttg tagggaaaaa aattaaaatg 1440 

aggatatata atttttgtta ttgaagttat aaagtggttt tttgtgtttt ggattggttg 1500 

tgtttttttt atagtggttg tgaggtgtag attttggttg atttgtatgt tattttgggg 1560 

tatttttttt gggtgggata ggtttgttat tgggttggta ggagattatt tttaagtatt 1620 

tgtgttttta tatggtttgt tttaaatttg tgggatattt ataaattttt ggttgttaga 1680 

agttatatgt tagtaatttg ttttagtgtg gatttagtta gttaattttg aaattagatt 1740 

tatttaattt ttaattgttt tttaaagttt taggatgtgg ggtggggagg aggggaaagt 1800 

gggtgatagg aattgttttt tagaaagtta ttattatagt tgatatattt ttaattaaat 1860 

agtttagatg aaaggaaatt tggggagtat attaaataaa aatattaaaa ggataaataa 1 x 20 

aattttgttg agttttgtta attttaaata tttttaattt taaatgttaa gagtgagatt 1980 

tttttaagtg atttaaagtg ttttgtgttt atttgtttgg aggtgttttt ttttataaat 2040 

aattgttgtt ttgtttggtt ttttaatgtg tgttttgtta ggaagtttgg gtttttgggt 2100 

ttgttttttg tatggatggt aagtgggtgg agagtatgtt ttttttagtt gtttggtgag 2160 

agaatttgat tttgaatgta gtgtgggttg tatgtagaat ttttgggttg ggttgtgtgt 2220 

tttgggtttt ttgtgtgtat ttttgggttg tgtgttttgt ggtttttgta gttttttagg 2280 

tttttttttt tttttttttt tttttttttt ttttgttggg tgtggtggtt attttgatgg 2340 

gtggtgtggg tgtgggtatt tgtagaatgt tggtgggttg gttttgtgta ttgtgtagtt 2400 

gttgggtttg ttttttaggt agtagggtat ttgttggttg tggggttgta ggaaaggtga 2460 

tagttgtggt ggtgggtgta gtagtattag tgggtttgga gatatgttgt tgagtggggg 2520 

gaaatttttt aggttgttgg agttgaatgt tgtagtttta gatttggggt tgtaggggag 2580 

gttggtttga ttgtagattt gtgtgttggt ggtggttgtg gtgttgaatt tgtaggtggt 2640 

gtttttgggg tagttgtata tggtgggttt gttgttgttt aggtatattt tgtttagggg 2700 

ttgttttagg gggattttga gttgtggatg gtttaggggt tttagtttgt ttttttggat 2760 

ttgatgtagt agggttattt tagatgtttt ggtgtggagg gttatggtta tggtttgtgg 2820 

ttgtgggtag ggtgtagatt gtgtttttgt agggtagaag gtttagaaat tggtgggtta 2880 

tttggaaaaa gagtatagtt tgaggttaga ggtgatgtag tgtatgtttt gttgatatgt 2940 

gagttttggt tttggttttg ttttgggagt ttgtgggttt ggtgaagttg ggtgatttga 3000 

tgggagtaag tgtagtttta ggatgaatgt tttttgttag tttttgggtt tttgggtttt 3060 

taattttaag tattggtttt ttgagtttat atgtattata aaggtgttgg aggatggtta 3120 

gggattgttg ttttgttttg atattggttt aaatattatt ttaggtataa tttgatttgg 3180 

agtgatttta aagagtagtt tttttgaatt ttattttatt tgttgttgtt gttggataga 3240 

ggttgagttt tatggttagg gggtgggggt gtatgaggat ttgttaaagg tggtttaggg 3300 

aagattgggt ttaaaataaa tgtgaaagat ggatttaggg gttggttttt tttaatgtgt 3360 

tgttttatgt gttttgtgtt agattttgat atatttttgt ttgttttttt ttttgtttga 3420 

tttttttttt ttgttggtta gaaatatgta gtgtgtatat aggatgattt tggggaggat 3480 

tatattgtaa ttgagatagg gtagatagaa tggggtgtgt ggttgtatat gtagttagtt 3540 

atagatagtt atatttagta gttgggggaa ttgatagggg gtatttgagg ggaagggggt 3600 

ggagatttag ggtatatata taggaagagt tgtattttgt tattaggaga atgtaatttg 3660 
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ttaggatttt 
tttttgagat 
ttttattagg 

gggtgtttat 

aatgtgattt 
gttatattgt 
ttttagtttt 
tttagatgaa 
gtatatatat 
aggagataaa 
tgaataaatt 
taattgttat 
gttaagttta 
atttaatttt 
attgagttta 
gtgtattgta 
. tagatgattt 
gtaaaaagta 
tataaagaat 
tagtaatttt 
taaaatgtta 
gtttttttag 
ggaaggaatg 
tggatgtgtt 
ggtagttttt 
ggggagtgag 
gttgtattgg 
ttttatgttt 
ttttatggtt 
atatgggtgg 
ggattaaatg 
tgattgtatt 
tagttgaaat 
gtttagttga 
tgaatataag 
ttgattgtga 
ttgattattt 
ttagagtgtt 
gatattagtt 
ttttggtttt 
ataaaaaaga 
ttgtgttttg 
ttagtattga 
tgttggtgat 
ttagaaaaaa 
aaggaaaggg 
ttttttatag 
aaaggtagtt 
ttttgttttt 
ttttttgttt 
ttttttttgt 
ggtaaagatt 
tgttgggtat 
taattttttt 
attgtttttt 
taagtttttt 



agtttttttt 
ttttttagtt 
aatttttatg 
gttttttata 
ttttatatta 
tggtttggtt 
ataatttgat 
aaataaatag 
atatatatat 



tttgtaaaat 
tgatttgtgt 
tgaaggatga 
ggttggttat 
aatataattt 
tagttaatag 
ttttttaatt 



aaataggtga 
ttttttgttg 
tttaaatatt 
ttgttattaa 
tataaaaata 
gtattaatga 
gatatttttt 
aatttttata 
ttttataggg 
tttataagtt 
tatttttttg 
aagttataga 
aaatgtttta 
tgtttgtatg 
tgtgtgtgga 
tttggtagat 
gggagatttg 
ggaggagaaa 
tttgttgtta 
aaaatagaat 
tttttttgaa 
ttttattggt 
aggaaattag 
gtaatatttt 
ttgttttttg 
ataaaggtag 
ttgggaatga 
ttttaattat 
tatgtttagg 
attattttgg 
gggatattat 
aatagaaagt 
aaagattttt 
aaataaatat 
agtggtgttt 
gtaagagtat 
gatttgtaat 
gttttgattg 
tttttaatta 
gggtttggtg 
ttggtaaaaa 
tggatatata 
gatttggttt 
tgtaggtttg 
gtgtttgtgt 
ttgagggttg 
agttttataa 



aaaaaagttt 
atgtgtatat 
agtatattat 
tagtttaata 
taatagagta 
gagttatatt 
gattattgtt 
ttatagattg 
tttatggtta 
aattatattt 
tttttaaaaa 
ttgaaagaaa 
ttaggatata 
gagagaaatt 

tgggtttgta 

tgtgagtggt 
atggtaggga 
atgtagttgg 
ttgtttgttt 
ttttagtttt 
ttttaagtaa 
gttagttaaa 
agaatataat 
tttagttttt 
attttggaat 
tttattttgt 
taaatatgtg 
atatttttgt 
ggttttttat 
ttttttggtt 
gttttaggat 
atattaaatg 
gtagttattg 
tataaaaatg 
agtattgtag 
tggatataaa 
tttgtaggtt 
ttagggaggg 
tgtttattat 
tttagagatt 
tttttttttt 
agtatgggtg 

ttttatttgg 
tgttttagtt 
ttttgttgaa 
tttaaatatg 
ttagtattat 
attttttttt 
tttttttttt 



seq_prot. 
gtttttaaag 
gtttatgttg 
tttagaaaaa 
gtaattaaaa 
taggttatga 
atttttaaaa 
tatttgggtt 
ttaattgaat 
atatatatat 
gtgtttataa 
gattttgata 
tttgatgaag 
tttgattatt 
ggataatatg 
ttagtaaata 
gtaattatta 
ttaattttta 
atgtgaattt 
aaggagatat 
gatattagtt 
aaaataagtt 
gaagttaagg 
ttagtgtgtg 
tttgggaagt 
gttattgtat 
atagggagta 
tttgttttta 
gaggaaaaat 
agtgtatgga 
ttaataataa 
agtttttgaa 
aaatgttttg 
taattaaatt 
aaaaatatgt 
tttaaattga 
tttttagtgg 
atagtattaa 
atatgatatt 
attttaatta 
agagatttat 
ttaatgatgt 
tgtttgagta 
taaatattga 
aataatggtt 
aggagagagg 
tgatatagga 
tataattaaa 
agtatttttt 
ggaaagtata 
gtttttagtt 
gtgatttatt 
tgtgttttta 
agttattatt 
tgttttttta 
gttttttata 
tttattgtat 



txt 
tagatagatt 
gattatagta 
tttttggtta 
ttttagaaaa 
attaaagttt 
tgtatttttg 
aggatatgta 
taaaagttaa 
atatatatat 
ttttggatag 
taattattaa 
tgtttatggt 
ttatattaag 
taaatgtagt 
aagggttaaa 
ttttttaaag 
aatgtttttt 
gttaaattat 
atatatattt 
agaaaggtaa 
tattttgttg 
gttgagagag 
aattaggtag 
agttagtagg 
agtgttggat 
gtgtggtata 
tttttggagg 
aggtttgttg 
tattaagttt 
aagttttttg 
tagaggtatt 
gtattttagg 
tataaatttt 
attattaaaa 
ttttaggttt 
tgtttatatg 
tatgtaatat 
taatggaggt 
taatgttagg 
gagtggtttt 
tattatgtaa 
taggagagtt 
aaagaaagat 
atttatgttt 
aataggggaa 
agaataagaa 
gttagtttaa 
tttaaattta 
ttgtgttatg 
tttttggttt 
ggttttttta 
gttttatttt 
tttaaatatg 
gttatttaga 
tttaattaag 



tttttataat 
ttgtatttgg 

gggtgtatat 

ttgaatataa 
tggtaattat 
tatgtttatt 
gaattaaata 
atatgtatat 
atatatatta 
tttatatttt 
tattttgttt 
ttatttaaat 
tatatatttt 
tgaagttaaa 
aatatattag 
taatttttta 
taaaatatat 
atgtaaatgg 
ttatggagaa 
gttgtttttt 
gattaagaat 
tgagaaggaa 
agagagtgtg 
tagggtattt 
gaatggtagt 
gttagagaaa 
ttggaaagna 
tgaattatag 
ataaaatagg 
ggatatatgt 
gttatgtttt 
gatgtgtttt 
tttatgaata 
gtttaggata 
ttgagttgta 
gatttatttt 
tgaggtgttt 
tttgttgtta 
agtggttggt 
tgagattagt 
atatatgttt 
ttttttatag 
tgttattttt 
tagattagtt 
aggagaagga 
ggttagttgt 
gaagtttagt 
ttttggtgag 
ttgatttttt 
ttttttttat 
ggtgaagaag 
tttagtggtt 
gtgttttttt 
ttaaaatttt 
aggtaatttt 



<210> 364 
<211> 3501 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 364 
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3720 

3780 

3840 

3900 

3960 

4020 

40S0 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7001 



atggttattt 



60 



seq_prot . txt 

ttttgggtag ttgaagtaaa attaagttta atgataagta aatataatta ttattaaaat 120 

tttttattta tgtttgttaa agtaatttaa gtatgatttg agaaggattt tgtattttat 180 

atttgagttt ttgtggatga attgtaattt agtttaatag gtagataaga ttgaaaattt 240 

aatttaggag tatgtgtttt taataatagt tgagttttgg ttaattttag tggttatatt 300 

ttaattattt atatattgtt gagtgtttaa attgtgttta aagaaggtaa aagttaattt 360 

gtaattaatt tagttgtttt tttgttttat "ttaat-ttt tgtatgttat tttttttttt 420 

540 
600 
660 



ttgtttataa atatgttttg attatgaggt atttttggag tttttgaatt tgttgtgatt 
ttggaagttg ttttatttgt aaattattta ttatttaatt aaattgtttt aaatttaatt 
ttgttgaagt ttttttttaa taggtttaga aaaaataatg gtaaaaatga atgaaaattt 
aataattttg gaagtagaaa aggttggggg ttttaataag tgtaaatagt tttattttta 
tatttttttt atggtaatta taatttagta tattatatat atattttttt gttttttgta 720 



ttttggttta gggtaaagtt ttttaaaata ggtattgtta attagtgtta ttaagaaggt 

ttggatgttg ttttgtggga atattttaaa gaggaatgtt taaaaggaaa agggggatgg 840 

gttgggagaa gggtattagg tgggtatttt aaaattattt ttagggttat aggtttaatt 900 

tatttggttg tggatgttag agttgttatg gtaagaagga agtaaagttt tttgtaataa 960 

ttaaagtttt tagaagtagt gtgttttatt gtttattagt gtgttgtgaa gtttggtgtt 1020 

tatttatagg gtttttttta gtattgttta ggttttttga gtgttttagt atagtagttt 1080 

ggagtttgtt ggtttggtga ttaagatata ttttagggaa tatgttatgt agtggagttt 1140 

tttttttggt attgtatagt aaaaggaaag ggttgttggg tgtttgtggg ttttgggtag 1200 

ttatagaagt tattgtgttg gtggggagga gggggattga tgtggtttat gttttgggta 1260 

gttttatttt ttttgtttgt gaagggtttt tgtttggtgg gaggagagag gtgtgtttta 1320 

tttgggtttt tttatatttg ttgttgtttg ggttgatttt gtgggttttg tttggtgttt 1380 

tagttgattt ttgtttagtt ttgggtttat gggtgtggtt agtagggtgg gttagggtgg 1440 

tggggtgtga tattgggagg aagtgtgggt tgtttgtttg ggtgtgttaa ggaagttgtt 1500 

taaaatgagg aagagttgtg ggtttggtgg ttgaggttat tttggtggtg gttggagagt 1560 

gaggaggagt gggtggtttt gtgttgtgtt tgtttttgtt ttatttggtg taggtaggtg 1620 

tggttgtgtt ttttatttgg ttgggatttt ttggtaagga gaggaggtta tggggaatga 1680 

tgtgttgttt ttatgttttt ttttgtttta tttttattgg ttgaggtaaa agtgttgaaa 1740 

ttatgtgaat aaaatatagg tgggttttgt tagttttgtt tttgaattta tttgtgtttg 1800 

ggatttagaa gttgtgttgg gagagagggg tttaggtttg ggtggagggg atggaggtta 1860 

gattgtgtgg aaagtgattt gggtatttta gggtgtttag gtttttaggg agtgtggaaa 1920 

gtgtggttgt ggtttggttt ttgggagatg tgggattggg attaggtata gtgtgaggaa 1980 

gttgattttg gagttagaat attttttttt ggttatttat atgaatttat tggaaaatgt 2040 

tgtagtgttt attaaagtta tttaaagtag aaatgtttag atgttttatg agtttagata 2100 

aattttttat tataaaaaga aatagtagtt gtatttaaat aataattttt ttgaattatt 2160 

attaaaattt agtataatta tttttgtgga tatattttta ttgttaagta atttatttag 2220 

ttaatgaatt tggagagtaa gaaagtttta tgtagtaaaa tgtaaatttg agttaagagt 2280 

taagggtgtt tttttttgtt tgtttgtttg tttttggtaa tgtggtttta aaattttaag 2340 

tttatttaaa aattatataa atgtaattta tttttttgtt ggaatgttta aggattagaa 2400 

agataattgg agaagtgaga gtttgaattt ttttttatgt tggaaatagt gttgtaaaat 2460 

attttttgag ttttgtttga tttagtaaag atttagttga atttaagtag agtttaaagt 2520 

attgtagtgt gtagtaaaaa aaaaaaagag ttgaagatgt tgtgttatat ttgatttttg 2580 

gtattaaaaa taaaaaaagg aattatttaa tttttaagag tttttggaag aaatggaatt 2640 

gattttatta tttgattttt ttgttttagt agggataaat ttattgtttt tatataggta 2700 

gtattttaat taaatgtaat taaatatatt gatgtttttt ttttttttag tgaaagtttg 2760 

agtagtgtat agatagaatt atattttttt aaaaaggttt aaatatatgt aaattattaa 2820 

gtgttttaag tgagaaattt ttgttagttg aaattatttt attaaattat ttttgttttg 2880 

aatggtattt ttgttttaat gtatttagaa aatttttgta ttaaaataga aattgatttg 2940 

tattattttt tgtatttgaa gtgtatttta aagtgtattt gaatgagaga ttataattaa 3000 

attatttgat ttgtgtttta tttttttatt tttttatttt ggatttgaaa aggttggatt 3060 

tgaatattag gaaaaaagag atttttttat tgagggattg tgggaagatt tttttttaag 3120 

tttttgtatt tgaagtgaaa tataaaatag gattttataa tagttttaat aattagaaag 3180 

ttttaagtta ggataaaaat ggggtgtata ttttaggttt ttattttttt tgagagatga 3240 

aattggtaga aatgttattt tattgttaga ttttatttta gaaatgaatt attttgattt 3300 

ttgttaggtt tttatttaat attttaaatg taggtttaat ttttttgttt ttttttaatg 3360 

gaagattttt tagtagaatt ttatttattt agtatattgt ttgtttatgg ttataattta 3420 

atatgtttgt gttaagatgt attttggaat ttaagttttt ttagaatttg gaattaaaat 3480 

ttagaattta aattttagaa t « 3501 

<210> 365 i • 

<211> 3501 
<2l2> DNA 

<213> 'Artificial sequence 

*?220> .* i i. . : 1 : -v tr ; ■ * 

<?23> chemically treated, genomic DNA (Homo sapiens) 

<400> .365' :. - ;•* : . .i • . » • -i . : > ' ; * . ? • ■ i< * • * ' 
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gttttgaaat 
tatgttttgg 
aagttttatt 
tattaaatag 
aaaataatgt 
tatttttatt 
attttatttt 
tttttttttt 
ataagatata 
ttttaaatat 
tgttaaaata 
aagatttttt 
taattttatt 
agttgtattt 
aagaagttag 
tttttttttt 
ttttttatta 
gttaggtaaa 
tttttatttt 
tttatatagt 
agataaaaag 
tttgtttttt 
gtaattatgt 
ttttttttgt 
gtaattttga 
tgttttagtt 
agggttgggt 

tgggttattt 
tttggtgtag 
atttgtattt 
aaagggtatg 
gttgggtagg 
tggggttatt 
ttgtggtttt 
tttttttagt 
gagttgggtg 
gtaggtgtag 
tgtaggtaga 
ttagtgtggt 
tgttatgtag 
gttattaaat 
ttgagaggga 
tgttgttttt 
tttgatgttt 
gtttgatatt 
tttttataaa 
aagttttatt 
gtaattgtta 
ttttttgttt 
gttaaaagaa 
tgtgaatggg 
ttagaatata 
agaaataatt 
tagtagtgta 
aaaggtatat 
tttatttata 
tttaataaat 
ttttgtttta 
tatagatggt 



ttagatttta 
tataagtata 
aagaagtttt 
gggtttaata 
ttttattaat 
ttagtttaaa 
aaatatagag 
tttagtgttt 
aattaagtaa 
aggggataat 
aaagtgttgt 
atttaaaata 
tatgtattgt 
aattaaaatg 
atggtagaat 
atttttaata 
tatgttgtaa 
atttaaaaaa 
tttaattgtt 
ttttaggtgg 
aaatattttt 
aagtttattg 
taaattttag 
agtaaaagat 
taaatattgt 
ttaagattga 
tgtgattgta 
tttgtatggt 
tttttggatt 
tatttatatg 
aaagtagtgt 
ggatgtggtt 
tgtttttttt 
tttttatttt 
gttgtgtttt 
ggaattggtt 
aggagtttgg 
gaaggtgggg 
ggtttttgtg 
tgttggggaa 
taataagttt 
ttttgtaggt 
gaaggtttta 
ataattaaat 
ttttttttaa 
atggtattta 
ttaagttaaa 
tggagaaaat 
ttagggttat 
aattttagta 
gtagttttta 
tttatagata 
ggattggtta 
tgaatggttg 
atttttaaat 
aggatttaaa 
ataaatggaa 
attatttgaa 
ttttagagtt 



aattttaatt 
ttggattata 
ttattaaaaa 
aagattaggg 
tttatttttt 
attttttagt 
gtttaaaaaa 
aaatttaatt 
tttgattata 
ataaattaat 
ttagaataaa 
tttaataatt 
ttggattttt 
ttgtttgtgt 
taattttatt 
ttaagaatta 
tgttttgaat 
tattttgtaa 
tttttagttt 
gtttaaggtt 
aatttttaat 
gttaagtaaa 
tagtggttta 
ttgtttaagt 
ggtatttttt 
tttttttgtg 
ttttttgtgt 
ttgatttttg 
ttgagtgtgg 
gttttagtat 
gttgtttttt 
atatttattt 
tgttttttag 
gggtaatttt 
gttgttttgg 
gaagtgttgg 
gtggggtgtg 
ttgtttggga 
attgtttagg 
gagattttat 
taagttattg 
gaatattaga 
attattataa 
aaattaaatt 
tttatttttt 
gatttttttg 
atgtgagaag 
atagggatgg 
tggattttta 
gaattaaatt 
gaattatagt 
aaaaaaggga 
taggttggtt 
aagtatggtt 
taggttttta 
tatagaatat 
gattttgata 
aaaatagtta 

g 



ttaaattttg 
attatgggta 
aaagtagaga 
tgatttattt 
aggaaagata 
tattaagatt 
aaattttttt 
tttttaaatt 
attttttatt 
ttttgtttta 
aatgatttga 
tatatgtatt 
attggggaga 
gaaggtaatg 
ttttttagga 
gatatggtat 
tttatttgag 
tattgttttt 
ttgaatattt 
ttggagttat 
ttaaatttat 
ttatttgata 
gaagggttgt 
ttataagatg 
agtggatttg 
ttgtgtttga 
tttttggggg 
ttttttttgt 
gtaggtttag 
ttttattttg 
gtaatttttt 
gtgttaggtg 
ttgttgttgg 
tttaatgtgt 
tttgttttgt 
gtgaggtttg 
tttttttttt 
tatggattgt 
atttatagat 
tgtatggtat 
tgttggagta 
ttttatggtg 
aaggttttgt 
tatagtttta 
tttttttttt 
ataatattgg 
taaaaaaata 
gattatttat 
tttatttttg 
taaagtagtt 
ggatttagag 
agtgatatat 
tttgtttttt 
attgggattg 
attttgtttg 
agagtttttt 
atggttatat 
ttgtttttga 



aaaaagtttg 
aataatgtgt 
gattagattt 
ttaaagtgaa 
aagatttaaa 
attataaaat 
ataatttttt 
taaggtagaa 
taaatatatt 
atgtaagagt 
taaaataatt 
taagtttttt 
aggaggatat 
aatttgtttt 
atttttaggg 
aatattttta 
tttgattgaa 
aatgtaaaaa 
taataaaaga 
attgttaaaa 
attttattat 
atgaaaatgt 
tgtttgaatg 
tttggatatt 
tgtaaatggt 
ttttaatttt 
tttgggtgtt 
ttaggtttga 
gagtgaagtt 
gttgatgaag 
ttttttatta 
aggtgagggt 
ggtggtttta 
ttgggtaggt 
tgattgtgtt 
tggaattggt 
tttgttggat 
attggttttt 
atttagtggt 
attttttgga 
tttgggaggt 
tattagtggg 
ttttttttta 
agaatggttt 
ggatattttt 
ttggtagtgt 
tatatataat 
atttattgga 
ttattatttt 
taattgagta 
attttaggga 
agaaattgga 
ttgaatatag 
gttaagattt 
tttattaagt 
ttagattata 
ttgtttatta 
gatggagttt 



aattttaaaa 
taaataagtg 
gtatttagaa 
atttagtaat 
gtatatattt 
tttattttat 
agtagaaaaa 
gagtaagagg 
ttaaagtgta 
tttttaaata 
ttaattgata 
tgagaaaata 
taatgtattt 
tattaagata 
attaaatggt 
attttttttt 
tttttgttaa 
aagatttaaa 
atggattgta 
atagataaat 
ataaaatttt 
atttataaga 
taattgttgt 
tttattttga 
taaaggaaaa 
gtgttttttg 
ttggggtgtt 
gttttttttt 
ggtggaattt 
gtagaataag 
gaaagttttg 
gggtgtagtg 
gttgttgggt 
ggtttgtatt 
tatgagtttg 
ttaggtggtg 
aagggttttt 
tttttttttg 
tttttttttt 
gtgtattttg 
ttgggtagtg 
taatggggta 
ttatgatggt 
tgagatattt 
ttttaaaatg 
ttgttttaaa 
gtgttggatt 
gtttttagtt 
ttttaaattt 
atgaatgatt 
tgttttatgg 
ggtaaggtag 
tttgaatatt 
agttattgtt 
taggttatag 
tttaggttgt 
ttggatttaa 
gtagaatgtt 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3501 



.. * t 



1 1 



» t 



seq_prot.txt 

<400> 366 

aggtaataat aattatgatg tattaggttt tgggttaaag ttattatttg tattaggtta 60 

ttaaattttt aggttaattt attatagtta tgagttattt aataatagat atatttaata 120 

aattttttgt taggtaattt tattattgtg ggaatatttt agagtggatt tatataaatt 180 

tagatggttt agtttattat atatttaagt tatatggttt agtttattgt ttttaggtta 240 

taaattttta tagtatatga ttgtattaaa tattgaaggt agttgtaata tgagggtaag 300 

tattagtgtt tttaaagata gaagatggtt aggtgtggtg gtttatgttt gtaattttag 360 

tattttggga ggtttaggtg ggtggattat ttgaggttag gagtttaaga ttagtttggt 420 

gaatatggtg aaattttatt tgtattaaaa atataaaaat tagttagtgt tgtggtattt 480 

gtttgtagtt ttagttattt aggaggttga ggtagaagaa ttatttgaat ttgggaggta 540 

gaggttggag tgagtttaga ttaggttatt gtattttagt ttgggtgata gagggagatt 600 

ttattttaaa ataaataaat aaataaataa ataaaatata gaagatgtat agtaaaaata 660 

tggtaattgt ttttgtttgt ttgttttgag atagggtttt gttttgttat gtggattgga 720 

gtgtagtggt attattaggt ttattgtagt tttgattttt ttggtttaag tgtttttttt 780 

attttagttt tttgagtatt tgggattata ggtttatgtt attatgtttg gttaatttgt 840 

tttgaatttt agtagagatg gggttttatt gtgttgttta ggtttgtttt agttttttgg 900 

ttttaagtaa tttttttata ttagtttttt aaagtgttaa gattatagat gttagttatt 960 

gtatttagtt agtaatataa ttttatggga ttatttttat atttgttgtt ttttgttgat 1020 

ttatatattt ttatgtaatg tatgattgtt attattatta ttatttttat tttttagatg 1080 

gggaaattga ggtataaaga atttaatttg tataagttta tttgtttagt gatggaataa 1140 

agatgtgaat ttaggtagtt tggttttaaa gtttatatgt ttaataatta tattagatta 1200 

ttagattgtt tttttttttt tttttttttt ttttttgaga tggagtttta ttttgttatt 1260 

taggttggag tatagtggtg agattttggt ttattgtaat ttttgttttt tgggtttaag 1320 

taattttttt gttttagttt ttttagtagt tgtgattata ggtgtttgtt attatattta 1380 

gttaattttt gtatttttag tagagatggg gttttattat gttggttagg ttggttttaa 1440 

atttttgatt tttggtgatt tttttatttt ggttttttaa agtgttggga ttataggtgt 1500 

gagttattat gtttagttta gattgtttta tttttgtatt tgtatttatt tatttattta 1560 

ttttgagata gggttttgtt ttgtagttta ggttgaagtg tagtggtgta atttagttta 1620 

ttatagtttt tatttattgg ggtttaaagg atttttttgt tttagttttt ggagtagttg 1680 

gggttatagg tatgtattat tatgtttagt taatttttaa atattttttg gtagaagtag 1740 

ggttttatta tgttgtttag attggtttta aatttttagt tttaagggat ttttttgttt 1800 

tggtttttta aagtgttgag attataggta tgagttatgt atttagtttt tttttaaaat 1860 

ttttttgaga gataagattt tgatttgttg tttaggttgg agtgtagtgg tgagattata 1920 

gtttattgta gttttaattt ttgggtttaa gtattagatt ttttttatta tattttattt 1980 

tatatgtgtg tggttttaat tttgtttttg ttatttttta gttgtatgtt ttaatttaat 2040 

ttgtttggtt ttgttttttt taatagaagg atggttttgg ttatgggtta tagttagtaa 2100 

tgtttaagta ttagggttgg tgagtgtttt gttgtggtat ggttttagtg ttgtgttttt 2160 

gaatttattt gtttttttta atgagagaag gttttagatg agggttgaat tttttttgtt 2220 

ttgtttatgg tttttgaatg ttgggggagg agtgtatggg gaggggtggt ttttaaatgg 2280 

gttattgtta ttaatagaga ttttaaatat tgtttgttaa aaatatttga ttggaggagt 2340 

ataaaagtgt agttgagttt agtgttttgt atttttttga gtagatgttt agagtagagt 2400 

tagttagtat gattgagtgt tgtgtttttt ttttgttttt gtggggtttt agttgggatt 2460 

ttttttgtga ttggtatttg tatagttgtt tttttgatta ggtttttggg ttgttttggt 2520 

tgttggagga gtggttgtag tggttaggtg gtagtagttg gttaggttat gtgtgttttt 2580 

tgttttttgt tgttattgag agttttgtag tggttgtgtt tgtttatagt tgtgtgttta 2640 

gttggtaatt tagtagtggg gttttggaga tttggtatat tgtggattgt tggtgtgtgt 2700 

ttttggatgt taattatttt gttttggatg agttgatggt taagattaag gatggtgtgg 2760 

tggagattat tggtgagttt ttttgttttt gtaggggaga ggaggaggtt agtagggtgg 2820 

gtagggttgg gggtgtgtgg ttgaaatggg ggttttgggg gtttggggag ttaaatgttg 2880 

gtttagtatt gggaaaaata ggatttttga tttttttgtt taggaattgg gagtgtgggt 2940 

tgtttttaag ggtgtttttt gttttgtaat tttagtgttt tgggaggttg agatgggagg 3000 

attgtttgag gttaggagtt taagattagt ttgggtaata tagtgagatg tgtttttttg 3060 

ttttgatttt gtgttattat aaaaaaaaag taaataaaaa tttttttaaa gattattgat 3120 

gaagagagaa aatgtgtttt tttatagagt ttttttttta tttatagttt tatttttaga 3180 

taagtgggga gttttttggt gtggtgttag tttttagttg ttgagtgggt gtgtgtgtgg 3240 

ttttaagtgt gtttgtgtat tgtttatttt ttagttttgt gttttgtttt gtttttttta 3300 

aaattttgaa ttgaagaatt ttttggaagt ttttgagagt ttagattggt gggtatgttt 3360 

ttatttttaa ttttttttgt taatttttat tagtttgtag ttttggttgt ttttaagtag 3420 

gaggtggggt ttttggttta gtggggttga aaggtagttt ttttttttgt agtttgattt 3480 

tttttttttt tttaaaggta agtatgagga gtggtaggat gagtatggtt atattttttg 3540 

gtgttttatg tggaaatata tgtgagtttt ggtgttaggt tggggtgggt gggtggtgtg 3600 

ggggtggggt tagggaagag ggtataggga tttatttggt gtgtaatgta atgtttgttt 3660 

tttttttttg tatgtttagg ttgttttttg gtgtggattt tatttaagtt tttttttttt 3720 

tgttttttga gggtatattg attgtggagg tttttatgtt taagttagtt atgtagttta 3780 

atgagattat tattttagtt atttttgagt tgtgggttta gtttgggggt ttagaagttg 3840 

ta[aaatttga tgagattgtt gttaagtaaa gttttagttt ggatgtttat ttttgttgtt 3900 
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gttattggtt 
tttttaaata 
ggaagtttta 
ttgggttatg 
gatagaggtt 
ttgaagtagg 



gtgttttttt 
aagtttaaag 
ggtatttgtt 
tgggtatatt 
ttagggttgg 
aggagt 



tgttatttgt 
taattatttg 
atttgttggt 
ggtgtaggtt 
gtgtagtggt 



seq_prot . 
gtgttttttt 
ttattggttt 
ttttaggagt 
gttggatata 
ttatatttat 



txt 

gatatattta 
aggttttggt 
tatttttgtt 
ggttgattta 
aattttagta 



ttttttgttt 
gtttgtggaa 
taggtttgtg 
tatttataaa 
ttttgggggg 



<210> 367 
<211> 4216 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 



<400> 367 

attttttttg 
ttttaagatt 
tatttatatg 
ggtgtttgag 
gaattttatt 
aggtatagtt 
gttttattgg 
gggatggtga 
gtgtttttag 
gatgtgtaga 
tttttgattt 
tttttgtgtg 
"tgggggga 
ttagaggttt 

agggggttgg 

ttttgattta 
gtaggtgtat 
gggaattttt 
tgtatttttt 
tggtgtgggg 
tttggtttta 
agtgttttta 
ttttttggtg 
atgtttttgg 
ttattggtga 
tggttgatat 
ttgttgagtt 
atggtggtgg 
gattattttt 
tattagttgt 
ttggttatgt 
ttggttgtgt 
tattaatggt 
taggggttgt 
gaaagtaaat 
ttttggtgtt 
ggatagagtt 
gaattatatg 
tgaggttgta 
tttgtttttt 
gtattttggg 
gtaatatagt 
tgtatgtttg 
tgagtagagg 
aaa f ttttgtt 
ttgggtgtgg 
attagaggtt 



tgaggtagga 
attgtatttt 



ttttaatttt 
tttgttttta 
gtttagtatg 
gttttttttt 
tgagaaaaat 
agtggtggta 
attttgtagt 
ttttgttgga 
gggataggga 
gaggaaaggt 
tatttttatg 
aagtattggg 
agagggaaat 
tattttttgt 
ggatgggggt 
gagttttggg 
ttggagttgt 
tgtttatttg 
ttttttattg 
"ggggtggg 
agtgattttt 
gaagtgattt 
ttgggttaat 
ttttgtttgt 
tttttattat 
ttagggatat 
gtnggttgag 
ggggtagggg 
ttggtagttg 
ggaaggggtt 

tggttgattt 

ttttatgttt 
aatgatttgt 
gggtggggtg 
gaatttgaga 
taagtgttgt 
agataggttg 
tgtgtgagat 
gtgagttatg 
aaaaaaattt 
aggttaaggt 
gagattttat 
tggttttagt 
ttgtggtgag 
ttaaaataag 
tggtttatgt 
aggagtttga 
attagttggg 
gaaittgtttg 
agtttgggtg 
atttagtaat 



ttaaagtgtt 
tggatgtgag 
ggtttgagta 
ataaatatta 



agaagataaa 
gtaggggtgg 
ttttgggttt 
ttgtgtggtt 
ggaggaaatt 
aagtgttata 
ttatttattt 
agatgtagtt 
tagattgtgg 
ttggaagtag 
gtgtttgttg 
aggaatggag 
gtatatgttt 
gggatggggt 
atgattttta 

ggggtgtgtt 

ttgttttggt 
gtatttttaa 
gtttaatttt 
tttgttagtt 
gttatttttg 
gtgttagtgg 
tgtgtggttg 
gtgtatgtag 
gggtagtttg 
ttagttgggg 
tgttttggat 
ttttagttgg 
ttgagggttg 
aagagggttt 
gtgtgatgtt 
tggttgtggt 
ggttggggta 
agaatgtgat 
attttattat 
taaaaagggg 
agaagggttt 
ttttattaaa 
tattttagag 
ttggattgta 
taaatgaata 
ttgtaatttt 
gattagtttg 
tgtggtggtg 
aatttagaag 
atagagtgag 

ttggtgtggjL 



gggattgtag gtatgagtta 
ttagtttgtg tttagtaatt 
aagatgattt ttgaaagtta 
gggtttgggt tagtgatagg 
tgtattaaaa gaatatatag 
gtatttgggt taaggtttta 
ttaagttggg tttgtgattt 
agtttgggta tgggggtttt 
tgggtggggt ttatattggg 
ttatatattg ggtgggtttt 
attttgattt ggtgttagga 
atgtttgttt tgttgttttt 
gggaggggat tgttttttgg 
ttaggattgt aggttggtag 
gtttgggttt ttagaaattt 
tagggtgtgg agttggggag 
atttagtggt tagaaattgg 
tgtgggtggg aaggggattt 
aaaaaatttt tgtttgtttt 
trtgttatgtt gtttaggtta 
tttttaaagt gttgggatta 
tttttgagta agggaattag 
ttaggttttt gggatttttg 
tttttttttt ttttgtagga 
gttttgattg ttagtttgtt 
tttgtagtgt gttggatttt 
taggtgggtg tggttattgt 
tttggttagt tgttgttgtt 
aaggtttggt tgaagaggtg 
ttttgtagga gtgagaaggg 
gtttgtttag aaaagtgtgg 
gtatttttag taggtggtgt 
ttttttttta tgtatttttt 
agtttttatt tggaattttt 
ggagttgtgt tatggtaggg 
ttgtggttag ggttgttttt 
tataattgag aagtggtaga 
aaaaggagtt tggtgtttga 
tgtattttag tttaggtaat 
ttgggtgtat ggtttatgtt 
tttgaggtta ggagtttgag 
aaatatttaa aaattagttg 
gttgaagtag gaggattttt 
ttattgtatt ttagtttggg 
aatataaata taaaaataaa 
agtattttgg gagattgagg 
gttaatatgg tgaaatttta 
ggtgtttgta attgtagttg 
gtagaggttg tagtgagttg 
attttatttt aaaaaaaaaa 
tgt t aggtgi: _gtg gat-ttt g 
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ttgtgtttgg 
tatattagtg 
gtaaatggta 
tggttgtttt 
gtggtggggg 
tttggtggta 
gaaggtgatt 
tatggttagt 

gggtagtttg 

tgtgtttttt 
tttatgtgta 
tgtgtttgtt 
ttttgttagg 
ggattaatag 
ttggaaagtt 
tgagtagtat 
tattgtgtta 
tgtagaaaag 
ttttttgtaa 
gttttgaatt 
tagagtagaa 
gagttttgtt 
ttttaattgt 
gtaggggggt 
tggggtgaag 
tgagattntg 
ggggtttttg 
taattattgt 
gttatgtggg 
gatgtggtgt 
ggtgttgggt 
ttgaggtttt 
ttttagtgtt 
ttttgttaag 
tatttgttgg 
ttgttaagga 
ggtaggattg 
gtttaggagt 
agattaaagt 
tgtaatttta 
attagtttgg 
ggtatggtgg 
tgaattttgg 
ttatagagta 
gtagtttggg 
tgggaggatt 
tttttattaa 
ttggggaggt 
agattttatt 
aaagaaaaaa 
aagttagatt — 



3960 
4020 
4080 
4140 
4200 
4216 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060- 



3900 
3960 



seq_prot.txt 

gtttgaattt atatttttgt tttattatta agtagatgaa tttgtgtaag ttagattttt 3120 

tgtgttttag tttttttatt tggaaaatag agatgataat gatggtaata gttatgtatt il80 

gtataaagat gtgtaagtta gtaagggata gtaaatatga aggtggtttt ataaaattat 3240 

attattggtt gggtgtagtg gttgatattt gtaattttag tattttggga ggttgatgtg 3300 

ggaggattat ttgaagttag aaggttggaa taagtttggg taatatagtg agattttatt 3360 

tttattaaaa tttagaataa attagttagg tatggtggtg tggatttgta gttttagata 3420 

tttaggaggt tgaggtggga ggagtatttg agttaaggag gttgaggttg tagtgagttt 3480 

qatggtgtta ttgtatttta gtttgtatga tagaataaga ttttgtttta aaataaataa 3540 

atalialtaa ttattgtgtt tttattgtat attttttatg ttttatttat ttatttattt 3600 

atttattttg agatggagtt tttttttgtt gtttaggttg gagtgtagtg gtttgattta 3660 

ggtttatttt aatttttatt ttttaggttt aagtgatttt tttgttttag ttttttgagt 3720 

agttgggatt ataggtaagt gttgtgatat tggttaattt ttgtattttt agtatagatg 3780 

gggttttatt atgtttgtta ggttggtttt gaatttttga ttttaagtaa tttatttgtt 3840 
tgggtttttt aaagtgttag gattataggt gtgagttatt gtgtttggtt attttttatt 
tttagaaata ttaatattta tttttatgtt gtagttgttt ttaatatttg gtgtagttat 

atgttgtaga ggtttgtagt ttaggagtag taggttagat tatatggttt aggtgtgtag 4020 

taggttagat tatttaggtt tgtgtaagtt tattttgaga tgtttttata atgatgaaat 4080 

tgtttaataa ggaatttgtt gaatgtgttt gttattaagt gatttatgat tgtaatgggt 4140 

tggtttgagg atttggtgat ttaatgtagg tgatgatttt ggtttggagt ttggtatatt 4200 

ataattattg ttattt 4 ^ lb 

<210> 368 
<211> 11001 
<212> DNA 

<213> Artificial Sequence 

< 220 > . ^ 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 368 

ttaagttaga tgttttttaa ttatttgtgg ataggttagg tatattttga gtttaatttt 60 

attttatagg tttaatattt tggagttaga aagtttttag gtaaaaagtt tgaagggggt 120 

ttttttatgt tattagatgg atttttgtat ttttagaaga ttttttatat taggaaagat 180 

taaagtatta aggtaatttt ttttggtttt ttgggataat tttaggtttt ggtatgagtg 240 

gtttggaagt ttttgtttta gttataatgt ttatatattt ttggaattgt tttgtagggt 300 
ttgtttttta gtataatttt ttttttaagt tttattgtag ttatagttta ttagttttgt 
ttagtgataa ttaagaaatt aagaattatg tatttatgtt tattttttta gagttatttt 
tttttaggat aaagtttagg gttttgttat tgggttttgt taggagttgt agttgtaggg 

gttgtttatt atttaatagt ttttgtaagt atattgtgaa ggggaagtaa tgattagaga 540 

tagggttagt tgtttagttt ttgtatgttt aggtgtatgt gtatatattt ttatataggg 600 

tagggtgggg tgggaagttt attttggttg tgatgtttag tgtgtttaaa gagtgtaaat 660 

ttgtgggggt tatttattat taagaatttt tgtagtaggg ttttaatgat tttatgtgtt 720 

tttgtgtgtg atagtatatg taggtgtttg atagtatttt tgggtaggta aaaggaagtg 780 

tqgtgtttga tatttgtgtt ttgttttggg ttttgttggg tattttgtaa ggagggtggt 840 
tttttgttgg agggtataga gatagggtgt attagtttta gttgaatttt gatgaagttt 
gtgtaagaat tgtttttgtt ttaaagaaat agagaaatta aattttgata ataggtttta 

ggtgagatgt tagtttattt ggggttaggt tgggtatgta taaattattg tttgtgttta 1020 

tttaagataa ttttagttgt gattttttga gtattaggta tatagttggg tatttgtttt 1080 

tttttatgtt ttttttttga gtagttaatt tattaagttt atgaagaggt tgttgttgat 1140 

ttgggtattg tattttttga ttttttgttt aattttagtt tgagaaaggt taggtgtttt 1200 

ttattttata ggtttgtttt gtaagatggg ttagtatgga tatagggttt ttgaggaatt 1260 

tagggttttt ttgaaaaatg gtttttgggg tagtttttgg aaattgattg i«ttggttt 1320 

tttgtttttg atgtatatat atatagttgg tgtttatttt gaatttatta ttgtttttgg 1380 

ttttgtatgt tttgggtgga taagggaaag atagaattat ttggtttttt tttgttgttt 1440 

gtttagggtt ttagtattga atgtagtttt aaggatatta tagaagtagg ggtaattgaa 1500 

ggtatatggt taggggttag gaatagttga gggattttga agagggattt ttatttaaag i5bo 

taaaattagg ttgggtgtgg tggtttatat ttgtaatttt agtattttgg gaggttaagg 1620 

taggaggatt atttgatttt taggagtttg agattagttt gggtaatata gtaagatttt Xb»o 

atttttatta aaaaaagaaa aaaaaaaatt agttaggtgt ggtggtgtgt ttgtagtttt J./4U 

aattgtttag gaggttgagg tgggaggatt gtttgagttt gggagattgt agttatagta i»oo 

agttattatt gtgttattgt attttagttt ggggaattga gtgagatttt gttttaaaat I860 

ataaaaaata aaaataggtt gggtatgttg gtttatgttt gtaattttag tattttggga i|ZO 

ggttgaggtg ggtggattat ttgaggttag gagtttgaga ttagtttgat taatatggag 1980 

iaattttgtt tttattaaaa atataaaatt agttaggtgt ggtggtatat gtttgtaatt 2040 

ttagttattt aggaggttga ggtaggagaa ttgtttaaat ttgggaggtg gaggttgtag ziuu 

tgaattgaga ttgtgttatt gtattttagt ttgggtaata agagtgaaat tt: 99ttttaa ziou 

aaaaaaaaaa aattagtaaa attatatttt aattgtatat tttgattata gtattttagt 2220 
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360 
420 
480 



900 
960 



tgagttggag 
tggggttatg 
ttattagtgg 
aattattgtg 
ggttttaatt 
ttgtttggag 
tttaggttgt 
agtgattttt 
atttggttaa 
tttgaatttt 
ggtgtgaatt 
tttagggatt 
attgattttt 
gttagatagg 
aatattttgt 
taggagagtt 
tgatggggat 
tttttattta 
gttagggttt 
ttttaagtat 
aattttattg 
tggaaaatga 
tgggaagaag 
agtgttaggg 
gttaatttgt 
tttttttttt 
tttttttttt 
tattgtatag 
tgtttgttgg 
tttttttttt 
gttttttttt 
ttggtttatt 
tagttgggat 
atggggtttt 
tttggttttt 
tgtttttttt 
tttttttttt 
tttagttttg 
ttttttagta 
ttagtataga 
gatttgtttg 
tttttttttt 
atgattttgg 
ttgggattat 
ggttttatta 
tggtttttta 
ttttgagata 
attgtaattt 
ggattatagg 
tttttttatt 

ggggtttata 

agtattattt 
tatttttttt 
agtttttgtt 
atggtatttg 
atagttttgg 
gttttttttt 

. ttttttattt 
ttatatttta 

. tatataggag 
tattttttaa 



gagggagtag 
tttagttata 
ttgtgtttaa 
tggaattggg 
•ttatgtagtt 
.ggtggagttt 
ttttggtgtg 



tgagggtttg 
atttattgta 
atagagtagt 
atatgatgtt 
tgttagaggt 
atttggttta 
agtgtagtgg 
ttgttttaga 
tttttgtatt 
tgattttaag 
attgtatttg 
ggatttttgt 
tttttttaga 
ttttagttag 
tttagaagaa 

agggtttttt 

aaatggagag 
gaatttgttg 
tataggaaaa 
ttaaaggttt 
gggatagtgg 
gggttggagg 
tatttagatg 
taagaaggaa 
ttttgtgatg 
tggatgattt 
tggtgatgaa 
tttttttagg 
aagaggattt 
gtattatagg 
tgagatagag 
gtaatttttg 
tataggtgtt 
attattttgg 
taaagtgttg 
ttttttttta 
ttttttttga 
atttattgta 
gttgggatta 
tggggttttg 
ttttggtttt 
tttttttgag 
tttattgtaa 
agttatatat 
tgttggttag 
aagtattggt 
gagttttatt 
ttgtttttta 
tgtgtatttt 
tttttatagt 
tttaatttaa 
attttgtaag 
atttaaataa 
tttttttttt 
gtatatagtt 
gttagtaatt 
tttttgtggg 
atttttaaat 
gtagttggtg 
tgtagatgga 
tttttggtga 
gagtatttga 
gttggataat 
tttgtttgtg 
gttttatgta 

aggagggtgg 
tagtgattgt 

gtttgtgagt 



ttttggagaa 
gggtgagagg 
atttggagtt 
taattttgat 
atataggtag 
aagatttttt 
ttggattttg 
tttttgagta 
tttagtagag 
tgatttgttt 
atttagagat 
tggaagggtg 
tttttttggt 
aatgtgagtg 
tttttataag 
ttgattttgt 
gatagatgag 
ttattttgta 
aggtaaaggg 
tttggatgga 
gtaattttgt 
ttgtgaataa 
ggaggagtat 
gattattttg 
gttttgagtt 
aggagttttg 
tttttttatt 
attttaaagg 
taatagttat 
gatttatttt 
ttttgttttg 
ttttttgggt 
tgttattata 
ttaggttggt 
ggattaaagg 
atatataatt 
gatagagttt 
atttttgttt 
taggtatata 
ttatgttggt 
ttaaaatgtt 
atagagtttt 
tttttgtttt 
tattatgttt 
gttgattttg 
attagaggtg 
ttgttattta 
ggtttaagtg 
tgtggttagt 
ggaaaatggt 
tttttgtatt 
gaattttgta 
ggatttttgt 
ggtgtttagg 
ttttgataat 
tggatttttt 
taaagtgttt 
ttttttttag 
ataggttaga 
ggagtaggat 
ggattttttt 
atgtggaaat 
gtttggttta 
atagaaatta 
tggttttgtg 
ggttgtttat 
tttttttttt 
tagtgaggtt 



seq_prot.txt 
ggtagtttat tttttttttt 
agtggagagt ggtgtatatt 
agttttttat gttttatata 
ttaagttgta taaaaggtag 
ttttttggtg ggtttttgta 
ttttttttga gatgaagttt 
gtttattgta agttttgttt 
gttgggatta taggtgtgtg 
atgggatttt attatattgg 
gtttttattt tttaaagtgt 
ttttaatttg attatttatt 
gagtgtggga tagggtagtt 
tttattgtat tagttttatt 
ttatagatat agttaagttt 
gttttttgta gaatgatttt 
tttggagttt ttttaagtat 
ggagtagggt ggagtgtttt 
gttagtaagg atgtggggtt 
ggaggggtgg gaatttaagt 
gaagagtatt ggagtaaaaa 
tgggttagta aaaataaatg 
agtagtggat gtgtttgttt 
taggggtagg agaaatgtta 
tttgtagaat agggagggta 
tttatttaat aatgagaaag 
ggttgggatg tagtgatttt 
tttatttaga aaatagattt 
aggaagagtt tttttttttg 
tttagttgga tgtatagtag 
ttattttatt attgtttata 
ttttttaggt tggagtgtag 
ttaagtaatt ttttgtttta 
attggttaat tttttgtatt 
tttgaatttt tgattttatg 
tgtgagttat tgtatttggt 
tgggattttt tagtttttta 
tgttttttta tttaggttgg 
tttgggttta agtgatattt 
ttattatggt tagataattt 
ttggtaggtt ttgaattttt 
gagattatag gtatgagtta 
attttgttat ttaggttgga 
ttggttgaag tgatttagtt 
ggttaatttt tgtatgttta 
aatttttgat tgtaaatgat 
tgagttattg tatttggttt 
ggttggagtg tagtggtatg 
atttttttgt tttatttttt 
tttttttttt aattggttag 
ttaggattga ttgatatgaa 
taagttttgg gttaagattt 
aaattttatt tgaagtatta 
tttatttttg ttaggtatat 
tttggttttt tgagtttggt 
ggggatattt aggaggtttt 
tttttatttt tttaggtatt 
ttgaatgttt atggtttaaa 
atttatttta gaggaaggga 
atagggaaga ggtgagggtt 
ttttttttgt tttttaagtt 
ttttttatat ttttttttag 
tgaggtgtta gtttaaattg 
ggtttattat aagttatata 
agggggtttt ggtatttagt 

ggtaggtttt tggttaggat 
tgatttagga tgtggtaatg 
gtttgttggt attttttggt 
tgtgtggtga agtattgttt-. 
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tgttttggta 
agtagtttag 
tagtgagaaa 
ttttaggtta 
tttgtttgtg 
tattttgttg 
tttgggttta 
ttataataat 
ttaggttggt 
tgggattata 
ttttatttta 
agggttttga 
ttttgttgat 
agtgttgatt 
gtgtttagtt 
ttaaattatt 
agtagaatgt 
aagaattaag 
ttattttttt 
ttttagtata 
gtgtgggttt 
agtatattaa 
gatagatttt 
tagggatggt 
tttgtttttt 
atttttagtt 
ggattagagg 
tttttaaagt 
gattatggaa 
aaaattgatg 
tggtgtgatt 
gttttttaag 
tttagttgag 
atttatttat 
ttaaaattga 
tttttttttt 
aatgtagtgg 
ttgttttagt 
ttttgtattt 
ggttttaagt 
ttaagtttag 
gtgtagtggt 
ttttaagtag 
gtagagatag 
ttatttgttt 
ttttttttat 
attttggttt 
taagtagttg 
tgttttgtgg 
gataagttta 
tggtgtgttg 
tttataattt 
tgagttttat 
ggttatatta 
gagatatttt 
ttataattta 
ataagatttt 
atagaatttt 
tagttggttt 
tttttaaata 
tttttttaag 
tttggttggt 
gttgtttttt 
atttaggtgg 
ttgtggggag 
gattggggag 
ttttatttgg 
gagttttgag 



2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 
6060 
6120 
6180 
6240 
6300 



seq_prot.txt 

tttgagtttt tttggttgta gtagttattg tttgttgtgt tgttttaggt tattttgaaa 6360 

gaaggtgttt ttgttttgtt tatagttgta tttgtttgtt ttttagtttt gtgtgtttgt 6420 

agttgttaat tattgttttg gttgtgtgtg tgtgtgtatg tgtgttagtg tgtgtgtgtg 6480 

tttgggttag agttgtgttg taattgttaa gattgaaatg tagattgttg ggatttagtt 6540 

tttgttttat tggggtagga atgttggggt ggggatatgt atgttttgtt tttaggaatg 6600 

attttattgt tttggagttt tatttataga ttttatttat tatagggaat gggggtgggt 6660 

gttagtgttt gggtaagtgt ataagagtgg tttttggttg gaggtgaggg tgggaaggtg 6720 

tgggaagtgt gtgtgtgtgg agtttgggtt agtttgggtt tgggtttgtt tgtagtgggt 6780 

ggagtatttg tggagttggt aatttaggtt ttttttttag tttttgtgta gaattagttt 6840 

ttttgtgttg ttgggaaatt ggtaattaga atgttttttg tgtgtggtat ttaggtagtt 6900 

tttgagaatg tttgtattgt ggtttgttta tttttgtttt ttttatatgt ttttggtttt 6960 

gtgtttatta tgtttgtgtt ttgttttatt gtgggttttt agtttaggtt ttggggtttg 7020 

taatagttta ggtagatgag tgtgtggtag tggtagtggt aggtgaattt gtaatttgta 7080 

gagaggtttg gtggtgaggt ggaggagttt taggttgggg aaatgttttg gagattgaag 7140 

ggaagtttta gggagagggt tgttgtttgt taggttttgt aggtttgatt tattttagtg 7200 

ggttatttta tattgttatg tggattttaa tgttggttat ttgggtgttt ggaaattggt 7260 

tggaaggtta taggtagaga ggtttgttta atagttggat ttttattgtt tagtatagaa 7320 

tttttttttt ttttattggt aattaaaaaa ataataataa aaaattgtgt tttgtttttg 7380 

ttatttaggt tggagtgtaa tggtgtgatt ttggtttatt gtaatttttg ttttttgggt 7440 

ttaagtgatt tttttgtttt agttttttga gtatttagga ttataggtgt ttgttattat 7500 

gtttagttaa tttttgtatt tttagtagag atggggtttt attatgttag ttaggttggt 7560 

tttaaatttt tgattttagg tgatttattt gttttggttt tttaaagtgt tgggattata 7620 

ggtttgagtt attgtatttg gttttttatt gggaatgtat atggaatata tttgtttatt 7680 

tatttgaagg aaaaattaaa tatttttaat ttatgtttgt tttgtggttg ttatttgttt 7740 

ttattttttt tagattaaga tattggtttt tatatatttt aattttttgt ttttattttt 7800 

ttttttattt attttagtta ggtttgtttt tttttttagg aaattgtttg ggatagggtt 7860 

tttagtgatt tgtgtattat taaatggaat ttagtgtttt attttttatt tttatttttt 7920 

8040 
8100 
8160 



tagtattatt tgaagtttgt tttttttgat ttttaggggt tatatttttt tagttttttt 
tttatttttt gtagtttttg tttagttttt ttgtagattt tgatttaatt tttatatttt 
atgatgaagt ttgggtttag ttttgattat tggtttggtt tgtttatatt tatttgtttt 
agatttatgg ttgaaatatt gatttaaatt tttagattag attttttgtg ttagtatttt 
tattaggatg tttaaaagat gttttaagtg aatatggtta aaatttaatt tttttttttt 8220 

8340 
8400 
8460 



tagttttatt gttatatttg tttagttttt tttttgtagt aaaaatggtt attaggtttt 
tagttattgg agataaaagt ttaaatttat ttttgatttt ttttttgttt ttatttttga 
taaatatgtt taaattattt tgttttttat ttttatggtt attttatttt tttttgagaa 
tgttgtaatg ttttagtttt gttttttttt tttttttttt tttttttgag atagagtttt 
attttgttgt taaggttgga gggtagtggt atgattttgg tttattgtaa tttttgtttt 8520 

— ~ 8580 

8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 



ttgtgtttaa gtaatttttt tattttagtt ttttgagtag ttgggattat aggtatttgt 
tattatgttt ggtttatttt tttttatttt ttagtagata tgaggtttta ttatgttggt 
taggttggtt ttgaattttt gattttaggt gatttatttg tttttgtttt ttgaagtgtt 
gggattatag gtatgagtta ttgtgtttgg ttttttgttt attttttgta ttttgttata 
attttgtgtt ttttttagtt gaatttgtga tgtttttttg tattggatga gagggttttt 
atgtatatat agatttggga tattatttat ttataagttt ttaaataggt tagagtagtg 
atgtttaatt tagattttat gttataataa tttggggagt ttttaaaatt tattgatgtt 
tagggtttat ttttagtagt tgatttaata ggtttgtggt gggatttagg ttagtgggga 
ggattgtaaa agtatttttg gtgattttag ttggtgttta tttaggggag agtaattttt 
gtttgttggt gatttttagg ggtgtagaag gattgttggg tgtgtggttg tgtgtatatt 9120 
ttagtatttg atttattggg ttagaaaagg gtgtttgtta aataaagatt taataaaatt 9180 
tttgtttgta gggggtttat taaaggtttt aaattttttt aggttttttt ttataggtgg 9240 
taattttttt ttattttaaa ggttttggag ggggttatga gtgtttgaga agaggtaagt 9300 
ttgggaagat ggattttgag gatagtaggt ataaattttt ttttaagaag ggttaaggta 9360 
ttttaaagat aagaaattta aaattagtgt atttttatat ataagtagtt atttttgttt 9420 
atttgtggtt tagatatgag tggagtgtga taagggataa attatttttg tgtatttttt 9480 
agtgatgggg tgaaagtaat ggatttagtt tttgggagtt gtttttgttg attttttttg 9540 
ttgtgatttg atttgtggtg attgtgttgt tttttggttg ttttttttgt ttttgtaggt 9600 
gtgtggggtt attatttatg tgtgtattgt aggtttttgt gtatgatgtt ttagatgaag 9660 
ttgttataga ggttgtatta tgtgtgtgtg gtgggttttg tgggttggaa gtggtggtta 9720 
tggttaggga ttagttgttg tgtggggttg tatgtggtgt tttgtgtgat gtgtagtgtg 9780 
ttggtatgtt ttagttgggt gtggtttttt ttagtgtgtt tagtgggtgt tagtttttgt 9840 
agtttaatga gtttaggttt ttttgatatg gtttggttgg gtttgtgttt tgttggtttt 9900 
gggtgttagt aagtgtgggt tgggtggggt tatagggtgg gttttgattt tagtgttttt 99bU 

tttaggattt agattgggtg gtgggaagga gttgaggaga gttgtgtaat ggaaatttgg i£0£0 

gtgtagggat tgtggggttt gaaggtgggg ttgggtgtgt ttttgtagag ttttttttgt 10080 

tttgtttttt tttttttttt ttgttttttt tttatatttt attttggatg gttataatga JxinR 

tggtgattgt aaagtattat gtggagatat ttgtgttttt ggaggttagt tttattgtgt 1^200 
tagaggaaga gggtttttat atttggtttt ggtttttttg gtttggtttg ttgaagtaat 

atatttggtt tatttattgg gtggggtagg aagttttgag tttttratttg gggtgaggag i£^u 

gagtjgagatt ggttagtagt tttattgttt gttttgtttt ttattgtgga gattggggtt 10380 

seite 327 



ttggtagagg 
gtatgggtgg 
taggatataa 
tgggtattga 
ttttaggata 
aagtttttgt 
ttagtttgta 
99tgggggta 
ttataggttt 
ttggagttgg 
ttagattttt 



ttggattgtg 
ttggttttta 
gtttttttat 
ggatggtaat 
tttttatttg 
attgtagtaa 
ggttttagtt 
ttttatttat 
tatttttgtg 
tgtttggtaa 
gggatttagg 



attttgaggt 
gtaattgtag 
gtttttttta 
gtatgtgggg 
tagtaatatt 
gtatagtggg 
tatttggggg 
atttatgttt 
tttgtagttg 
ttgtttgtta 
a 



seq_prot . 
ttaggggtgt 
tttttatttg 
gtgtttgatt 
gatgtgggag 
atttattttg 
gttgttttgg 
aaagttagga 
tttgtatttt 
taggttttgt 
tttgtttttg 



txt 

attttgggtg 
gttttgttat 
tggtattttt 
tagggtttag 
gtattgtgag 
agttattgtt 
aggtttaatt 
ttttattttt 
tttgaggggt 
tttttttgtt 



gatttttttg 
tttgggttgt 
tgtaggtagg 
aggtttaagg 
tagtgtttag 
tttagtatat 
ggttttggaa 
tttgttattt 
tgaatatatg 
ttagttgttt 



<210> 369 
<211> 11001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 369 



ttttagattt 
attgttaagt 
gtgtgaaggt 
tgtgggtggg 
gggttgagat 
tttattgtag 
gtaggtgagg 
tattgttatt 
tatggaaaga 
ttgagggttg 
ttatggttta 
agttgttgat 
tttagtgggt 
tgtggggatt 
tgtggtgttt 
aaggagggaa 
tgggttttat 
ttgtttagtt 
ggtttgtgtt 
ggagtttgag 
tatttggttg 
atggtagttg 
gtggtgtgat 
gtgtgagtag 
ttgttgtggg 
tgttattttt 
atttgtattt 
ttaagttttt 
ttttggagtt 
ttttagggtg 
ggtgggtttt 
atttagtgaa 
ttttgggaat 
ttaggggtgt 
aattgttggg 
tatggagttt 
gttttaggtt 
tagttggggg 
gtggtttatg 
taggagttta 
aaaaatgagt 
gggagaattg 
ttttagtttt 
taaggttggg 
aggatgattt 
ggtttttgtt 



tagaaatttg 
attagtttta 
aaggtttgtg 
gtgtttttat 
ttataggttg 
tgtagaggtt 
gtattttaga 
tttaatattt 
tttgtgtttt 
attatttatg 
gtttttgttg 
tgattttttt 
aggttaagtg 
tttttttttt 
tgtggttgtt 
ggaagggtaa 
agtttttgta 
tggattttgg 
tgttagtgtt 
tttattgagt 
gagtgtgtta 
gtttttggtt 
ttttgtggtg 
tggttttgtg 
ttaagttgtg 
gttttattgt 
gggttataga 
tatttttaaa 
tattttttta 



aagggaaatt 
ttgtaagtag 
ttagatgtta 
tgttagtaag 
ttttatagtt 
agtggatttt 
gggttgagta 
tgtgtgtgta 
gagtataaag 
tttgtaattt 
agattagttt 
taggtgtagt 
tttgaatata 
ggtgatagag 
atattgtagt 
gggtatgttt 
tttagtaatt 



ggagtggttg 
gtatgtgttt 
gaaatggtag 
tttttagggt 
gatgtgttag 
tttaagtgtt 
attttggatt 
atttgtttgt 
ggtagtttag 
ttaagggaat 
gagttttagt 
tttttttatt 
tgttgtttta 
agtatagtaa 
gttgttgtgg 
ggtggggggg 
tttaggtttt 
gggaggtgtt 
taaagttagt 
tgtgggagtt 
atgtgttgtg 
gtggttattg 
attttatttg 
tgtttatgag 
gtagaggggg 

tgaagagtgt 
tgagtagagg 
atgttttggt 
ggtttgtttt 
attatttatg 
gagttttgtt 
aaatatgtat 
taaaggttgt 
ttttttgtta 
aggtattagt 
ttattgtttt 
tggagatttt 
ttgtgataga 
tagtattttg 
ggttaatatg 
ggtgggtgtt 
ggaggtggag 
tgagattttg 
gtttttaaag 
attagaggta 



gagtgagaaa atagaggtaa 
agttttttag agtaggattt 
ggagggtgga ggggatgtag 
tagttagatt tttttgattt 
aggtagtggt tttagagtgg 
gtttatgatg ttagaatgag 
tttaagtttt atttttatat 
agggagtgtt aagttaagta 
ggtgatagag ttaaatgagg 
ttatttagaa tgtatttttg 
ttttgtagtg gagagtagag 
ttaagtgaag gtttgagatt 
gtaaattgga ttaggagggt 
agttggtttt tagaaatatg 
ttgtttgggg tggggtgtga 
gttttgtgag agtgtgttta 
tattgtgtgg ttttttttag 
gaagttgggg tttgttttgt 
gaagtatggg tttaattggg 
ggtatttgtt gggtgtgttg 
tattgtgtgg ggtattgtgt 
tttttagttt gtggggtttg 
gggtgttgtg tgtaaaggtt 
agtggaaggg gtagttaagg 
ttggtgggga tagtttttga 
gtgaaaatgg tttatttttt 
tggttgttta tatgtaaaaa 
tttttttgag aaagggtttg 
tttttaaata tttatgattt 
ggagggagtt tggaaaaatt 
gagtttttat ttagtaaata 
gtagttatat atttagtagt 
ttttttttgg gtagatatta 
gtttggattt tattgtagat 
aaattttaaa aattttttaa 
ggtttattta ggaatttgtg 
tttatttggt ataagaggat 
atgtaaagaa tgaataaggg 
gaaggtggag gtgggtggat 
gtgaaatttt atgtttatta 
tgtaatttta gttatttggg 
gttgtagtga gttgagattg 
ttttaaaaaa aaaaaaaaaa 
agaaataaag tagttatgga 
gagataaggg agaaattaaa 
tggttatttt. tgttgtaaag- 
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gtggtaggta 
gtggttgtag 
gaggtatgga 
ttttttaggt 
ttttgttgtg 
tggtattgtt 
tttttatatg 
ttgggaaaag 
gttgtagttg 
aattttaaga 
tgggtggtaa 
ttttgtttta 
tagggttgga 
ggtatttttg 
ggaggggatg 
gttttgtttt 
tttttttttg 
ggttttgttt 
ttatgttggg 
ggaagggttg 
gtaattttat 
ttatgtatat 
tgtagtgtgt 
ggtagtgtag 
ggattaggtt 
gttgtatttt 
tatgttgatt 
tgtttattgt 
tttttagaat 
tagaattttt 
tttttttttg 
ttttttgtat 
gttggaatta 
ttgttgaatt 
attattgtaa 
gatggatagt 
attataaatt 
gttgagtgtg 
tatttgaggt 
aaaaataaaa 
aggttgaggt 
tgttattgtt 
aaaaaaagaa 
gataagaagt 
gataagtttg 
aggaagttgg- 



10440 
10500 
10560 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
11001 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 



seq_prot.txt 

gtaagtgtag tagtgaggtt gaagaaaagg gaattaaatt ttggttatgt ttatttgaaa 2820 

tgttttttag atattttagt gaaggtattg gtatggagga tttagtttga gggtttaggt 2880 

tagtgtttta gttgtggatt tggggtagat gaatgtagat agattaggtt agtgattagg 2940 

attgagttta gattttattg tgagatatgg aagttgagtt agaatttgta aaggagttga 3000 

gtaggagttg tagggggtag gaggaaaatt gggagagtgt agtttttggg agttaaaggg 3060 

agtaagtttt aaatgatgtt gagggggtga gaatggagaa tggaatattg gattttattt 3120 

ggtagtatat agattgttga ggattttgtt rtgggtagtt ttttggagga agaggtaagt 3180 

ttggttggag tgggtagagg ggagagtgaa ggtgaaggat tagagtgtat agagattagt 3240 

gttttggttt gaggggagta gagataggtg ataattatag ggtagatgta ggttaaaggt 3300 

gtttagtttt ttttttaagt aaatgggtag atgtatttta tatatgtttt tagtgaaggg 3360 

ttgggtgtgg tggtttaagt ttgtagtttt agtattttgg aaggttgagg tgggtggatt 3420 

atttgagatt aggagtttga gattagtttg gttaatatgg tgaaattttg tttttattaa 3480 

aaatataaaa attagttggg tatggtggtg ggtgtttgta attttaggta tttaggaggt 3540 

tgaggtagaa gaattgtttg aatttaggag gtggaggttg tggtgagttg aaattgtgtt 3600 

attgtatttt agtttgggtg ataaaagtaa gatgtagttt tttgttgttg tttttttaat 3660 

tgttaatgag gaaaggggaa gttttgtgtt aggtgataga gatttaattg ttgagtaggt 3720 

ttttttgttt gtggtttttt ggttggtttt tagatgttta ggtggttaat attagagttt 3780 

gtgtagtagt gtgaggtaat ttattgagat aggttgggtt tgtggagttt ggtgagtagt 3840 

ggtttttttt ttggggtttt tttttaattt ttgggatatt tttttgattt ggagtttttt 3900 

tgttttattg ttaggttttt ttgtagattg taagtttatt tgttattatt gttgttgtgt 3960 

gtttgtttgt ttggattgtt gtgggttttg ggatttgggt tgggaatttg tggtggagtg 4020 

ggatatgaat gtggtgagtg tggggttgag ggtgtatggg aagggtgagg atgggtaggt 4080 

tatagtgtag gtatttttga gggttgtttg ggtgttgtgt gtaaggagtg ttttaattgt 4140 

tgattttttg gtggtataga gaggttaatt ttgtgtgggg gttgggaggg gagtttggat 4200 

tgttggtttt gtaagtattt tatttgttgt aagtggattt gggtttaggt tgatttaggt 4260 

tttgtgtatg tgtatttttt gtattttttt gtttttgttt ttggttagag gttatttttg 4320 

tgtgtttgtt tggatgttgg tatttgtttt tgttttttgt ggtaggtggg gtttgtgagt 4380 

ggagttttgg agtgatgagg ttatttttgg gggtgaagtg tgtgtgtttt tgttttggtg 4440 

tttttgtttt aatgagataa gagttagatt ttggtgattt atgttttagt tttaatggtt 4500 

gtggtgtggt tttggtttgg gtgtatgtgt atattgatat gtgtatatgt atgtatgtga 4560 

ttggggtggt ggttggtggt tatggatgtg taggattggg ggatgggtgg gtatggttat 4620 

gggtgaggtg gaggtgtttt tttttgaaat gatttggagt agtatgatga gtagtggtta 4680 

ttgtagttaa gaggatttgg atttggagtt tgagtagtat tttattgtgt gaattttgtt 4740 

agtttgtagg ttgtgttggg attaggtggg agttaggggg tgttggtggg tgggagggga 4800 

agtggttgtt ggagttttgt tttttttggt ttgttgttgt gttttgggtt ggtgggtagt 4860 

tttatttttt tggttatgtg gttttttgtg ggttttggtt ggggatttgt ttgtggaatt 4920 

gtgtgtaaga ttttgatttt attgtttaga tgttgggtgt tggggttttt ttggtttttg 4980 

ttatagatag gttgaatatg gaaaaagtag ttgtatggtt tgtggtagat ttgagttggg 5040 

tattatttag ttatgattaa agttgattga gtagtttgga ttagtatttt gatttttgtg 5100 

tttgaatgtt tttgtttttt ttttggggag attaggggag gatgtggaga gggaagagtt 5160 

tttgttagga attgagaagt atgtttagga aaatttgaga ggtagagaga gattttgttt 5220 

ttttatttgt atttttgtat ggagttagtt gagtttttat tttttttttg ttttggtttg 5280 

ttattagttg ttggaatgtg gaagattttg tttttttttt ttagggtgga tttggagaaa 5340 

gatttgggaa tagataggaa agaagttttg ttttggatta taagtattta ggagtatttt 5400 

atttatagga agggggaaag ttagattata aaatgtttaa agaggtggaa aaagagattt 5460 

aggttattaa tttaggattg taaggtgttt tggaattttt taggtatttt tattattgga 5520 

gaattgtgtg ttagatgtta ttggtgtgat tattaggttt agagaattag gtttaggtat 5580 

taggaaaaag aaatagggat tgtgaagttt agtatgtttg gtagaaatgg ggtggaaatt 5640 

tttatttaag taaagaaagt ggagttgtga gtgatgtttt agataaaatt ttataaaatt 5700 

ttttataaaa tgggtggtgt ttagtatgtt aaaattttag tttagagttt gggtgtaagg 5760 

gttgagttga gtgtagattt ttgggtttgt ttttatgtta gttagttttg agttattttt 5820 

tattgtggaa aggtgggaaa attataagat attaattaat tgaaaaggag ggttagttat 5880 

ggaggtgtat atttgtaatt ttagttattt gggagggtga ggtagaagga ttatttgaat 5940 

ttgggaggta gaggttgtag tgagttaaga ttgtgttatt gtattttagt ttgagtgata 6000 

gagtgagatt ttgttttaaa aatagaaaag gaagttaagt atggtggttt atatttttaa 6060 

tgttaatgtt ttgggaggtt aaggtaggtg gattatttgt aattaggaat ttgaggttag 6120 

tttggttaat atggtgaaat tttattttta ttaaatatat aaaaattagt tgggtatggt 6180 

ggtgtgtgat tgtagtttta gttatttggg agattgaatt attttaattg ggaggtaaag 6240 

gttgtagtga gttaagattg tgttattgta ttttaatttg ggtgataggg tgaggttttg 6300 

ttttaaaaaa aagaaagaag gttgggtttg gtgatttatg tttgtaattt tagtattttg 6360 

ggaggttaag gtaggtagat tatttgaggt taagagtttg agatttgtta ggttaatata 6420 

gtaaaatttt gtttgtattg aaaatataaa aaaattattt ggttatggtg gtgtgtgttt 6480 

gtaattttag ttattgggga ggttgaggta ggagtattat ttgaatttag aagatagagg 6540 

ttgtagtgag ttgagattgg gttattgtat tttagtttgg atgagagagt aagattttgt 6600 

tttaaaaaaa aaaaaaaaaa aaaagaaaga ataggaggtt gagaagtttt aagttatatg 6660 

ttaaaaaaaa agaaaaaaat attagtttta ggttaggtgt agtggtttat atttttaatt 6720 

ttagtatttt ggaaagttga ggtgggtgga ttatgaggtt aggagtttaa gattagtttg 6780 

gttaaaatgg tgaaattttg ttttgattaa aaatataaaa aattagttag ttgtggtggt 6840 
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aggtatttgt aattttagtt atttgggagg ttgaagtaga gaattgtttg aatttaggag 6900 

gtagagattg taatgagtta agattgtatt attgtatttt agtttggaaa atagagtgag 6960 

attttgtttt aaaaaaaaaa ttattagttt ttatggatag tggtagagtg gagggtgggt 7020 

ttttatggtg tagaagggaa attttatggt tttgttgtgt atttgattgg gatggttgtt 7080 

gaaatttttt tttagtaggt agttttggaa atagaaaaag aaattttttt ttttttagaa 7140 

ttttggaagg gttgtgtagt gtttttaatt taagtttgtt ttttgagtga agatagggag 7200 

gtttattatt agaagggaag gggttggaaa tgaggttatt gtattttagt ttagggtttt 7260 

tgggttattt aggaagggaa gaaggagtaa gtttttttat tgttaggtag gagtttagag 7320 

ttattataag aataagttag tattattttt gtgttttttt tgttttgtaa ataaaatgat 7380 

. tttttttttt gttttggtat tagagtttgt ttggtatttt ttttgttttt agtatttttt 7440 

ttatttgggt attttttttt gttggtgtat tgaataaata tatttattgt tttatttata 7500 

gtttttagtt tttatttttt agggtttata ttatttgttt ttattaattt gataaggttg 7560 

tttattgttt ttagtaaggt ttgtattggg gtttttattt tagtgttttt ttttatttag 7620 

gagatttttg gatatttggg gaagaaaatg agtttaaatt tttatttttt tttttttatt 7680 

ttttttttgt aaggttttgg ttttagtttt tagttttata tttttgttgg ttgtagaata 7740 

gtagtgggtt ttgggtaagg agtattttgt taaaatgttt tattttgttt ttttatttgt 7800 

tttttttatt tgtttttatt agatggttta agtgtttaag gggattttag ggtggagtta 7860 

gggagaattt tggttttttt gggttaggta taagattatt ttataggaaa ttttgtggga 7920 

atttttttgg gataaagtat tggttagtgt tgagtttagt tgtgtttgtg atatttgtat 7980 

tttaattagg gtttatttga tgttaatagg aagtaaggtt gatgtagtgg ggttaaggga 8040 

gtttgggaga agaaagttgg tttagagttt tggttgtttt gttttatatt ttattttttt 8100 

ggtaagaatt tagtttttag atgaggtggg gagtgagtgg ttgagttaaa aatttttggg 8160 

ttgggtatga tggtttatgt ttgtaatttt agtattttgg gaggtgaagg taggtggatt 8220 

atttgaggtt aggagtttaa gattaatttg gttaatgtgg tgaaatttta tttttattaa 8280 

aaatataaaa attagttggg tgttgttgtg gtatgtgttt gtagttttag ttatttggga 8340 

gtttgaggta ggagaattgt ttgaatttag gaggtagaat ttgtagtgag ttaagattta 8400 

gttattgtat tatagtttgg gtgatagagt gaggttttgt tttaaaaaaa aaaaaaattt 8460 

ttgggttaaa tttttagata gtataggtag gtgtagaaat ttattaggaa gttgtttgtg 8520 

tatttttggt agattggagt ttggtttaaa gttgtttttt atgtagtttg ggttaaggtt 8580 

aaatattatg ttatagtgat tttttttatt atgtgtgaga tatggagaat tggttttaag 8640 

tattattttg tttattggtg gttggattat tgatgtgtat tattttttat tttttttatt 8700 

ttgtagtggg ttatggtttt gtgttggggt agaggagaaa aatgggttgt tttttttagg 8760 

ataaattttt attttaattt aattagggtg ttgtgattag aatgtgtaat tgaggtgtga 8820 

ttttattgat tttttttttt tttgagattg agttttgttt ttgttgttta ggttggagtg 8880 

tgatggtatg attttagttt attgtaattt ttattttttg agtttgagta atttttttgt 8940 

tttagttttt taagtagttg ggattatagg tatgtgttat tatgtttggt taattttgta 9000 

tttttagtag agatggggtt tttttatgtt ggttaggttg gttttaaatt tttgatttta 9060 

ggtgatttat ttgttttggt tttttaaagt gttagaatta taggtgtgag ttaatgtgtt 9120 

tagtttgttt ttgttttttg tgttttgaag tagggtttta tttagttttt taggttggag 9180 

tgtagtgata tgataatagt ttattgtagt tgtaattttt tgggtttaaa tgattttttt 9240 

attttagttt tttgaatagt tgggattata ggtatattat tatatttggt taattttttt 9300 

tttttttttt ttagtagaga tgaggttttg ttatgttgtt taagttggtt ttaaattttt 9360 

gaggattaag tgattttttt attttagttt tttaaaatgt tgggattgta gatgtgagtt 9420 

attatattta gtttgatttt attttaaatg agagtttttt tttagagttt tttagttgtt 9480 

tttggttttt ggttatgtgt ttttagttgt ttttgttttt gtggtatttt taaggttata 9540 

tttagtgttg aggttttagg taggtagtag agagaagtta aatgattttg tttttttttt 9600 

atttatttag agtatgtaaa attaggagta gtggtgggtt tagggtgggt attagttatg 9660 

tatatgtata ttagggatag ggggttaaag gtagttagtt tttaaagatt gttttagagg 9720 

ttatttttta gagaagtttt gggtttttta agggttttgt gtttatgttg gtttattttg 9780 

taggatgagt ttgtggagtg ggagatattt gatttttttt aagttgagat tgagtagaag 9840 

attaaggagt ataatgttta gattaatagt aattttttta tgagtttggt gagttgattg 9900 

tttaggaagg gggtgtgggg aggagtaggt atttagttat gtgtttgata tttagagggt 9960 

tataattgag gttattttgg gtgggtgtaa gtagtaattt gtgtatattt agtttagttt 10020 

taagtagatt gatattttat ttggaattta ttattaaggt ttggtttttt tattttttta 10080 

gaataaggat ggtttttata taggttttat taaggtttag ttgaagttgg tgtgttttgt 10140 

ttttgtgttt tttagtaaga agttattttt tttgtaggat gtttggtggg gtttaggatg 10200 

gggtataagt gttaggtgtt gtattttttt ttatttgttt aaggatgttg ttaagtattt 10260 

gtatgtgttg ttatgtataa gggtatgtga agttattgag gttttgttgt gaaagttttt 10320 

ggtggtggat gatttttgta agtttgtatt ttttgagtgt gttgagtgtt atggttaagg 10380 

tgggtttttt attttatttt gttttatgtg agggtatata tgtatgtatt tgagtatgta 10440 

ggggttgagt agttggtttt gtttttgatt attatttttt ttttatagtg tatttgtgga 10500 

agttgttgga tgatgagtag tttttgtggt tgtggttttt ggtagggttt agtgataagg 10560 

ttttgagttt tgttttgaag gaaaatgatt ttggggaggt gaatgtgagt atatagtttt 10620 

tagttttttg gttgttatta gataggattg atgggttgta gttatagtaa ggtttggagg 10680 

aggaattgtg ttggaagata agttttgtaa aatagtttta ggagtgtata ggtattgtaa 10740 

ttaaagtaaa ggtttttagia ttatttatgt taaagtttag ggttgtttta agaagttagg 10800 

aagaattgtt ttggtgtttt gatttttttt ggtgtggaaa attttttgga gatgtaggag 10860 

tttatttaat gatatgagga ggtttttttt agatttttta tttggaagtt ttttggtttt 10920 



seq_prot.txt 

aaggtattag gtttgtggag tgaaattaga tttagaatat gtttgatttg tttataggta 10980 

attggggaat atttgatttg g 11001 

<210> 370 
<211> 4448 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 370 

tttatttttt ttatgtgttt tttttgttta gttttgttta tttggtgttt ttgatatttt 60 

ttttttttat tttattgttt ttgaggttta ttttattttt tggtttttaa ttatttattt 120 

atgtaggatg tgaattttta aattagtatt taagatgtat atttggatat ttattttaat 180 

gttttagttt taaatttaat atattaaaag ttgtattttt tatgtttatt attagtttta 240 

aagaatgtat aattgaataa aattttgatt attattttat attttttttt tttttgagtt 300 

atttatagag gtgattaaga tagaaatgtg tgttattttt tatttatttt attgttatta 360 

attatatgta tttattaaat tatgttagtt ttattttttt aataattgta gtttaatatt 420 

tgaataattg aataaaaatt tataatttaa tttttttaat atttatttat tttaatttta 480 

tttttataat ttatatataa tttataatta tttttatata attggtatag tgataatttt 540 

agataaaatt tattgaattt ttatttttat ttttgttaat gatatatata tataatgtaa 600 

tgttgagagg gttgggaagg aagaatggga gaaaggtaga agttgatagt taaaaaaaaa 660 

aagtttttag atggtttttt tagtgttatt tttgtaattt tattaaataa gggtttaaaa 720 

tttatgttat aatatttgtt tgatgtttta tttaaattgt tttattggat attttttatt 780 

tgttaagttt tttgaaagaa ataaaattgt tttgttatag ttagatttta tgattttata 840 

tatttataat gggtagattt tgtaggggtt tatatttttt aaggtggttt gggaaaatga 900 

tatagaaaag ttttatatta gttgaaaaga aaaatgtata atttattttg gtaattttag 960 

ttttaatttt taataggata aaggaaatat gtatattata aattaatgtt ttgtgttaat 1020 

aaataattaa gtaagtagag ttgtaagtat tggttaaaat gaattttgga tatttttagt 1080 

tattaaattt tttgaggtaa ggtagatata tatttggatt tgaatatttg tattattagg 1140 

gaatttttgt ttgtttgttt tatattgttt tgttattttt aaaagtaggt gttaaattag 1200 

gttatttgtt gtttgggtaa tgttattttt tgttataatt ataaattgaa gaaaattgat 1260 

tgtttttttt ttttagttaa tatgttgtgt ttttagtttt aaatattttt gagaagttgt 1320 

ttagatttat gagtaatgtt tttgttttta ataggttaag atattaggta ggtttttgta 1380 

tttttggagt ttttagtttt ttgtaaagtg aggaagttag attaagtaat tgttaggttt 1440 

ttttttagat tgattaattt gatggtatta gatgtaattg tttttgaatt agggtatgaa 1500 

atgaatttag ttttggtgta ttaatgtgat gattttgttt tattaaagtt tgagtatgtg 1560 

ataggtttag tattatttta tatagagata aagggtaatt ttttgttttt aaaggaatga 1620 

tataatttgt ttttgaagtg atttatatta tttttatttt tgaataattt aatgtttaga 1680 

aaataattta agaatttttg ttgattttag ggatgtaaga tatggttttt tgatagtatt 1740 

tgggattgtg gaaaaaagta attgaggaaa gggtattttt ataatgtaat attgaattta 1800 

gtgtttaagg tttattatag gaatttttaa tgatttttat aatttttttt tttttttttt 1860 

tttttttttt ttattttgaa aataaattga gaagttagta ttgggataat tatttttttt 1920 

gatttaaata aaaagttttg ggtaaatata ggtataaatt gttaaatgga aaaagttttt 1980 

ttttattttt agttagaggg aggttgggga ttttagtttt ttagaagttg gtttgtggat 2040 

gtttagagaa tttttttgga gattaggtta gggttattga gtttgtttag tagggtgttg 2100 

ttttggatgt tgttttgttt tttattttgt tagtgttgtg tttgggttgt gaaggtgtgt 2160 

gtggtgtttg gtgattggtg gtggtttgga gttgtttggt tgttattggt tgtttggttt 2220 

tttttgtttt tgggtttggg ttgtaggttt gttgtggtgg attgggtggt ggaagtttga 2280 

tggtgttggg tgagtggttg ttgagtggtg ttgtgggagt tttgtaggtt ttttgtgttt 2340 

gtagtggagt tggaggttag ttgaatttgg ttgtgggatt ttggatagga ggaggagggg 2400 

atttatagga tgtgttaata tggatttgga aaataaagtg aagaaggtag gggggtgttt 2460 

gtggtgggtg gtggttgttt tatttgtgtg ggttgtgtgg tgtgtggtgg ttggaggtgt 2520 

tgaggtgggt ggggtttgtg ggttttttgg tgtgagtttg gttttgggtt tgtggtgttt 2580 

tgggtttagg gatttggttt tttgggtaga ggaggtgttt ggtggttttg ttagtttttt 2640 

atatttggga gtgatagaat tggaagtgtg agtgagggtg ggtgtgggat tttttttttt 2700 

agntttatgg aattttagtg gttgtaggtg ttggaaattt tatttaagat gttttagttt 2760 

tgtttgtgtt tttttaaaag gaaaggatga gtttagggtg agtgtgggtg agattgtgat 2820 

attttttggt ttaggaagtt gaattttatt aagtttttgt ggtttggggt tttgttgtgt 2880 

tttgatagtt ttgatttttt tttttttggt tgggttgttt ggggtgtttt aaaatgagtg 2940 

ttgatttaat gtattgtttt tgtatttgtg ttggtgataa tttttaatgt gatttttttt 3000 

ttggttattt attatttgtt tattgattga ggtagtgttt tttttttagg gaattatttg 3060 

ttaaagagtg tatttttttt gtttggtagg tagttttaaa gggtggttgt tattagtgtt 3120 

ttagtgtagg gagatatttt tagtttggtt gttgttttgt aggtattgga attgatggtt 3180 

ttatttgagt tatttttggg ttatgtggag tttggtgttg tttgggagaa aagggtgggg 3240 

attagattat gatttagaag gaatgttgat gttatatttg agtgaattgt tttgtgtgaa 3300 
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agtttttttt gttttagggg atttgtttat gttttatttt gtggtaaatt taagttagtt 3360 

gttttttatt aaatagagaa gttttgtaga gagaggaagg gaaaaatata gaaaatagtt 3420 

tttgtttttt ttatatttat ggtttttaat attttatttt ttaaatataa agtttttgat 3480 

ttgagtatta attttatttt atgtttagaa ttggattttt gaaaatggat ataaaatatt 3540 

gtgttaggat tgtagagttt ttggttagga agttattgat taataaaaat ttgaaatgag 3600 

aaaatagttt tttttatttt ttgtgattaa aatttttagg tatttaaaaa ggtatttaag 3660 

aatagaaaaa tataaattgt taatggtagt gtgattgtat tttatttttt ttttatttgg 3720 

tttttttttt tttttttttt aaataattgt aatatttttg gttaattttg tttgttatgg 3780 

gggaaaagtg gggaaagtat tttgtttgtt tagttttttg ttgatgaata ttatttttat 3840 

ttagttttta agtaggtaag agttatttaa tttagtattt gtttaaaatt tggggggttt 3900 

tgtttttgtt tttaagtatt tatttgaaga taattttgta aaagtaaata tattttgggt 3960 

tgttgaggat gaataggtga agtataggat ttttagggta atgatattat tttgtatgat 4020 

attataaaat ggttgatata tgtaattata tattaattaa aatttataga atgtagaata 4080 

ttaagagtaa attttaatgt aaattatgta tgttggggat aatgatgtgt taatgtgttt 4140 

tattatttgt aataaatgta ttgtattgat atgggttgtt ggtagtttgg gaggttgttg 4200 

gagataggaa ggtatggggg aggtatatgg gaattttgaa ttttttgttt aattttgttg 4260 

tgaagttaaa aaatgtttta aaaataaagt ttgtttaata gggggagaaa aattaaagat 4320 

atttttgaaa ttttttgaga taaggagtaa ttttgaggga agaagtttaa atttaaaaat 4380 

tgtatatgtt tgtaaaggaa aaatttagat gttgtttata aataattttt ttagagtttg 4440 

aaaatgtg ~ *" ~ 4448 

<210> 371 
<211> 4448 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 371 

tatattttta aattttaagg aaattgttta taaataatgt ttggattttt tttttgtaag 60 

tatatataat ttttaagttt gaattttttt ttttagaatt attttttgtt ttaaaaagtt 120 

ttaaaaatgt ttttgatttt ttttttttta ttaaatagat tttattttta aagtattttt 180 

tagttttata gtaaaattga gtagaaaatt tgaagttttt atgtattttt tttatgtttt 240 

tttgttttta gtagtttttt ggattattaa taatttgtat tagtgtagta tatttgttat 300 

aagtgatgaa atatattgat atattattat ttttaatata tatagtttat attagggttt 360 

atttttggtg ttttatattt tatggatttt gattaatgta taattatatg tattagttat 420 

tttatagtat tatatagaat agtgttatta ttttaaaaat tttgtgtttt atttgtttat 480 

ttttggtagt ttaaaatatg tttatttttg taaagttgtt tttaaataaa tatttaaaag 540 

taaggataag attttttaag ttttagataa gtattaaatt aagtagtttt tatttatttg 600 

aaaattaaat gagagtagta tttattaata gaaagttaag tgggtaaaat gttttttttg 660 

tttttttttt atgataagta agattaatta aaaatattat agttatttaa aagaaaaaaa 720 

gaaagaaatt aaatgggaag agaatagaat gtaattatgt tattattaat aatttatgtt 780 

tttttgtttt tggatatttt tttgagtgtt tgagaatttt agttgtaagg aatgaaaaaa 840 

attgtttttt tattttgagt ttttattggt taatagtttt ttaattgaaa attttgtagt 900 

tttaatataa tattttgtat ttatttttaa aaatttagtt ttgggtataa aatagaatta 960 

atgtttaagt taaaaatttt gtgtttagaa gataaaatat taggaattat gagtatgaga 1020 

aaaataaaaa ttgttttttg tatttttttt tttttttttt tgtaaaattt ttttatttga 1080 

taagggatag ttagtttaaa tttattataa agtaaaatat gagtaaattt tttggaatga 1140 

ggagaatttt tgtatagggt aatttatttg aatgtgatat tagtattttt tttgagttat 1200 

agtttggttt ttattttttt tttttaggtg atattaggtt ttgtataatt tgggagtggt 1260 

ttaggtaggg ttgttggttt tagtgtttgt gaggtagtag ttaagttgaa aatgtttttt 1320 

tgtgttgaga tgttggtgat aattgttttt tgggattatt tgttaaataa ggagaatgtg 1380 

ttttttgata gataattttt taagagggag gtattgtttt ggttggtaga taagtaatga 1440 

atgattgaaa aaaaaattat attagaagtt attattagta tgggtgtgaa ggtagtgtat 1500 

taaattaata tttattttag agtgttttag atagtttagt tggaagggag gagattagag 1560 

ttgttaaagt atagtagagt tttaaattat aaaggtttaa tgaaatttaa ttttttgagt 1620 

taggaagtgt tatggttttg tttgtatttg ttttaagttt gttttttttt tttgaggagg 1680 

tataggtaga gttgaaatat tttaagtaaa gtttttaata tttataatta ttgggatttt 1740 

gtgagattgg agagaagagt tttgtgtttg tttttgtttg tgtttttaat tttgttgttt 1800 

ttgagtgtgg ggagttggtg gggttgttga gtattttttt tatttagggg gttaggtttt 1860 

tgggtttggg gtattgtaga tttagagttg ggtttatant ggggggtttg tgagttttat 1920 

ttattttggt atttttggtt gttgtgtgtt gtgtgatttg tgtaggtgaa gtagttgttg 1980 

tttgttatga gtgttttttt atttttttta ttttgttttt taggtttatg ttaatgtgtt 2040 

ttatgggttt tttttttttt ttatttggga ttttatggtt gggtttagtt ggtttttggt 2100 

tttgttgtga atatgggaaa tttgtggaat ttttgtggtg ttgtttaata gttatttgtt 2160 

tggtgttgjtt gaatttttgt tgtttagttt gtt^ tagtgg gtt tgtggtt tggatttggg 2220 
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<220> . , . 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 372 
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aataaagggg gttgggtagt taatggtagt tgggtagttt tgggttgttg ttaattattg 2280 

agtgttgtat gtatttttgt ggtttgagtg tggtgttggt aaggtgagag gtggggtggt 2340 

gtttgaggtg gtgttttgtt gggtaagttt agtgattttg atttggtttt tgaagggatt 2400 

ttttgggtgt ttatgggttg gtttttaaag agttgggatt tttagttttt ttttagttga 2460 

agatggagaa gaattttttt tatttggtaa tttgtatttg tgtttgttta ggatttttta 2520 

tttgggttaa ggggaatagt tattttaata ttaatttttt agtttatttt tagggtaaag 2580 

agagaaaaaa aagagagaga gagagattgt agggattgtt agggattttt Qtgatggatt 2640 

ttgggtatta aatttaatat tgtgttgtgg agatgttttt tttttaattg ttttttttta 2700 

taattttaaa tattgttaag aaattgtatt ttatattttt aaagttagtg aagattttta 2760 

qattgttttt tgagtattgg gttgtttaaa agtaaagatg atgtgaatta ttttagaaat Z8ZO 

aggttgtgtt atttttttgg aagtagaggg ttgttttttg tttttgtgtg agatggtgtt 2880 

aggtttattg tgtgtttagg ttttgatgaa gtagagttat tatattggtg tattaaagtt 2940 

gaatttattt tatgttttaa tttaggagta attgtattta atattattag attggttggt 3000 

ttggaaggga gtttaatgat tatttagttt agttttttta ttttgtaggg aattgaggat 3060 

tttagaggtg tagggatttg tttaatgttt tagtttgttg gggatagagg tattgtttat 3120 

gaatttaaat aattttttaa agatgtttgg ggttggaagt atagtatgtt gattaaggaa 3180 

aaagagtagt taattttttt taatttgtgg ttatggtaaa aaataatatt gtttagatag 3240 

taggtgattt aatttggtat ttatttttaa aaataataaa ataatataaa ataaataaat 3300 

aaaaattttt taataatgta gatatttagg tttagatgta tgtttgtttt gttttgagaa 3360 

atttggtagt taaaaatatt taaaatttat tttagttaat atttgtaatt ttgtttattt 3420 

gattatttgt taatatagag tattgattta taatgtgtat gtttttttta ttttattaag 3480 

iattaaaatt gaaattatta aaatgagtta tgtatttttt tttttagttg atgtaaaatt 3540 

tttttgtgtt atttttttaa attattttga aaggtgtggg tttttgtaaa gtttgtttgt 3600 

tgtaagtgta taggattatg aggtttagtt atgataaggt aattttgttt tttttagaga 3660 

gtttaataga tggaaaatgt ttagtaaagt aatttaaata aggtattaaa taaatgttgt 3720 

aatataaatt ttggattttt atttgatgag gttgtagaga tgatattaag aaaattattt 3780 

ggaaattttt tttttttaat tgttaatttt tgtttttttt ttattttttt tttttaattt 3840 
ttttagtatt atattgtgtg tgtgtattat tggtaaggat ggaaatagaa atttagtgag 
ttttatttgg aattattatt gtgttaatta tataaaaata attatagatt atgtataaat 

tataaaaata aaattgaggt agataagtgt taagagagtt aaattgtaag tttttgttta 4020 

gttatttagg tgttaagtta tagttattaa ggaggtagaa ttggtatggt ttgatgagta 4080 

tatgtagttg atgatagtgg aatgaataga agatggtata tatttttatt ttgattattt 4140 

ttatggatag tttagaaaga gagaaaatat aggatgatgg ttagggtttt gtttagttgt 4200 

gtattttttg gggttggtgg tggatataaa ggatataatt tttaatatat tgagtttgag 4260 

attgagatat tgaaatgagt atttaaatat atattttgaa tgttgattta ggagtttata 4320 

ttttatataa ataaataatt aaaggttaag aagtgagatg agttttggag atagtaaaat 4380 

aaaagagaag aatattaaag atattaagtg agtaaggttg agtaagagaa atatataaaa 4440 

gagataag 4448 

<210> 372 
<211> 4408 
<212> DNA 

<213> Artificial Sequence 



3900 
3960 



60 



ttatattgtg agtatataaa aagtattata tggttaatgg aggatgagga attatggtaa 
agtaggtagg taagttttaa gaaataaaat aatttgttaa aaaataattt ttgatgatta 120 

240 
300 
360 



ttgtaagatt gaaagtgtag gaaaaatata gtttgaataa ttttagattt ttttatattt 
tttttttttt tatatatttt gttattttat aataaaattt ttaatggaaa gtttaaaaat 
aaatagtata ggaatatgtg ttttaaatga attaaattgt gaaattagtt agtaaattaa 
tttgtagtaa gtaattattt aaggaaatta aaatattgtt tagtttagtt ttgtatttta 

ttatgtgtat gtgtttttta taattaatta atataagtgt tttaggaata tttgaagata 420 

aatatgttta atttaaggaa taaagtattt aaataattta agtgtaattt tgttgagtta 480 

aagtaaaata ttttataaat gaagtggtta tttaattttt tagggaaagt ttggttattg 540 

aaatgttgta tgtttatgtt atattaataa aaatttttaa tttattttgt ttatgtgttt 600 

tgtttttttg atattattgg tatttgaatt ttagatggat ttttgttaaa atgatatttt 660 

gtgtgataaa agtattttta gttttgattg atagattaaa ataaatgtaa ggaaattttt 720 

ttaaattaga ttaatttttt ataaaaatat tttagaatgt atgaattttg atatttatat 780 

ttataatggt aaaagttttt tttgtttagt ttagtaagat aatatttata taaaagagta 840 

aaaaaaaatt atattatttt atgatagttt gatttttaaa ttgtttaaga aagtaaagtg 900 

gttaaattgg aaaagaggaa tatattttgg aggtttagaa ttgaaaattt tttttttaat 960 

ttttagttgg aaaataattt tttgtattta tttaaagtgt attttttgaa gtgttagatt 1020 

qgagttgatt ggtgattaat ttaaaggagt tataatttaa agaaat;ggtg agagtttggt 1080 

atttaggttt ggtttttagg taatttgttt gggtttgaga ggttattaat tgttagttaa 1140 
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gatggaattt 
ttttagtagt 
aggatttgtt 
tagttatggt 

gggggatgaa 

aatt-tagttt 
tttagatttg 
ggagagagtt 
ttttggttgt 
gtgttaagtt 
gtagtttttt 
ttttttttta 
tagtagagat 
ttgtgtggtt 
taggggtgat 
gagttggagt 
aggagtagga 
agagaagaaa 
agggtagtgg 
gtgataggga 
atgaggtgtg 
gttgaggaaa 
gagggtggga 
gaagaatgga 
aggtttgtta 
gaggttaagg 
tgtattgatt 
ttagaatttt 
ttttttaaaa 
aagtgggggt 
tatgtgaaat 
tttttgattt 
tttgttttgt 
aggtttttgt 
atgtttttgg 
attaattttt 
ttaattgaga 
tatttgtaat 
aaaaaagttt 
ttttgggggt 
atttgagtta 
ttttttattt 
ttttttagtt 
tgtaatattt 
tatttaaggg 
tttgattttg 
tttttttaat 
gaaagatatt 
atgggttaat 
gtagaatggt 
gagaatgaga 
gaagggagtt 
ttttagaggt 
gttttttttt 
gtagatattg 



tttttttttt 
tgaaattttg 
tttgtagtgt 
ttggtttttt 
aggaataggg 
taaaagttat 
tattatttta 
gttgtttttg 
atggtaagtg 
ttgaaggagt 
tttgtttttt 
tttttttttt 
tttattaaat 
ggtgatttag 
gagaaatgag 
tgggagagtg 
gtatggattt 
aagtaatttg 
ggagatagaa 
aagtggttta 
tggagtgtgg 
gaggttatag 
gtggaggtgg 
gtgagttagt 
gttggggaag 
tgggtggatt 
tagtgtttat 
ttttgttgta 
tgatgaggtt 

gggggagaga 

tatagggtat 
tttgatttga 
ttttgagtgg 
taggtatatg 
aaagtgttgg 
gtttgagagt 
ataaagtttt 
tttagaaata 
ggtgtttaaa 
tgtttggaat 
gtagtaaaaa 
tgaaatagtt 
atgaagaaaa 
gtgaaaattt 
gagagaaatg 
agaattttta 
aagttttata 
tttatttttg 
aatttttttt 
tttgttttta 
aaattaaaga 
gtgttgaaaa 
ttgtaaatta 
tgtttggagt 
tgttgagttt 



tttttttttt 
tatttagttt 
tggtattgag 
ttttgatttt 
ttttgtaagg 
aaattattta 
tatttttgtg 
tgttttattg 
tttgaaagtt 
ttatgggtgt 
tttagttttt 
ttttttttgt 
tttattgtat 
gagtgagtat 
gtaggggagg 
aaaggagagg 
ttattgtgga 
tgttaatttg 
atgtattttt 
agagatggaa 
gtgtgagtgt 
taaagaggga 
gtgtggtggg 
gggagagggt 
gttgggatgt 
aggtatgttg 
gaagggtttg 
tttatattta 
aaggtttttg 
gtttttttta 
atttttattt 
gtaaagatgt 
aaggattata 
ggataatgtt 
aatatagtta 
aaaatttagg 
ttttaaaagt 
tatagtgatt 
attaatttta 
atgttatgtg 
taggtaaata 
aatagttgat 
ttgattagag 
aaattttttt 



agggggaaaa 
tttttggttt 
attgaatttt 
tttattgatg 
tttttttttt 
gttggtggtg 
tagaaagatt 
gatgtaaaga 
gggttgagag 
tttgtttgtt 
ttgggttt 



seq_prot. 
taatggataa 
tttattttga 
atttaaggga 
agtattttgt 
tttgtttgtt 
tggtgaagat 
ggatagatgg 
ttttttgggg 
gggtttgaga 
attggggttt 
ttgtttggtt 
ttttattttg 
agtggtgtgt 
agtgtttggg 

gaagtagatg 
ggatttggtg 
aaggaggatt 
gtagttttaa 
ggggagtaaa 
taaaggataa 
gtgtgtgtga 
ttgtggaggg 
gatgggagag 
gggagggtta 
tgggtttagt 

agagtgttgg 
tattggataa 
ataagtttat 
gtgaggatgg 
agtttatatt 
gaaaagtgtg 
gtattttggt 
aatataattt 
tttttaattt 
ttgtttttgg 
tttgttatta 
tgtattgttt 
tgggagaatg 
gttattatat 
tttttttgaa 
aatttgttta 
agatggattt 
atttatattt 
tttttattta 
tgtttatgtg 
ttgaaatttt 
agttatagga 
atttttgtag 
ttattttgta 
aggataggta 
tagagatttg 
tatgtgtgtg 
gaaggggttt 
agaggttggt 



txt 
taatgggaag 
gaatgttaat 
agatattttg 
agattgttag 
gattgtgtta 
tttttgaagt 
tttttattta 
tgatttttag 
ggattgtagg 
ttgaaattta 
ttgtattttt 
tgtggggagt 
gggtggttgg 
tgagtgttgg 
ttagtgggtt 
gggtatttag 
agaagggagg 
taaattaaag 
ttaggatggg 
tgtttatggg 
ttttttttta 
aggaaagtga 
gatgagtgaa 
tagttgggag 
ttagttggga 
tgtataggtg 
ggtttagatg 
tttgggttat 
tattaaattg 
tgtttttgta 
ttttattttt 
agtgagtaga 
gtttggagga 
taagggtatt 
atttggattt 
aaaagatatt 
tttttaaatt 
tgtataaaat 
aggtgttggg 
ttattttgtg 
atttgttttg 
attttatgga 
taagttattt 
gtggaaattt 
ttgtttaatt 
gttgaggtaa 
tattggaaag 
tttttttatt 
gagattaaga 
attttatgga 
tggagagata 
taattttttt 
gggaagttta 
taattttagt 



ggggttaatt 
ttttggtttg 
ttttaaatgt 
tgtttgtggt 
ttttgggtga 
ggaataaatt 
ttggttattg 
tgagttgagt 
gtttttgagg 
gttgttattg 
tttttttttt 
gatgtgatgt 
ttgagtttgg 
ggggagtgag 
gaagagttgg 
gagttaattg 
atgggatgga 

ggggagtgtt 

ttgggaggaa 
gttgtttggg 
ggtttgtaga 
gagattggta 
gagaaattta 
tgaatgagtt 
tattgtgttt 
ggtatggtta 
tttatagagt 
ggatatttta 
tatgggatag 
aagtaaagag 
gaattttgat 
atattttggt 
ttaggtgtga 
ttgttaatgt 
ttttattaat 
tttttggttt 
aatatattaa 
agatatgttt 
ttttttttat 
ttttgaattt 
agtgttaaat 
aagggttagt 
ttaattttta 
aaagtagtgt 
gtattttttt 
gaaaattaaa 
tgtagtttga 
tttttgagta 
ggtgtttgta 
aaagttggaa 
tagggagagg 
tttttttagg 
tgtttttttt 
tttggttgtt 



<210> 373 
<211> 4408 
<212> DNA 

<213> Artificial sequence 

<22o> , ; 

<223> chemically treated genomic DNA (Homo sapiens) 
<4Q0> 373 



1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4408 



laaSia«»? SSSSH** Stgtttgtgg tggttgggat tggggttaat tagtttttgg 
tgggtgagat tttagataga_aggggggtga_gaggaatgtg_agtt«^ 
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i t 



60 
120 



1920 
1980 
2040 
2100 



seq_prot.txt 

tttaqttttg gtttgtaaat ttttgaaatt tgaaagggga gggagttgta tgtgtgtatt 180 

tttqtgtttt tttagtgtaa tttttttttt tttttttgtg tttttttgtg gatttttgaa 240 

tttttttgtt tttggttttt ttattttttt ttaatttttt tatgagattg tttatttttg 300 

ttattagttg aaggtaaggt tgttttgtta tgagtgtttt ttaattttta taaaatgaaa 360 

agaaaaaaag ggaggattat tagtttatta tttagaggaa tggggaggtt gtaaaaattg 420 

ttgatgggta gaggtgaaga tgtttttttt ggattgtatt ttttggtgtt ttgtaattag 480 

agtttagttg tgggatttgt tgaagaaatt tgattttttt gttttggtga gattttaaaa 540 

attagaaata gaaattttta gagttagaga ggaaatataa ttaaatagta tgtgggtatt 600 

ttttttttta tttttttttt tttaaataat attgttttga gtttttattg ggtaaagaga 660 

gaaagtttga gtttttatgg atgttatgtg gaggttagaa atggtttaaa atgtagattt 720 

ttaattagtt ttttttgtgg ttgaagaggt taattttttt tataaaatga gtttatttgt 780 

tgattgttag ttattttaaa gtgaagggat ttagtattta aaataaattg agtaagtttg 840 

tttgtttgtt tttattgtta atttaaatga atttaaaata tggagtaatt taagaaaata 900 

tataatatgt tttagatagt ttttaaaagt agggaaagtt tagtatttat atagtgatta 960 

gggttagttt taagtgttaa gtttttttaa atgtatttat tttatgtata tttttttgag 1020 

ttattatata tttttaaaat tgtgagtatt ggtatattga tttaggaaga gtaatataat 1080 

ttttagaggg aattttattt ttaattaggg attaaagaga tgttttttta atagtgggtt 1140 

tgagttttgt ttttaagtag gaattaatat tggtgggaaa atttgaattt aggagtaatg 1200 

qttqtgtttt ggtatttttt aaaaatatat attaatagga tgtttttgag attgaaaaaa 1260 

tattgtttta tatgtttggt agaagttttt atatttggtt ttttaggtga attatattta 1320 

tagttttttt atttagaggt aggatagagt taaaatattt tgtttattat taaaatatat 1380 

atttttgttt aagttaagaa attagaaaat tagggtttag aagtaaggta tattttttga 1440 

qtqagaatat gttttgtaat tttatatatt ttttgttttg taggagtaaa tgtggatttg 1500 

igggaaattt ttttttttat ttttattttt attttgtgta atttaatatt atttttgtta 1560 

ggaattttaa ttttgttatt ttaaaaaatg agatatttgt gatttagggt gaatttgttg 1620 

aatgtaggta tagtagagga aattttagat tttatgagtg tttgagtttt gtttagtgta 1680 

aattttttgt gaatattggg ttagtgtgtg gttgtgttta tttgtgtgtt gatattttta 1740 

gtatgtttgg tttatttgtt ttgattttgg gtgtggtgtt ttagttaagt tgggtttagt 1800 

gttttggttt tttttagttg ataagtttag tttgtttgtt tttggttgtg gtttttttat 1860 
ttttttttat tagtttattt tattttttta gatttttttt tatttatttt tttttatttt 
tattgtgttt atttttattt ttgtttttta ttggtttttt attttttttt ttttgtagtt 
tttttttgtt gtgatttttt tttttaattt tgtaggtttg aaagaaggtt atatatgtat 
qtttatattt atattttata tgttttgttt taaataattt tatgaatatt gttttttgtt 

ttgttttttg ggttattttt tttgttgttt ttttttagtt tgttttgatt tgttttttaa 2160 

aagtatgttt ttgttttttt gttgttttgg tgtttttttt ttgatttatt agggttgttg 2220 

ggttggtgta gattgttttt tttttttttt tattttattt ttttttttgg tttttttttt 2280 

tatagtggga gtttgtgttt ttgttttttg gttggttttt aagtgttttg ttaggttttt 2340 

tttttttttg tttttttggt tttggttttt gattttttgg tttgttggta tttgtttttt 2400 

ttttttgttt tgttttttgt tgtttttgtt tgtttttttt ggtgtttgtt tgggtgttgt 2460 

gtttgttttt ggattgttag ttgtgtagtt gggtttggtt ggttgtttgt gtgttattgt 2520 

gtagtggagt ttggtggaat ttttgttgat gttatgttat tttttatatg gagtaggagt 2580 

agagggaaga gagagggatg agagggaggg agaggagaga gagtgtgaga ttgagtgaga 2640 

aagttggaga ggagtagaaa gaaattgtta gtggtggtta gattttggag gttttagtgt 2700 

atttgtggat ttttttggaa tttggtattt ttaggagttt tgtagttttt ttaggtttgg 2760 

tttttgggtg tttgttgtgt agttggaggt ttggtttgtt ggaaattgtt «gggaagta 2820 

qtgggatgtg gagatagtag tttttttttg gtagttggta agtggaggtt atttattttg 2880 

tagggatgtg agataatgtg agtttggaaa tttgttttat tttggagaat -ttttattgta 2940 

ggtgatttgt ggtttttggg gttaagtttt gtttaaggta atgtagttgg taaatagatt 3000 

ttgtaaagtt ttgttttttt tgttttttgt tatagatatt aataatttat agggtgttga 3060 

agttgagagg gaagttagat tgtggttggt atttaaaatg aggtattttt ttttaaattt 3120 

tqgtgttaat attgtaggaa taaatttttg ggttaaggat tagtattttt aagataaagg 3180 

gttgggtata aagttttagt tattggaaga ttagtttttt ttttattgtt atttattggg 3240 

aaaaaaaaga aaagaaaaag attttatttt aattggtagt tagtgatttt ttaggtttaa 3300 

gtgaattatt tgggagttag gtttggatgt taagttttta ttattttttt ggattgtaat 3360 

ttttttaaat tgattattag ttaattttaa tttggtattt taggagatat attttaaatg 3420 

qatgtagaga attatttttt agttggagat taagaaaaaa atttttgatt ttaaattttt 3480 

iaaaatatgtt tttttttttt agtttaatta ttttattttt ttaagtaatt tagaaattaa 3540 

Ittattataa ggtggtgtga ttttttttta tttttttgtg tgagtattgt tttattaaat 3600 

taaatggaaa aaatttttat tattataaat gtaaatatta gaatttatat attttaaaat 3660 

atttttatga aaaattaatt tgatttaaag aaattttttt gtatttgttt tagtttatta 3720 

attaaaatta aagatgtttt tattatataa aatattattt tggtagaaat ttatttaaaa 3780 

tttaaatatt aataatatta agaaaataaa gtatataagt aaaataaatt gaagattttt 3840 

attgatgtaa tatgagtata taatatttta ataattaaat tttttttaaa aaattaaata 3900 

attattttat ttgtggaatg ttttatttta atttagtaaa attatattta aattatttag 3960 

gtgttttgtt ttttaagtta agtgtgtttg tttttaaatg tttttaaagt atttatatta 4020 

Ittggttgta aagaatgtat atatatggta aaatatagaa ttgaattgag tagtatttta 4080 

atttttttaa ataattattt attataaatt aatttattgg ttaattttat aatttagttt 4140 

atttaaaata tatgtt,tttg tgttgtttat ttttaaattt tttattaaag attttgttat 4200 
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seq_prot.txt 

gfsgfs sssses s&gss ssggg sgsss 
sssss ssssss ssser «w.ssg ss 

<210> 374 
<211> 4435 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 



4260 
4320 
4380 
4408 



<400> 374 

gttttttttt 
tggttttttg 
gaattggggg 
ggagttgatt 
ggaatttgtg 
tgtttgtaat 
tttttagatt 
ttggaagaga 
tgggggggtg 

agggatttta 
agttggtggt 
aggagaggtt 
tttgttttgt 
ggttgtaggg 
tggaagagat 
ttgagtttga 
ggtattttaa 
gatttaggat 
ttttttttta 
agatttttag 
atttaggagt 
ttttttttag 
ggtttttagt 
gatttaggag 
tttttttttt 
gtttagggtt 
tttttaggat 
attttaattt 
ttgatttttt 
aaggaaatag 
ggagtttttg 
attttttttt 

gggtggtttt 

gggtgtggga 
atgtggtttt 
taaaagaagg 
gggtgatatt 
attgagtttt 
gagaagggga 
ggttgagatt 
tagtgtggtg 

tgggggtaga 

attgggtttt 
ttagtttttt 
tggttttttt 
Qtgggttttt 
tgtgtgtgag 
ttgttgttgt 

gggggtgttg 

gtttggtgag 
tgtaagtgtt 
gtttgtttgg 
atggtgtgta 



tggttgttat 
tttttggttt 
tttaaagtta 
ttttggagat 
tttttagtgt 
tttttttttt 
ttgagtagta 
tttatggttg 
ttttttgagg 
ggtgtgtttt 
ggagttggag 
ttgtttttga 

tgtggtggtg 
tttggaaagt 
gtagtttagt 
aaaattttgt 
ttttaatttt 
ttaaggtttt 
gatttaggag 
tttttttttt 
ttaggttttt 
atttaggatt 
tttttttttt 
tttagatttt 
tttagattta 
ttagtttttt 
gttgtttttt 
taagaattta 
ttttttttag 
gagttttatt 
gttttggttt 
atttttagag 
ataaaagtgt 
tttttgggga 
ttttttggat 
ggtgtaggtg 
tatgagtttt 
taaaagagat 
tttttgggtt 
tggtttttag 
aatattgaag 
gggtagtaat 
gagtttttgt 
atttttttgt 
tttaagtttg 
tggtagtggt 
aatatagtga 
tatttttgtt 
tttgggtagg 
gatttgtgtg 
gtggtggtgg 
gtgttgtggg 
tttgggtttt 



ggtttatgtt 
atgaattttt 
tttatatgaa 
ttagaattta 
ttttaaattt 
tgttagtttt 
gtaagatgat 
gggaattttg 
ttgttggttt 
gttttgggga 
ttgggaagag 
gggtggggtt 
ttaaatgtat 
gttaaattta 
atattagttt 
agggatttgg 
ttagaagata 
agtttgtttg 
tttagatttt 
tttagattta 
agtttttttt 
ttaggttttt 
ttagatttag 
tagttttttt 

ggggtttagg 

tttttttaga 
ggaatttagg 
gatttatagt 
gaatgtgtat 
taggaggttt 
tttttttgtt 
gttttggtgt 
tgggtttagt 
atagtttgtg 
gatggggtat 
tgttgatttt 
gggtttttga 
tttgatttgg 
tgagggagga 
gtttggtttt 
ttagggaaag 
ttttagtttt 
ttgtttgttt 
tttaggagga 
gtaggagaag 
agatggggat 
gtgtgtgagt 
tgtagttttt 
ttggtttggt 
tttgggtgtt 
agggagggag 
gtttttgttt 
gtgttttttg 



attaatttat agttattttt 
gggttgttta gattggggtt 
attgtatgta tttttgttag 
gaagagagat ttttagtaag 
tttttgagtt ttgtgatgtt 
ggtgggtttt ttattggtgt 
ggtgaggagg tgggagaggt 
gtagggttga gtgggaggga 
gaatgggtag ttgggttggg 
ggggattttg ggagggggag 
ggaggagagt gagaggggga 
tggatttttg gtgtttttta 
ttttaaagtt gttttatagg 
gtattggttt gtttaggtag 
tagtttttgt agggatgtag 
aggtttagtt gttagttttt 
agaaagatat tttatttatt 
ttaaatttag aagtttgggt 
tagttttttt ttttttagat 
ggagtttagg tttagttttt 
tttagattta ggagtttagg 
agattttttt tttttagatt 
gagtttagat ttttagtatt 
ttttttagat ttaggggttt 
gttttagttt tttttttttt 
tttaggagtt tagggtttta 
gttttatttt tattatttta 
tttttttttt atgatttata 
tttattttgt ttttttagat 
aaggttaaaa ttttgattta 
ttagagtttt tgttttgttt 
tttgttttat gttttatttt 
tttttagtag gaggggaatg 
gtttggattt ttgtatttat 
ttatagatga tggaggttag 
tttagagggt tggaaggatg 
gggtggtttg tatgggggag 
aggtggtttt taaatttttg 
ggggttgggg ggtgggattt 
ttggggtaat aaaagttata 
gttttttgtt tttagagttt 
ggagtttagt tttgaagttg 
ttagtttgat tttttttttt 
aggtagaggt tggtagttag 
gggtttttag ggaggttagg 
tgaatgttaa ttgtattttg 
tttttttgtt ttagtttttt 
tgtagttttt tttggttatt 
ttttaggggt tttttgagtg 
tggggttggt tggtggaggg 
gggtttgttt gtttgtattg 
tttttttttg gtttttttaa 
tttgtgtttg gt taggtgtt 
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ttgtttgtta 

gggggtggtt 

tttggggaag 
tagtgggggt 
ttttaatttt 
atgagtggga 
tttttaggaa 
ggtaggttgg 
gggatggaag 
taaaggttgg 
ggagtttggg 
ttgtttattt 
aatttgggag 
tttgtgtttt 
agattgtttt 
gtagttttta 
ttttttttta 
ttttagtttt 
ttagaagttt 
ttttgtttag 
tttttagttt 
taggagttta 
ttttttttta 
agggttttag 
agatttaggg 
gttttttttt 
tggattaaat 
gatttaggtt 
tttgaggatg 
aattatttta 
atatatatat 
agagttttat 
ttgggtattt 
gaggggatag 
ggttttttaa 
gggtgtttaa 
agttgggatg 
ggttttagta 
ttgggattag 
gtttgggttt 
taaggtaggg 

gtgtttttat 
ttttttattt 
gggtgggggg 

agggggggtt 

agtgagtgtg 
taagttgtgg 
ggtgtttgtt 
tttgttattt 

ggggtgtgtt 

attttgagtt 
atttagatgg 

t^ggtgtttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
_3280 



seq_prot.txt 

tggttgtttg tgtttttttg tttgtttaaa tagtttttat tagtagtggt agtttggttt 3240 

ttattagatt ttttattttg tgtgtgtgtg tttgtgtgtg tgtttttttt taagttttgg 3300 

gggtgttgag ggggaatttt agggaagtga gattgtgtgt gtgtgtgtga gtgtatgtgt 3360 

gtttttgttt gtgtttgaga gtgggggagt taaggggggg tttagaggtg gttaagtgag 3420 

gaaggggtaa gtagtttttt aagtaggtaa ttttttgttt tttatatgta tatattagtt 3480 

attagtttta gaggtgattt agatagatag atagatatag atgttggaag ggggggtggg 3540 

ggggttgagg gtataaagtg ggggtgtgag tgagttaggg agaggtggga ttggatatat 3600 

ggaaaggggg gaggagttgg ggttgaagtg gtagaggggg gtattttggg tgggtggagg 3660 

ggggattttt atggggttgg ggtggtaaga ggatattttg atagtttttg taatgtttgg 3720 

ggtttaattt ttagagtaat atgtgtagtt atgtttttgt ttagtttagg tggttgtaat 3780 

tttgggggag agatagggta ggataggatt aaggaagagg aaggagagat ggagttaggg 3840 

atagatagga ggtttgggtt gttgttgttg ttgttattat tattgttgtt gttttggggt 3900 

ttgttttttg atattggttt tttgagtttt ttttggaatt ttggggttgt tggatgttgg 3960 

gttttggttt tggttttttt gttatttttt taatagaata gggttatgaa aaggtaaggt 4020 

ggggataggg gatgtaggga tggtggtggg aatgtggatt tttaaattta ggatagagga 4080 

agttggtaag aagttttgtg gagggagggg gtttgaatgg tgggtagggg ttttgatttt 4140 

tatttagttt ttttgttttt tagggatttt tttttatttt tttttttttt ttttttgagt 4200 

tttttttttt tgagttttta tattttagat tttttatgtt tttatttttt tgatatttgg 4260 

tttttttttt ttatttttat atatttgatg ttttatattt ttgtgttttt atttttttat 4320 

tttatttttt ttttgtattt ttttttttgt attagttttt ttattgtgag ttggtttttt 4380 

tttttttttt ttttgttttt tttttttatg ttagagttat gagtttgtta ttaat 4435 

<210> 375 
<211> 4435 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 375 

attaatagta gatttatagt tttgatatgg gaggggagag tgagaggaga gagggagagg 60 

attggtttgt gatgggggga ttgatgtagg gaggggggta tagggagaag gtgagataga 120 

gagatggagg tatagggata tggggtatta ggtatgtggg gatggggaga gggagttaag 180 

tgttagagaa atgggggtgt gaaaggtttg aggtgtaggg atttaggaaa aaggggttta 240 

gagagggaga gagagagggt ggaggagagt ttttgaggga taggggagtt aggtgggggt 300 

taagattttt gtttattgtt taaatttttt ttttttatga ggttttttgt taattttttt 360 

tgttttagat ttgggggttt atatttttat tattattttt gtattttttg tttttgtttt 420 

atttttttat gattttgttt tgttgggggg atggtggggg ggttaggatt ggaatttggt 480 

gtttagtgat tttaagattt tgaggagggt ttagagagtt gatgttgggg ggtaggtttt 540 

ggggtggtgg tagtggtggt ggtggtagta gtggtagttt ggattttttg tttgtttttg 600 

gttttgtttt tttttttttt tttttggttt tgttttgttt tgtttttttt ttaaggttgt 660 

ggttatttgg attaggtgag gatgtggtta tatatgttgt tttggaagtt gggttttgga 720 

tattgtagag gttgttgggg tgtttttttg ttgttttgat tttgtgggga tttttttttt 780 

gtttatttgg ggtgtttttt tttgttgttt tagttttggt tttttttttt tttttatgtg 840 

tttagttttg tttttttttg gtttatttgt atttttgttt tgtgttttta gtttttttat 900 

tttttttttt agtgtttgtg tttatttgtt tgtttgggtt atttttgaag ttagtggttg 960 

gtgtgtgtgt gtgaggagtg ggagattgtt tgtttgggga attgtttgtt ttttttttgt 1020 

ttggttattt ttagattttt ttttgatttt tttattttta aatataggta gaaatatata 1080 

tatatttatg tatatatgta tgattttatt tttttgggat ttttttttaa tatttttaga 1140 

gtttgaggag ggatatatat atagatatat atatatagag tggggggttt aatgggggtt 1200 

aggttgttat tgttgatagg ggttgtttgg atagatagga ggatataggt agttggggat 1260 

gtttagatgt ttggttggat gtaggtagga ggtatagaat ttaggtatat attatttatt 1320 

tgggtttggg ggggttgggg ggaaggagtg ggagttttat gatatttagg taggtagttt 1380 

agggttggta tagatagata gatttttttt tttttttgtt gttgtagtgt ttatagatat 1440 

attttttttt tattagttag ttttagatat ttggatatat agatttttat tgggtagatg 1500 

gtagatgttt gagagatttt tgggggttgg gttgatttat ttaggtaata tttttagtag 1560 

gtattggtgg ttgagggagg ttgtgggagg ttgtgggtga gggtggtggt ggtggttgtg 1620 

gtttggagga gttggagtgg gagggattta tatatttgtt gtgtttttgt atatatatat 1680 

ttatttggga tgtgattaat atttagtttt tgtttgttat tgttgggaga tttatagttt 1740 

ttttttttgg tttttttggg gatttttttt tttgttaggt ttggggaggg ggttgttttt 1800 

tatttttagt tattagtttt tgtttttttt ttgggataag ggagtgaggg gttgaaaata 1860 

aaaagggagg aagaagttag gttggggata ggtaggtagg gatttagaat ttggtatgga 1920 

gatattagtt ttagagttag attttagggt tgggggttgt tgttttttgt ttttgttttg 1980 

ttttgaggut ttggaaatag gaggtttttt tttgatttta gtgtttgtta tattaagatt 2040 

taagttgrgg tttttattgt tttagaaggt taggtttgaa aattaggttt tgattttggt 2100 

tttaggagtt ttatttttta gttttttttt ttttagattt aggagttttt ttttttattg 2160 
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ggatttagga 
tgattttttt 
tattttgttt 
ggattttggt 
tttttataga 
tttagtattt 
gttttggggt 
tgtgtgggta 
tagtttgggt 
ttaaggtttg 
aaatttgtgg 
atttataaga 
agggttgggg 
ggtttgaggg 
gttttggatt 
gaggaggtgt 
ttttggattt 

gggggtttgg 

tgaggaggag 
tttgggtttg 
tggggattta 
gaggaagtag 
gttgtaaaag 
agtttttata 
atagattgtt 
aagtttttat 
ggtgataggg 
attttttttt 
ttttgtttrt 
gtttttttag 
ttgttttttt 
gggagatttt 
uttgtgtgtt 
tgggggatgt 
tattgtttgt 
ttagattggt 
tttttagttt 
ggtggggagg 



atttggggat 
ttgtataagt 
ttttagtttt 
ttttattatt 
tgtaggagtt 
tttttttgtt 
agagtgtggg 
gggtagaaat 
tagagttttg 
ggagaatagg 
gttgtaaagg 
tggtgggggt 
ttttgaattt 
aggaggggtt 
tttgggtttg 
tgggggtttg 
gggggaggag 
atttttgggt 
ttgggtttgg 
agggaggaga 
gatttttggg 
gtggggtatt 
ttagtagttg 
tttttgtaag 

tgggtgaatt 

ggggtagttt 

ggtgttgggg 
tttttgtttt 
ttrtttaaag 
tttagttatt 
tttgtttagt 
ttttattttt 
ggtgagaggt 
tgtaggattt 

^ggggatttt 
gggggtgtat 

taatttgggt 
tggttgtgag 



tgtttttaag 
tatttttagg 
ttgaagaaat 
tgtgggtatt 
tagattatag 
agagggttgg 
gtaggatatt 
tttggagtag 
gttttgagtt 
gtgaggtgta 
gagaaattgt 
gggattttaa 
ttgggtttga 
ggggttttgg 
agggaggagg 
gatttttggg 
ggtttggggg 
ttgagggagg 
atttttggat 
ggttgggggt 
tttgatgaat 
ttttttgttt 
agtttttggg 
ggttgaggtt 
ggtgttggat 

tgggggtgtg 
atttaggttt 
tttttttttt 
tttttttttt 
tgtttagatt 
tttgttaggg 
ttgttattat 
ttattagggt 
aggaagggtt 
ttttttaggt 
atagttttat 
agtttgaggg 
ttggtgatgt 



seq_prot . 
ttggagtttt 
gatttaagat 
taatatgttt 
ttattatttg 
gttgtttttt 
gtttggtatt 

ggggtttttg 
gggggaagtt 

ttttgggtga 
tatttttgaa 
agatttagat 
gttttaagga 
gggaggaggg 
atttttgggt 
ggttggggat 
tttgagggag 
tttggagttt 
aggttggggg 
ttgagggagg 
ttggattttt 
gggttggggt 
tttgggggat 
tttttgtagg 
ggtgtattgg 
ttggtatttt 
tttagtgttg 
tgtttttggg 
tttggtttta 
ggagtggagt 
agtagttttg 
ttttttagtt 
tttgttattg 
tggtgaagga 

tgggggtatt 

tttgggtttt 
gtgaatggtt 
tttatggatt 
ggattgtggt 



txt 

ttttaaggat 
ttgtggatgt 
gtattttttt 
gaagggaagt 

gggggtttta 

tttataaagt 
agggtggaag 
agggttaggg 
ggtttttgtt 
aaagaagaga 
ttttaaggtt 
atagtgtttt 
gttggggttt 
ttgagggagg 
ttggattttt 
gaagggttgg 
tgggtttgag 
tttggatttt 
aggggttggg 
gggtttgagg 
tttgagtttt 
tggagttaga 
attttttaag 
attgtgtttt 
ttgaattttg 
ttatggtaaa 
ggtggagttt 
gttttgttgt 
gtatttaggg 

gggggtattt 

atgaattttt 
tttaaaattt 
gagagattgt 
aggggtatag 
taggggatta 
ttaggttttt 
gggaataggg 
ggttaggaag 



<210> 376 
<211> 6001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 376 

gggtatagtg 
tgtttagggg 
tgatttgtgt 
aggggatgtt 
tttagtggga 
tagtggaaat 
tttttttaat 
aggatgaagg 
tataataatt 
tttttgggtt 
gaatgtaggg 
tttagagtag 
ataaaagtta 
atatttttgg 
aggttaaatt 
ttaaaggaat 
ggtttttatt 
ttttgtgttt 
gtattgtgtt 



ttagttattt 
tatttttggg 
ttttattgag 
tatgtttgtt 
tatttagatg 
tatttgtggg 
agggtgtgtg 
gtttttgagg 
ttttttattt 
attagggttt 
aggatgtttg 
aggagggaaa 
tggatttttg 
aggggttggg 
gggtttgagg 

gggtttggat 

ggaggaggtt 
tgggtttgag 
ggtttggatt 
gaggaggggt 
gggtttaggg 
gtgtttgaaa 
tttagagggt 
ttttagagat 
tgattttttt 
gtaggaggtg 
ttttttttgg 
tagttttagt 
tttttttttt 
ttttgttagt 
tttgattttt 
ggaagttttg 
gggtgggggt 
gttttatttt 
gttttttttt 
aatttagtta 
aattatggta 
ggaat 



tgaggaagaa 
tgggaggaag 
ttttgaaata 
ttagtagatg 
tatagaagta 
aatttagtta 
ttattatatg 
ttgatttttt 
gagttattta 
gagatttatt 
gttttgttta 
tgtttttgtt 
aagttttgtt 
tgtattaatt 
atggtagaga 
tttttttttg 
ggagtttttt 
ggtttggtta 
tttgttaata 



atatggaaaa 
ataaatagag 
ttattttgga 
agataatagt 
gtaggttgtt 
tattttttgg 
ggattgtgta 
tagtttttag 
tgtaaaataa 
ttgttttgaa 
ttttgttttt 
tttagtttta 
tagttataag 
ttatttagat 
atgtaaaatt 
tagggtaggg 
tgagaaggat 
ggggttattt 
tttttttatg 



gatagatttt 
gttagtgggt 
aattgtagga 
tattgttatt 
tattaattta 
gaatagagga 
tagttttggt 
tagataagtt 
tggatatgtg 
tgttgtggtt 
ttttggaaat 
tattgtttat 
ttttttttaa 
agggaaaata 
tatagtttaa 
ggagggtatt 
atttttgtgg 
ttttttggtt 
tttttaaatt 
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ttttagattg 
tttgaggttg 
tttttgggag 
ttattttaaa 
agtagttata 
ttaaatggat 
gttaggaatt 
gttaggggta 
atgttttttg 
tttttaatga 
tttgtttttt 
atttttatta 
atggggtagt 
tttttatatt 
aaatgatgaa 
ttgttttaga 
aaaagtttga 

tgggttgtag 

tatagtttgt 



aaaaggagat 

attgtatgtg 

gaatggttgt 

tttttgtgtt 

ttagtttggg 

gtgttgtttt 

aattttagga 

gttattaata 

tatgtttgag 

agttgttgag 

aatttttaga 

ggaaaagtaa 

tggattgtat 

aaaagaaatt 

attttagatt 

attttatgtg 

gtagttttgg 

tgttttatgg 

_tag_tfltggst__ 



2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4435 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140._ 



seq_prot.txt 

gtaatttgag agttgttgta ggtttttagt ttatttgttt tttaaaggag gaaatggaga 1200 

attagtgatt ttaaaggatt tgtttggtgg gtatttatgt tttttaggtt attgtttttt 1260 

tttgtgtttt tggttatttt tgttttttaa ttttgtagga atttaggatt tatgtgtaaa 1320 

tataaagaat tgtatttatt tatttttggt ttttttttat tttgtgtttt taattttggg 1380 

ggggtttttt ttttttattg gggtatttga tgtttgtggg gggtgtttag aaaagtttgg 1440 

tagattagtg attggtggtt ttggaataaa tatattatag tttaaattgg gggttggtgg 1500 

tggagaggtg ggtggatggg ggttataaat ttgtttggta attgtttttt tagttaagag 1560 

agaggagttg aggttttttg gggttggagg attggaattg gttagattgt gggtttaagg 1620 

ggtgaaggta ggttggtagg ggtagttttt ttttgttgtt tataattttt gggtgggtgt 1680 

gtagttgagt tggtttggtt ggttggtgta attttgtgtg ttttttgtat tgattaaaaa 1740 

tgggggttga aatagtaaat gtgaggagga gtaattgttt tgatttggtt tagaagtgtg 1800 

attaatgggg atgtgagttt ttttgtgtga attaattagt gtagggtttg tgatagtatg 1860 

ggttaatggg gtgttgattt ggtgtaggaa taaggtgggg gtttggggtt ggttgtagat 1920 

ttttattgta gtggttagga atgttagttg tttatgtgtt tggttttttt tggttgattt 1980 

attgttttgg ttgttgtatt atggatgttt ttaggtaggt ggttaatttt gggtttggtt 2040 

ttgttaagtt gttgtattta gtaagttttt gtgagtgggt gttgggagtg aggtttaggt 2100 

gggagtatgt atgtgggtgg gtttgtattt ttgtgtgtgg tagttggatt tttgtttttt 2160 

gttttgagtt ttttaggtgt tttgtattag tgtgtatagt gggattagta gttttggtaa 2220 

gtgggttttg ggaagaatgt agttggtgag gaagtttggt gaggtgtgtt tgtgtagttg 2280 

tttttggttt tgattgttgg tgtgaggtag tgtatgattt agttggttgg ggtttgttgt 2340 

tttgttggtg ttatgtattt gtagatgttt gggttgtgtt attttttggg ttggtttggg 2400 

ggttggggtg gggtgaaaaa gaaaaagttt tgatttttgt ttttgttttg tgtagttgtg 2460 

tggtgagttt gggtagtgtg gagtggatgt atgaatggat atagtgtgaa tgagtgatga 2520 

atgggaatga attgatgata ggttttgtat atgtagtgga ttatgttagt tgaaagggat 2580 

tgtaaattta aaggtttgtg tgtgtgtgtt tttttatgtt ttatattagt gtttgtttta 2640 

gttagtattt aggaatattg ttttttagaa atgtaaattt gaaaattttt agtttagttt 2700 

tttttttttt gttttttata gaatttattt ttttttaaag tttttttttt tatttttttt 2760 

ttttttttaa tttgtttgtt ttttaaaggt gtttgagtat tttaattttg gtagattttg 2820 

ggttgtttaa tttaattttt tgtttatttt gttttggagt aaattgagtg attgaatttt 2880 

tttttgggtt ataattagat tttgttggtg gaatttattt tttgtttttt taggttttag 2940 

tttggttttt tttttttttt gttaaattta ttgtttttag ttttatttgt agtgaagaag 3000 

gtaaagtttt tgatgtgttt tgagtttatt gttaggtggt tgttgttgtt gttagatttt 3060 

tattttttta attaggatag agttgataat atgttggagt agtatgaggt atagttaagt 3120 

attttataat tattaattgt tgagtagagt agaaattttt ttgggataga gttttttttg 3180 

tgttgtggta agaatagaga atttaatttt aaaggggaaa ggatttttta tttttttagg 3240 

ggtagtgttt atagtagttg agaggatagg gtttattttt ttttagtggt tatagtttat 3300 

tttaggtgag attttttgtt ttagttgtgg agatgttttt tgtagttttt ttttgtattt 3360 

tttatgtttt ggagtgtttt taatgtttgt tttttatgta ttttgttatt aatttattat 3420 

tataattgta atggtaattg ttattagtaa tataattatt tgttattaat ttttttggga 3480 

ggatttttgt ttttggattg tatgtaattt ggggggtagg agggtgaggg ggtaggtaga 3540 

tgttgttttt tatgtatttt tttgatttag ttgaattgta aatattagag aagtttttaa 3600 

attttttttg attgtaattt atagtgaaaa atatgtttat aaaatgataa agtatatata 3660 

tttaattgta ataaaaataa atgttttatg aaatgatgtg tagtttaatt atttgtgata 3720 

tatattattt tgtattttat tttgttttag tttgttttgt tttattaaaa aaattagttg 3780 

tgagttgggt gtggtggtgt atgtttgtaa tttaagtaat ttaggaggtt aaggtgggag 3840 

gattttttga gtttaggagt ttgaggttgt agtgaattgt gattgtttta ttgtatttta 3900 

qgttgggtga tagagtaaga ttttgttttt taataaaaaa aaaaaaaaaa aaaaaaaaaa 3960 

aattggttat gatttataaa agtaatgtat tttttaattg gaaaaaaaaa aatttattgg 4020 

attagggagg tgtgattttt tagtaggagt ttttaaaatt gtttgtaatt attttggtta 4080 

taaattttta tagaattatt tttaatgaat gttgtttata agaaagatat agttttagga 4140 

gtaagtttaa agattattta tagatgaatg ttttttttta ttaaaagaaa agatttattt 4200 

tattttattt gattagttgg tgttaataag tatttaatat tttagaaaat aataatatat 4260 

ttttagattt agttgttaag gtagtttttt ttttaattta attttttttt tttttttttt 4320 

ttgagagaag ttaagttttg ttatgaggtt ggagtgtagt tgtttggttt tggtttattg 4380 

aaatttttgt tttttgggtt ttagtgattt ttttgttttt gttttttgag taattgggat 4440 

tataggtatt tattattaag tttagttaat ttttgtattt ttagtagaga tggggtttta 4500 

ttatgttggt taagatggtt ttaatttttt gattttatga tttgtttgtt ttggtttttt 4560 

aaagtgttgg gattataggt gtgagttatt atgtttggtt ttgtttaatt ttttattgtt 4620 

aaaatattgt tttttgagat aagttgtaat ttagtttagg ttatagttgt gttttaatga 4680 

tgtttttttt agtaagtatt ttaaaaaaaa aggtgttttt tatttgtaga ttttttttgg 4740 

ttggttgttt ttgtatttat tttgtgttta tttttaaagt tttttttttt ttttttattt 4800 

tttattaagg gaatttttat atatataaag ataaaagtag tgtaatgaat ttttaaggat 4860 

ttatttttga gatttaataa ttattaatat tgtgttagtt ttgttgtagt attttgggat 4920 

gaattttatt tatagtaggt tttttttttt ttttgtttat tttatgtttt ttagtttagt 4980 

attttagatt aaaagagatt ggaaaattga gttatttgta taatttgggt ataatttttt 5040 

tattgttgtt tttttaattt aga,attttat gtatttttta ttttagtttt aggagagtgg 5100 

tgaatgagat ttgtgaaggg ga^atttttt agggaattgg taaatttggg tttttggaaa 5160 

aagtgtatta atattagtta tattgtttaa ttttatgttt attttgtgtg taagttgtat 5220 
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aattgttggt 
tggggatttt 
agtagtgtag 
gagatataaa 
tatttttgaa 
ttagaggtag 
ttgtagtttt 
gattataggt 
aaagatttta 
gaagttattt 
ttttggttta 
tttttttata 

atagaaatta 
t 



tttgtggata 
attgtagatt 
aatatattta 
aggaattatt 
ttttgtgaat 
ggttttgttt 
taatttttgg 
atatattatt 
tgagatggga 
ttttatattg 
aagggaatta 
ggtttttata 
ttttgaaaat 



tgggaaggaa 
tttttttgtt 
aataatttta 
agattataaa 
gtttatgttt 
tgttatttag 
gtttaagtga 
atatttggtt 
ttttattttg 
gttttttaaa 
atttaaatgt 
gatatatttt 
agtataaatg 



seq_prot . 
ggagattggt 
tttttgttag 
ttttttttta 
ggagttggta 
taaaggaagt 
gttggagtat 
tttttttgtt 
aatttttttt 
ttgtttaggt 
gttttgggat 
attaaatata 
taaattttaa 
ttggtgagag 



txt 

tgtgggtggg 
gtttgtatgt 
ttttttttta 
ttagtgtttt 
tttttttttt 
agtggtgtga 
ttagtatttt 
aaatgtttaa 
tgatttgaaa 
tataggtttg 
gaattatagg 
tagtttgggt 
gttttgatat 



gatggaagtt 

ttagagggaa 

ggtgttgtta 

tggtaagatt 

tttttttttt 

ttatagttta 

gagtagttag 

aatatttttt 

tttgtggttt 

agttattgag 

gtttataggg 

tgtatgtttg 

tttagtttat 



<210> 377 
<211> 6001 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 377 

ggtaagttga 
ttagatatgt 
attttatagg 
gtttagtggt 
taggttatga 
taaaaaatgt 
tttagttatt 
gtgaattatg 
aaaaaaaaaa 
aaattttatt 
ttaataatat 
tttttttttg 
aggtttttat 
tatgtaattt 
ttttttaaag 
attatttttt 



gatgttagaa 
aatttaagtt 
ttttgtaatt 
ttaagtttgt 
gttttagatt 
tttaaatatt 
taggatgttg 
attatgttat 
aaaaaaaaaa 
aagaatatta 
ttagaaaaaa 



tgttagattg 

tgttttaggg 

ggtttttgga 

aaaataaaga 

gttagagaag 

attattaaaa 

taataataaa 

tgggaggtta 

gtgaaatttt 

aattttagtt 

ttagtgagtt 

aaaaaaaaaa 

aatgtgttat 

agagtgaatt 

tttttgagat 

ataattaaaa 

tttagtaggt 

tttttttttt 

ttggaatgta 

ttttttattt 

atgattaatt 

atgttatagg 

atgtgtatat 

tttgaaagtt 

atttgtttgt 

tttttagaaa 



ttttatttat 
atatataagg 
gtttgaattt 
taaagttaaa 
tgtttagatt 
ggaagtgtgg 
tgttgtaata 
ggtttattat 
gaaaaagaaa 
gtttatagat 
tatagttata 
gaattagata 
aggtgggtag 
gtttttatta 
atttaggagg 
gagattaagt 
aaaaaaagaa 
tgttttttga 
ttttttttta 
tgtatttttt 
taattataaa 
tttttttttt 
tttttttttt 
gtgggataat 
tagttttttg 
tttttaatga 
taattaggtt 
tttgttattt 
tttttaatat 
ttttttattt 



ttttttatta 
attaaagttt 
ttatatttga 
aattttaaaa 
agtttgggta 
taaaaaaaat 
aggtaggagg 
tgtattttag 
agtttttttt 
atgttaattt 
ataaaggaaa 
tttgatgagg 
agttagtttt 
taagtatgaa 
attgattttt 
gtaaagaatg 
atataggtaa 
ggtgggtagg 
agattggtat 
attattttta 
agttttaaaa 
aggaggtatt 
atttaaattg 
aggttgggtg 
attatgaggt 
aaaatataaa 
tggaggtaga 
gattgtattt 
ttagattaaa 
aatgttggat 
gtaaagagaa 
ttatagatag 
tagttttaaa 
ttagttgaag 
tttttttgtt 
tatagtttat 
agttgtttgg 
aatagaatag 
atatattatt 
tgtaaatatg 
ttataattta 
ttttgttttt 
aaataattgt 



gtatttatgt tatttttaaa 
aaaaatgtat ttataaaggt 
tgtatttaaa ttagtttttt 
ttttgggagg ttaatgtggg 
ataaagtgag attttatttt 
tagttaggtg tggtgatgtg 
gttatttgag tttaggagtt 
tttgggtgat agagtaggat 
gaaatatgga tatttataga 
ttttgtagtt taataatttt 
atagggttat ttaaatatgt 
ggataaggaa gggtttatag 
tttttttttt atgtttataa 
attaggtaat gtgattgatg 
tagaaaatgt tttttttata 
tataaagttt tgggttaaag 
tttaattttt tgattttttt 
aagaaggaag agtttgttgt 
aatgttgata attgttgagt 
tttttgtgta tgtgaggatt 
atgaatataa aataagtgta 
ttttttttta agatgtttgt 
gattgtaatt tgttttaaaa 
tggtggttta tgtttgtaat 
taagagattg agattatttt 
aattagttgg gtttggtggt 
agaattgttg gaatttggga 
tagttttgtg atagagtttg 
aaaaagattg ttttgatgat 
gtttgttagt attagttgat 
atatttattt gtgaataatt 
tatttattga aaatagtttt 
gatttttatt gaaaaattat 
aatgtattgt ttttgtggat 
aagagatagg gttttgtttt 
tatagtttta aatttttggg 
attataggtg tgtgttatta 
attagagtag gatagaatat 
ttataaggta tttatttttg 
ttttttattg taggttatag 
attaaattaa ggaaatatat 
taggttatat gtagtttaag 
attattgatg_ataa.ttgt£a.. 
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atggttttta 
ttataggaag 
ttgggttagg 
aggatggttt 
atgggatttt 
tatttatagt 
ggaggttgta 
tttgttttta 
atttaaaagt 
ttttgtattt 
tttgtattgt 
tgaggttttt 
agttagtgat 
ttgatatatt 
aattttattt 
ggataatgat 
tggtttaagg 
gaataagatt 
tttagagatg 
tttttaataa 
aaagtaatta 
taaggagggt 
aatagtattt 
tttagtattt 
ggttaatatg 

gggtgtttgt 

ggtggaggtt 
atttttttta 
tgggtttaag 
taaataaaat 
tttgaatttg 
ataaaagttt 
atttttttaa 
tataattaat 
attatttagt 
tttaagagat 
tatttagttt 
aaaatagtgt 
ttataattga 
ttaaaagaag 
aaaaagtaat 
ggtaaaaatt 
ttataattat— 



5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6001 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2-S8G— 



<220> 

<223> chemically treated genomic dna (Homo sapiens) 

<400P 378" y/ ' r " ! " ' ' ' ' ' • • : 

seite 341 



seq_prot.txt 

agtaataaat taataatgaa atatataggg aataaatgtt gggaatattt taaaatatgg 2640 

qggqtqtagg aggaggttat aggaaatatt tttatagttg gagtagggaa ttttgtttaa 2700 

aatgaattgt aattattgag aaaaaataag ttttgttttt ttagttattg tgagtgttgt 2760 

ttttagaaaa gtaagaagtt tttttttttt taaaattgga ttttttgttt ttattataat 2820 

atagaggagg ttttgtttta gggagatttt tattttgttt aataattgat aattgtaaaa 2880 

tatttggtta tgttttatgt tgttttaata tattattagt tttattttgg ttagaaaaat 2940 

aaaaatttaa tggtggtggt agrtgtttga taatgggrtt aaggtatatt ggagattttg 3000 

ttttttttat tgtgggtggg gttggggatg gtgggtttgg tgggggaggg gaggagttgg 3060 

gttggggttt gggaaggtag agagtgagtt ttattagtaa gatttgattg tggtttggga 3120 

gagaatttaa ttatttggtt tattttagaa tgaaatgaat agggaattgg gttaagtagt 3180 

ttaaagttta ttagagttag aatatttgaa tgtttttagg ggataagtag gttaaaaaaa 3240 

ggaggggggt ggggagaaag gttttaggaa agggtaggtt ttatggggga taaggagggg 3300 

aaagttggat tgaaggtttt taaatttata tttttgaaaa gtagtgtttt taagtgttgg 3360 

ttaaagtaaa tattggtgta aggtgtaggg gggtatgtat atgtggattt ttaagtttgt 3420 

aatttttttt agttaatgtg atttattgta tatgtaaaat ttgttattgg tttatttttg 3480 

tttattattt atttatatta tgtttattta tgtatttgtt ttatattgtt tgggtttgtt 3540 

gtatagttgt gtgaggtgaa ggtagagatt agagtttttt tttttttgtt ttgttttagt 3600 

ttttggattg gtttaggaga tggtatagtt tgggtgtttg taggtgtgtg gtattggtgg 3660 

gatagtaggt tttggttagt tgagttgtgt attgttttgt attagtagtt agggttaggg 3720 

gtagttgtat gggtatgttt tgttgagttt ttttattaat tgtatttttt ttgaagtttg 3780 

tttgttggag ttgttgattt tgttgtgtat gttgatgtaa agtgtttagg ggatttaagg 3840 

tagggagtgg gaatttgatt gttatatgta gggatgtaaa tttatttgtg tgtgtgtttt 3900 

tgtttgaatt ttatttttgg tgtttgtttg tggggattta ttgagtgtgg tagtttggtg 3960 

ggattaggtt taaagttgat tatttgtttg ggggtgttta tggtgtggtg gttagggtgg 4020 

tgagttagtt aaggaggatt gaatgtgtga atggttggtg tttttggttg ttgtggtgag 4080 

agtttgtagt tggttttgaa tttttgtttt gtttttgtgt tgagttggtg ttttattggt 4140 

ttgtgttgtt gtaggttttg tgttaattgg tttgtgtgaa ggagtttata tttttattgg 4200 

ttgtgttttt gagttgagtt gaagtagttg tttttttttg tgtttattgt tttaattttt 4260 

atttttgatt aatgtaagga gtgtgtgaga ttgtgttagt tagttgagtt ggtttggttg 4320 

tgtgtttgtt tgaggattgt gggtggtggg aagaggttgt ttttgttaat ttgtttttgt 4380 

tttttgagtt tgtaatttgg ttggttttag ttttttaatt ttagaggatt ttagtttttt 4440 

ttttttggtt gaaagggtag ttgttgagta gatttgtagt ttttatttat ttattttttt 4500 

attattagtt tttagtttgg gttgtgatgt atttgtttta gaattattag ttattaattt 4560 

attaaatttt tttaagtatt ttttatgggt gttagatatt ttagtggaga gagaggattt 4620 

ttttaaggtt ggaagtgtag gatgaagaag ggttgggggt gggtggatat ggttttttgt 4680 

atttgtatgt gggttttgag tttttgtagg attaaagaat ggaggtggtt aaggatgtag 4740 

ggaggagtga tagtttggga agtgtggatg tttgttaggt aagtttttta aagttgttga 4800 

ttttttattt ttttttttgg gaaatgagtg agttggaaat ttgtgatagt ttttaggttg 4860 

tgtttgtgtt gataagttgt gagtttagaa atgtgaggag atattaataa gaatgtgata 4920 

tttatagggt gttgtagttt aaattagaag aaggtggttt ttagttaggt taggtgtagg 4980 

gttagagttg tttaaatttt tttatagaag tgtttttttt aaaagaattt tagtggagat 5040 

ttgtatgggg ttttgaaata gaatattttt tttttatttt gtaagaaaga agtttttttg 5100 

aagtttggaa ttttattatt tttaaattgt gaattttgta ttttttatta taatttaatt 5160 

tagttttttt taatgtaaaa atattttttt tgtttgagta aaattgatgt gttaggaata 5220 

tatatagttt aattgttttg tttagagaag gtttgtaatt ggataggatt ttggtttttg 5280 

tttgtttttt ttaatgagaa tgtgggtggt atggggttga gagtagaggt attgttttgg 5340 

atttgggaat tggaggataa agtttttaga aaaaggtaaa atgaataagg tttttgtgtt 5400 

ttttagtaat tttattaaga aaattgtgat atttaagata gaataaattt tagtttagaa 5460 

atttaggtat gtaagaagta ttatatattt attattttat gtaaataatt tagttattat 5520 

atattagtag ttgtttttag tagtttattt attggagatt aagggagtta gtttttattt 5580 

tttttggagt tagtttttaa tattagggtt atatatagtt ttatgtagtg ggttgggaga 5640 

agggatagta tatttatttg gttttttgtt tttaaaagat atggttgagt tgtttttatt 5700 

gtttaagttg atgtggttgt ttgggttgat gagtaattta ttatttttgt attttattag 5760 

gggtatagga atttggggtg gggtggtggt ggttattgtt ttatttgtta agatattttt 5820 

tgtagttatt ttttttgggg gttttgtagt ttttaagatg gtattttagg gatataagtt 5880 

atatatatag ttagttttag gatttattga tttttgtttg tttttttttt gtttttgggt 5940 
aatttttttt ttagtttaga gaaaatttgt tttttttatg tttttttttt atattgtgtt 
t 

<210> 378 
<211> 4001 
<212> DNA 

<213> Artificial Sequence 



6000 
6001 



seq_prot.txt 



ttttattttt 
atttgttaaa 
atttttatag 
tatggattat 
tatttttaag 
tgttattttt 
ggtattgttg 
aattatttta 
ttaaaagaaa 
tgtgttttta 
tattttgagt 
aaaatttgtt 
tttagtagga 
aaaaataata 
ataaatgatg 
gtaatttggt 
aaagatattt 
ttgggaaaaa 
tgtattttta 
attttttaat 
taattagtta 
tagaaattta 
tgaataatag 
aaaatgaaaa 
tttatttaaa 
atgggttagt 
taatgtttat 
gatatatagt 
aggagatatt 
ttttatagat 
ttttagtttt 
tttatttttt 
tttgtaggat 
tatgagtttt 
tttttatttg 
tttggtgtaa 
ttaattttta 
tttatgaaaa 
attttgaaag 
tttatttaaa 
ttattattta 
taatttgtat 
attgaatggt 
tttgtaaagt 
ggatatttag 
attattgtgg 
tttattttgt 
tttttaataa 
agaaatttgt 
gtttaggttt 
tttatttttt 
ttgtttttta 
aagtattttt 
tggttaattt 
aatttttgat 
tgagttattg 
atatttaaat 
ttttttattt 
tttatatttt 
gtttttatta 
atatttttat 
ttaaagttta 
agttagtgtt 
taatgttttt 

atatttaagt 
tagaggaatt 
ttttgtattt 



tttatatttg 
gttgtttaga 
tttagaagta 
gttgttttag 
tgttttattt 
aagtttggtg 
taatattata 
ggatttataa 
tgtaagttta 
agtaagatgt 
ttttatgtaa 
ttggaatata 
aatggtaagg 
aatttaagaa 
attatttttg 
tgattaatat 
ttagaaattt 
gattgaagag 
agggatttat 
gttaataagg 
ttaattaatg 
ttggaattaa 
ttagagaaat 
taatgatatt 
agataattag 
ttgaagatat 
ttgttttttg 
aatttttgag 
gggtttttgg 
ttttttttaa 
tattaaggta 
aattaaagtt 
gtgtttaggg 
tagtaattgt 
tatttaagtt 
ttattatgat 
ggttattttt 
gagagtatgt 
ttgatatttt 
taagagttat 
attaggaaat 
ttgaatttaa 
ttttaatata 
ttagtttttg 
ttttaaggtt 
tttttttttt 
ttagtgttat 
agagtttaga 
taaagaatgt 
ttatattttt 
tttagtgatg 
agttggagtg 
tttgttttag 
ttgtattttt 
tttaagtgat 
tgttgggttt 
attaattaat 
aggtatttta 
tttatttttt 
aattattttt 
ataagaatta 
tatattgttt 
tttaatattt 
agttttttat 
ttttattttt 
tttgtttttg 



tgttatatat 
aagtgatttg 
tattattaat 
ttattttgtt 
taaattatag 
aattaagaaa 
tgttaaattt 
tttgtaaaaa 
tttttgtaaa 
atattatttt 
atttgggttt 
attaattgtt 
aatatgattt 
aattaaagtt 
ttttttttaa 
tatgaataat 
taaaagattt 
tttaagaaat 
atagtgttag 
agggataaaa 
ttaatgttaa 
ttaattagat 
attattaatt 
taagaaattt 
aagttatagt 
agttatatat 
tttggttttt 
gaaaattttt 
tttttgttag 
tttaattata 
tatgtagaga 
ttttaaaatt 
tggttgattg 
agttaatgaa 
ttgggattaa 
gttaattttt 
gtttggagtt 
tgttattttg 
tttataaata 



taaaaaatta 
atgttaagat 
ttatgttttt 
taatttgatt 
agtgttaaag 
ttgttagatt 



agatttttga 
agaaaaataa 
taaagatgta 
tgtgtttggt 
tttgaataga 
tttaatttag 
gtatatagag 
ttttgtaagg 
agatttatgt 
aaagttgttt 
ttagattttt 
tttttttttg 
tagtggtata 
tttttgagta 
ggtagagata 
ttatttattt 
ttttagtgat 
tggtagtata 
tttttttttg 
ttttatttga 
gtttggttta 
tgaatattag 
tttaattttt 
gatttatttt 
ttgtgttttt 
gttgaaattt 
tgtttagaga 
atatatatta 



agaatgtgta 
tgaagtttgg 
tttttttttt 
ttaagttagt 
tatattaaaa 
tttttaatat 
atttaaaatt 
gaaattaata 
ataatttaag 
tgaattggta 
tttttttaaa 
aaaatattta 
tttatagttt 
aaggtttttt 
gataattaag 
ttgggtaagt 
ttgatttttg 
tatgttatta 
tatttagttg 
tatttaaaaa 
ggaagttatt 
agattttgat 
ataatatttt 
agtgttgaaa 
ttttggaatt 
attttattta 
tggatttgaa 
gtttttattt 
taggaaattt 
attatgatgt 
atttttaggt 
tatgtaattt 
agagatattt 
aaatagatat 
aatatatatt 
tgagaaaggt 
ttttggtttt 
aaatatgtat 
tgttatttta 
attatgattt 
aattggttta 
gatgtaaaaa 
tgtgatttat 
gttgtttttt 
ggggatgagt 
attttattgt 
gttgggatta 
ggagttttat 
tggtttttaa 
gattgatgtt 
tttttaatgt 
ttttattttt 
atgattatat 
tgattttata 
ggaattttta 
ttagtgatgt 
ttttattttt 
gtagatattt 
tttttaaaat 
atttttgttg 
tttttataat 



tatattagag 
ggtgtttttt 
ttttataatt 
tttaatattg 
atgttggatt 
attttgtgta 
aagattagaa 
tgttatgaat 
gagaaagtag 
tatttttgtt 
agaaatgtat 
tttttttata 
ttaatgattt 
tgaaaaatta 
aatatagtaa 
ttaagttaag 
gtaaaatatt 
tagagatgaa 
tttatatgtg 
attttttagt 
gttagttgta 
gatttattat 
tagtatttta 
tttaagtttg 
aataagtgaa 
tggataataa 
taaaagaatg 
aggtatttaa 
attttatgat 
agagaatgag 
ttttttatat 
tattattttt 
gagggatttt 
gtttttatat 
tatttaatga 
taatttgatt 
gatggtgatg 
tattgagtta 
aattagttaa 
agagtatttt 
tatttagatt 
atgaggtaat 
tttttttttt 
gttatggttt 
aaatttgtat 
tttaaaagtt 
gatatagatt 
gtaatttttg 
tttttgtgat 
tttagtatgt 

gggggagatg 

agtttatatt 
taggtgtgtg 
tatgttggtt 
aagtgttggg 
attattattt 
agtagattta 
ttttttttat 
agaaataatg 
gtggatttta 
tgaaaatgtt 
ggttatataa 
ttaattattt 
tatgtatata 
aataaataga 
gtatggtata 



aatttaaaag 
tatttgtatt 
tggagttata 
ttattgattt 
atttttataa 
atattataat 
gttgatagta 
tatgttgttt 
attattttga 
atttagtttt 
attgtaattg 
tattattttt 
ttattgtaaa 
tgaatttttt 
tgtattatat 
taattattta 
tgatttttga 
agggagtttt 
tttatagttt 
gtttattttt 
aaaaagtttt 
tgtaaattta 
ttaaagaaga 
tatgagaata 
ttattttatt 
aataatatta 
tttttttaat 
gtattttagt 
tatagttaaa 
taaaagtttg 
ttttaaatat 
ttttgggttt 
agttttgttt 
tagtagagtg 
ttgaagggta 
ttattttgtt 
ataaggttag 
tatatataat 
gtatttgata 
taggtttata 
tttttaaaat 
tttaagaagg 
ttattttatt 
gttaggtgtg 
agttttttag 
tttattttta 
attaaggatt 
tttaatataa 
aaataaattt 
attttaatta 
gaattttgtt 
ttttgggttt 
ttataatgtt 
aggttggttt 
attataggta 
aagaattttt 
ttttgaaaat 
ttgtgttttt 
tttttaggat 
tttttatttt 
tattgtgttt 
ataatataaa 
tatttagaag 
taaataaatt 
atataaaaat 
aattatattt 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4001 



seq_prot.txt 



<210> 379 
<211> 4001 
<212> DNA 

<213> Artificial Sequence 

<220> . . 

<223> chemically treated genomic dna (Homo sapiens; 



<400> 379 

aattataaaa 
ttaatagaga 
tattttaagg 
agagtatttg 
aggaaatggg 
tatattatta 
ataagaattt 
ttatgaaatt 
tatgtaatta 
aaagaataga 
tatattgaaa 
tggtattaat 
tttggaagtt 
ggtgaaattt 
gtagttttag 
tgtagtgaga 
tatttgtttt 
aaggaaataa 
aataaattat 
ttttttgtat 
ttaagttagt 
agggttataa 
ttagggtgat 
tatatatatt 
aaaaattaaa 
tatttttttt 
aggtgtgtgt 
tatatttatt 
taaatgtttt 
aaaattatgt 
tatttgaaaa 
gatgttatgg 
ggaatttttt 
taggtgagga 
ttttggattt 
atgaatgaaa 
agattttaaa 
tttttagtat 
taagatgtta 
tgttaggatt 
gagtggtttt 
atttttaagt 
ttagttgaat 
atagtagtat 
ataaaaatta 
tatttgttta 
ttttggttat 
taaaagattt 
agaattatgg 
gtagatattt 
ttttagaaaa 
atattaattt 
ttttgaatta 
atgttaattt 
aaattttaag 
aatgttagaa 
tttttaatgt 



gtaatatatg 
ttttttaagt 
agaattttaa 
tagaaatata 
gaagataagt 
aaaaaattaa 
tttaatattt 
ataagttaaa 
tttagataaa 
ataaagaaga 
atatattgtt 
tattattagg 
gaggtgggtg 
ttgtttttat 
ttatttggag 
ttgtgttatt 
ttaaaaagga 
tggaaatttg 
aaaataattt 
tatatagatt 
tttttatgga 
tttttgtgtg 
attgggttgg 
ttttgtttag 
aattaggtat 
atgtattttt 
tttatttttt 
tttggaaatt 
ttgtttatag 
ataaagtgat 
tggttttaaa 
taaaaattga 
gttgatttta 
ttttattaat 
atagttaatt 
taattttggg 
gtttttgtat 
ttatgattga 
tttaatagaa 
taaaaatttt 
taggagttag 
gatgtatagt 
gattatagtt 
ggaatttttt 
aagttggtag 
agtttaattg 
tttagtatta 
tttttgtttt 
gttgatatta 
tgttttttaa 
ggaatttttt 
aattatttat 
tttaaaaaag 
taattttggt 
tgagttata.t 
aagtaattaa 
gag?ftttaaa 



tttgttgtag 
ataaaggtag 
tagggatgga 
agtaagaaat 
taggtgttaa 
aaaatagtat 
ataattttta 
taaaaataat 
aaagaaataa 
gtgaaatgtt 
agttggttga 
aaggtttggt 
gattatttga 
taaaaatata 
gttgaggtag 
gtattttagt 
gtattattaa 
aaaaggtgta 
tgtgtttttt 
ttttaaattt 
agtggtattg 
taaaaaagaa 
gaattttgga 
gtagaaattg 
gtatattgag 
attaaattta 
tgttttttaa 
tatagttttt 
gaaagtgtta 
agtgtatttt 
taaaaatgat 
tgttatggta 
gggtttagat 
tgtagttatt 
gttttgaata 
ggattttaat 
atattttaat 
attagaagga 
gttagaaatt 
ttttaagaat 
atagagggta 
tatattttta 
tttgattatt 
gaatgttgtt 
tattttttta 
attagtttta 
gtattaattg 
tttttattag 
tatgggtttt 
gttttttaat 
aaagtttttg 
aatattggtt 
ataagagtgg 
ttttttaagt 

tttttagtat 
ttajgtttta 
tttatatgaa 



aaagtataga 
gggttttttt 
gatttaaata 
tggaaatatt 
aggtattgat 
gtaaatttta 
tatagaaatg 
ttaataagag 
aaggatgtaa 
tagatggaaa 
tatttaaatg 
atggtggttt 
ggttaggggt 
aaaattagtt 
gagaagtgtt 
ttgggagata 
gaaaaggtga 
agaatttaaa 
ggtaagtttt 
tttgttgaag 
agtggagtag 
gttatagtag 
gttaagtgtt 
agttttataa 
aattatttag 
gatgtaaatt 
ttaaatagtg 
gtttaggtaa 
gtttttagga 
ttttttatgg 
ttaagggttg 
attatattaa 
gtaggtggaa 
gagaatttat 
tattttgtaa 
tgaggagtaa 
aagaattgag 
agtttgtaaa 
taatgttttt 
tattgtatgt 
agtagatatt 
agttggttta 
ttttgaatag 
gtttttattt 
gttattattt 
atgagttttt 

gtggttgatt 
tgttgaagaa 
ttaggaatgt 
ttttttttta 
aaaatatttt 
aattaaattg 
ttattattta 
ttattatttt 
tttttgt^ga 
gagtagattt 
aafttaaagt 



gaagtataat 
aatttttgta 
taatttgttt 
gtttttgagt 
ttttgtatta 
aaaatataat 
tgaaataaaa 
tattttagaa 
agaaggtata 
gatttttaaa 
tagaaatttt 
atgtttgtga 
taaattagtt 
aggtattatg 
tgaatttggg 
gagtgagatt 
atggttggga 
taaatttgtt 
tttatgttaa 
aggtttttgg 
ataaagatag 
tttgaagagt 
ttatatttgg 
aagtgaaatg 
ttttttttag 
aattttagaa 
gtataagttt 
atattaggta 
tgttatgtgt 
gttgattttg 
aaataagata 
gtatttttta 
atattttgtt 
aagatgaagt 
aaagtttaga 
aatatttgag 
ataggttttt 
atttggttgt 
tgttgagatg 
tattggaaag 
atgatattgt 
tgataaagtg 
atgtttttat 
ttttttttta 
atgaatttgt 
ggaaattttt 
gggaaataga 
taggttatgg 
agaagttttt 
gttaaaagtt 
ttaaataatt 
tatgtaatat 
taaaaggttt 
ttttgtagta 
aaagggtagt 
ttagttgtag 
aagagttggg 



ttatattata 
ttttgtttgt 
gtatatatat 
agaataattg 
tttatgtagt 
aaatattttt 
ataaaattta 
atattgtttt 
aataaaagag 
ataagtttgt 
tgaataatgg 
ttttagtatt 
tggttaatat 
gtgtatattt 
aggtgtagat 
ttgttttttt 
tgtatattgg 
tattatagaa 
ataagaattg 
tagtttgtat 
gaatttttga 
tgtgtaggtt 
taagttatga 
agaaaaaaaa 
aattgtttta 
aagtttaaat 
ggaaatgttt 
tttagttaat 
atggtttaat 
ttgttattat 
agattaaatt 
attattggat 
ggtataaaag 
taaaattttt 
gaaaggtaat 
aatatgagga 
atttattttt 
gattataggg 
tttgagtgtt 
atgttttttt 
tttattattt 
gtttatttgt 
gtagatttga 
ataaaatgtt 
aatgataagt 
ttgtagttaa 
tattagaaaa 
gtatgtgtga 
ttttattttt 
agatattttg 
atttgattta 
attgttatat 
gtggtttttt 
agagttattg 
gtgtgagaaa 
tgtatatttt 
taatagaaat 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 



seite 343 



ggttaattta 
taaagagaaa 
attaaatttt 
ttgtatattt 
tagtttagta 
tttttaatat 
aaattaaatt 
ggaggatatg 
tattttagta 
gtagtttttt 



gaaggtaatg 
atttatagga 
atttttatag 
tggatttggt 
attttttggt 
tttgtggttt 
agataagata 
gattaatggt 
ttagattatt 
aatatatgat 



tatgttttgt 
gtgaatttat 
gttgtgaatt 
atataatgtt 
ttattaggtt 
aaggtggggt 
attgaaatag 
atatttttag 
ttttgagtaa 
ataggtgtaa 



seq_prot. 
ttaaaagtat 
attttttttg 
ttagagtagt 
atagtagtgt 
tagagataat 
atttaggggt 
tatgatttat 
gttatagggg 
ttttggtaaa 
agaaaataaa 



txt 

attaaagtaa 
agaatagtat 
ttgttattaa 
tattgtaatg 
attgtagaaa 
agaattaata 
gtgtgatttt 
tagtataagt 
ttttttaaat 

g 



tttatttttt 
ggtttatagt 
tttttgattzt 
ttgtataaag 
tgatttagta 
ataatgttag 
aagttataaa 
ggaaggatat 
tttttaatgt 



<210> 380 
<211> 4418 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna CHomo sapiens) 



<400> 380 

agattttatg 
gaaggttttt 
gaagggtttt 
tgttttttgg 
tttgttatta 
agttttgtgt 
attttttaga 
tggtaggttt 
gtttatattt 
ggtggttagg 
gggttggtgt 
tgtattttga 
tgttgtattt 
ggttgttgta 
tgtgttttta 
gttaggttta 
ttaggggttt 
tatgttgtat 
tggttattat 
agttgttttt 
tggggtagtt 
gtgatgtatt 
tattattttt 
ttttagttag 
gtttatttag 
ggatgtgtgg 
ttttttttgt 
tttagggttt 
tatttttttt 
ggtatagagt 
atgtttgaga 
tttttttaaa 
tttttaggag 
gtttgatttt 
gttgtatttg 
taagaggttg 
tttttttgtt 
tttggttggt 
attttttgtt 
atgtgggttt 
tatgaaggtt 
tttgtagttt 
tttttttgtt 
tatttaagga 
tgttattgtt 
agttttgttt 
tattgttttt 



gaaggggata 
ggttttataa 
ttttagatgg 
gtaggagatg 
aatatatggg 
ttgagaagtt 
ggtatttata 
tgatttgttt 
tggatttttt 
gtttttagag 
tgagttgggg 
tgttatggga 
tgtgaattat 
tttgttttat 
taatgggtta 
taggagtttg 
ttagttgtat 
atttaggtgt 
tttggtaagg 
agggtttttt 
tttaaatgat 
ggtttaattt 
gttttttatt 
gttttggggt 
ttgggttagg 
gaggatgggg 
gttttgtgtt 
gtttgttttt 
ttttttgttt 
ttttggtttt 
tgtttttttt 
tttatttatg 
tttgttttgt 
ttttttagtt 
aagtggtgtt 
ttggggtgaa 
ttttgttttt 
gaaggtggta 
taggtttaag 
atagtgtgat 
ttggttttat 
ttagtttggt 
tggggtgggg 
agattttgtt 
attttgatgt 
ttttttagtt 
agtttatttg 



gggagttggg 
agtggttttg 
ttttattttt 
tagggtgaga 
ggatatatat 
ttatagtatt 
ttgttttttt 
tttttattag 
atttaagtgg 
tgggtttatt 
tgaggagggt 
tgttatatgg 
ggtagtggtg 
tatgttgttg 
gtatttaggg 
gttatgtttg 
tttttttttt 
aggattatga 
aataggttat 
ttggtttgag 
ttttaatttt 
tttgtttagt 
gtggtagtta 
ggttgggttg 
aaagtttttt 
atagtattta 
tatttggatt 
attattgatt 
agaggagaaa 
ttttggtgtt 
ttttttaggt 
taaattttgt 
agagattttt 
ttttttgagt 
tgtaggtgag 
aggggtgttt 
ttagtttttg 
taaaggtagg 
ttgtgtaagg 
aggattattt 
tttttgttat 
ttagttgggt 
tttggtttta 
tgttttgttt 
attagttttg 
tatagatttg 
ttggttttta 



ttttttatag 
ggtatttagg 
tagtattgat 
gtggggattg 
tgtttggtat 
ttttttgatt 
tttttgtagg 
attggggttt 
ggatagttag 
tggtagtagt 
gttaggttta 
gttatattta 
tagggtatgt 
attttttgtt 
gatatttttt 
tgttggaagt 
ttagtttttg 
gtaggaggtt 
ttattattat 
tagggttgag 
gtatttgtta 
ttttttttat 
tttttatttt 
ggttttaggt 
ggaagtgtag 
gtatttatat 
agttttaggt 
ttattaagtt 
tttaaaattg 
ttttgatttg 
tttttttata 
tttttggagg 
tggttttttg 
taggtttagt 
ggaggtgatt 
tgggggtggg 
ttttgtgttt 
tggttttttt 
tggatatgga 
ttgggggtgg 
ttattttaat 
atagggttta 
ttttgttttt 
atattgagtt 
tttattgggt 
tatttttttt 
tttttgtttt 



Sefte 3,44 



gtatttgttg 
aagtgtttgg 
gataggttgg 
gattttagga 
atagttggat 
ttatagtagg 
tgttgggttt 
tggatggata 
tgtggtggta 
tatgttgggg 
ttttagggat 

oggggatgat 

gagtttggtt 
ggatatggat 
ttttttttgt 
tttgggtgtt 
ttttgggttt 
ttaggttagt 
gtgtaggttt 
aggatattta 
gtttagatgt 
tttttttggg 
tttttttttt 
tattttgtgt 
gattttgtta 
tagatagaat 
tttagttatt 
tttttttaag 
aaatttttaa 
ggtatatttt 
gtggggtttt 
ttttagttta 
ttttgtattt 
agtttgagga 
atgttgttta 
agatgtgggt 
ttttgtttat 
ggttatttat 
gaagtgtttt 
gatgggtatg 
tttggtgttt 
ttttttgttt 
gtttattttg 
tgtagtatag 
ttttggtggg 
gtgtaggtgg 
atg tttag ga 



agaaaggtag 
ggtggaagtg 
tgatgagtgt 
tgttgggatt 
tttgttaatt 
gtgtagttat 
tttaatattt 
ggttagtttt 
ttgaggatta 
ttattattag 
gtggaagttt 
gtttttaaag 
atttattttt 
tggatagata 
gttggaggaa 
ttagttaagt 
tagttgggtt 
gtggttgagg 
ttattattga 
ggggatagaa 
ggtttgttgg 
atgtttaatt 
gttaggaagg 
ttagttagta 
gttagtgttg 

ggggttttaa 

tttaggaaga 
tgttagtttt 
tgtggatggg 

ttttatgatt 
ttggaatgtt 
gttttggtat 
tgtgtaggaa 
agtgagggtt 
tggttttgtt 
aaggggttgt 
tatttattgg 
tgttttagtg 
tgtttgtgtt 

tgggtgttgt 
tataaggttt 
atttatattg 
attatttttt 
gtgtggtttt 
tttgggtagt 
ttttttggtt 
ttttatgttt 



3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4001 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820_ 



seq_prot.txt 

tgttggtgtt gtttgggtta tggttattgt ttatttgggg tttatggaaa tgtggttttt 2880 

gttttttatt gttgtttgtt ttgggaatgt ggtttgaagt ttaggatttg gtagatgggt 2940 

gtaggtgggt ggttggttgt gtttttgttg tgggttatta ttgttttgtg tatggttgtt 3000 

agtttattga gtatgattat tggtgtttag gttagttgta ggttgaatat gtttttgaag 3060 

tggtgttgta attgtagagg gagggttagg gttttrtgtt aagttaggat tattttagar 3120 

tataggtttt agttttattt gaattttgga tgatttttgg ggttattagg agtgagtagg 3180 

tggaaggagg agatttagtt ttttgatttt ggggtggggg tgggggttat attttttgtg 3240 

atggaggaat ttagtttgga tgtgttattt aggtatgatt ttgtaagagt tattaaaatt 3300 

gttgagaggt tttagttagt attttatttt tagatgatgg tttatgttgg tgagtagtga 3360 

ggtttgagga tttatagtgt aaaaggtttg aattgggtta ttgtattttt tttatttttg 3420 

attttgtgat ttaaatggta tttaggatta atttattttt tatttttaag gttttttttt 3480 

ttggtgttag tagaagggat tttgtatttt ataatatatg ttgtttaatg ggtttgtatg 3540 

tttattgtta agtttagttt tatttttagg tttttgtttt attttttttt ggtttttgga 3600 

aaatttagtt ttttatgtta tgtataaatg ttttttttta ggatgttttt taaatttgtt 3660 

tttttttttt agtttggttt ttgatttagt ttgtggttta atttattatt tatgtttgtt 3720 

ggtggtgggg tatttttagg atttttgttg ttttttagga tttttttttt tatttggttg 3780 

aagtagtatg gtgtgttttg gaagtttata tgtagtaagg ttgtttagtt tgggtagtgg 3840 

taggggattt ggtgggtagt gtgtagttta gtgttggtgt tggtgtatta ggtttattag 3900 

gagtaggaag atggttatta ttatggttag gggtattagt gtttttagtt ttatggttgt 3960 

tttattatta attgggtttt tttggatata tttggtattt ttattttatt aggtatagag 4020 

gattaggtag gatatttttg gtatattgag tgtgtgattt tttttttata aagggagttg 4080 

atgatggttt ttgttttttg ttgtgagtga atttgttgtg ttgattgtgt tgttagtggt 4140 

agagttaggt tagggtaggt atgggttgtt ttagaggttt ttgttgttgt tttttgtttt 4200 

aggtttttat ttagggtagg gtggtagaaa ggtttggttg gagaagttat tttttttttt 4260 

tattttaagt tttttaagtt tatataggtt tttgggataa ttagggtttt agtggatttg 4320 

gttatttatt ttttagttag gtttatatat tttaatgtag ttataatttt ttttttagaa 4380 

tatgattttg tttttttttt atttttattt gtttattt 4418 

<210> 381 
<211> 4418 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA CHomo sapiens) 
<400> 381 

gagtgggtag gtgggggtag ggaaagggta aggttatgtt ttggaggagg ggttgtgatt 60 

atattagggt gtatgagttt agttgggagg tggatggttg ggtttattga gattttggtt 120 

attttagaag tttgtgtggg tttggggagt ttggagtggg gagagggggt gatttttttg 180 

attaggtttt tttattattt tattttgggt aagggtttgg agtaggaagt agtggtaagg 240 

atttttggag tagtttatat ttgttttggt ttgattttgt tattggtagt atagttaata 300 

tagtaggttt atttatagta gagggtgaag gttattatta gtttttttta taagggaagg 360 

gttatgtgtt tggtgtgttg agagtgtttt gtttggtttt ttgtgtttgg tggggtgggg 420 

gtgttaggtg tgtttagagg agtttagttg gtagtgaggt agttatgggg ttagaagtat 480 

tggtgttttt ggttatgata gtggttattt ttttgttttt ggtggatttg atgtattggt 540 

attaatgttg ggttgtatgt tatttgttag gttttttgtt attgtttggg ttgggtaatt 600 

ttgttgtatg tggattttta gaatatatta tattgttttg attaggtgag ggaggaggtt 660 

ttggagggtg gtagaggttt tgaggatgtt ttattattag taaatatggg tggtgggtta 720 

aattataggt tggattagaa gttaggttga gaaggggaag taggtttggg ggatgttttg 780 

gggaaggata tttatatatg gtatgaagga ttggattttt taaaggttaa ggaagagtag 840 

ggtaagggtt tggaggtgga gttggatttg gtagtgggta tgtaagttta ttgggtaata 900 

tatgttatgg agtataaagt tttttttgtt gatattagaa ggaaaggttt tgggaatgga 960 

agatgagtta gttttgagtg ttgtttaaat tatgaaattg aggatgaagg gggtgtagtg 1020 

atttggttta aattttttgt attgtgggtt tttgggtttt attgtttatt ggtatggatt 1080 

attatttggg aatgggatgt taattggggt tttttggtaa ttttggtgat ttttgtaagg 1140 

ttatatttgg gtgatgtatt taaattgagt ttttttatta tagaaggtgt gatttttatt 1200 

tttgttttag gattaggagg ttgggttttt tttttttatt tgtttatttt tggtagtttt 1260 

ggggottgtt taaggtttaa ataggattag gatttgtagt ttggggtgat tttggtttga 1320 

taagaggttt tgattttttt tttgtagttg tggtgttgtt ttggggatgt gtttagtttg 1380 

tagttggttt ggatgttggt ggttgtgttt aatgggttgg tggttgtgtg tgaggtgatg 1440 

gtgatttgtg gtgaggatat ggttgattgt ttgtttgtgt ttatttatta ggttttgggt 1500 

tttgggttgt gtttttaagg taagtggtgg tgggggatag agattgtgtt tttgtgggtt 1560 

ttgggtggat agtgattgta gtttaagtag tgttgatagg gtgtggggtt ttggatgtga 1620 

aatagagata aaggttagtg agtgggttga ggatagtggg ttaggaaatt atjrtgtatgg 1680 

gggaggtgtg agtttgtggg ttgggagggg gtggggttat tgtttagatt tgttagaagt 1740 

ttggtgggtg aggttgatgt gttgaagtgg tggtggtggg gattgtgttt atgttgtggg 1800; 

• s ,f j f '" • ? fi' , : . seite' 345 ' - ""■ ' * iAl 



seq_prot.txt 

tttagtgtgg gtgggatggg tgggattttt tttgagtgga aaggtggtta gggtgggtag 1860 

agatgaggtg gggttaaatt ttgttttagg taggggagta atgtgggtga gtaaagagtg 1920 

ggttttgtgt ttagttggat tgggttaggg attgtgggag attttgtgga gtgttagggt 1980 

tggagtgggt ggtggagggt ggggttaagg tttttatggt aatgtttatg tgtttgtttt 2040 

gtttttaggg gtgattttgt tgtgttatgg gtttgtgtgg tgtgagtaga ggtgtttttt 2100 

tgtgtttatt ttgtgtaatt tgggtttggg taagaagttg ttggagtagt gggtgattga 2160 

ggaggttgtt tgtttttgtg ttgtttttgt tgattaagtt ggtgggtgat gggtagaagg 2220 

gtataaagtg ggaattggga aggtggggga tggagaaggt aattttttat ttgtattttt 2280 

tatttttagg atgttttttt tgttttaatg gttttttgga taaagttgtg agtaatgtga 2340 

ttgttttttt tatttgtggg tgttgttttg agtatgatga tttttgtttt tttaggttgt 2400 

tggatttagt ttaggaggga ttgaaggagg agttgggttt tttgtgtgag gtgtggagtg 2460 

agagattgag gagtttttgt agggtgagtt tttgagaggt gttggggttg gattggggtt 2520 

tttgaagggt aggatttgta tagatgggtt tgggaaagga tattttagga gattttattg 2580 

taagaagggt ttggaggagg aggggatatt ttagatatgg ttgtgggaga ggtgtgtttg 2640 

ggttaggggg tattaggaga ggttaaggat tttgtatttt tgtttatgtt ggagattttg 2700 

attttaggtt ttttttttgg gtaaggagag agagggtgga ggttggtatt tggggaggga 2760 

tttggtgagg ttagtggtaa ggataggtag gttttgggtt tttttggaga tggttggggt 2820 

ttgagattgg tttaggtgaa tgtagagtat aggagggatt gagattttgt tttgtttggt 2880 

gtaggtgttg aatgttgttt ttgttttttt gtatatttta gtgttggttg gtaaggtttt 2940 

atgtttttaa aaggtttttt tgatttagtt ggatgagttg ttaattgagt ataggatgat 3000 

ttgggattta gtttagttat tttgagattt gattgaggtt tttttggtaa agaaggagaa 3060 

ggtgagagtg gttgttatgg tggggggtaa gggtggtggg ttgaatgttt taggaggaat 3120 

9*ggggaggt tgggtaaaag gttggattag tgtattattt ggtgagttgt atttgggttg 3180 

ataggtgtag aattggaggt tatttggggg ttattttgtt ttattttttg agtatttttt 3240 

tggttttgtt taggttaagg ggagttttga gagtagtttt aatgatgaga atttgtgtat 3300 

agtggtgggt aatttgtttt ttgttgggat ggtgattatt ttgattatgt tggtttgggg 3360 

ttttttgttt atgattttat atttggatgt gtagtgtgag tttagttggg gtttaaggta 3420 

gggattgagg gaggaagggt atagttgggg gtttttgggt ttagttggga tatttggggt 3480 

ttttagtata ggtgtggtta ggtttttgta agtttaattt tttttaatat aggaggaagg 3540 

agagtgtttt ttgggtgttg atttattgtg gggatgtatg tttgtttagt ttgtgtttaa 3600 

taggagattg atgatgtgat agggtaggtg tggtgattag agatgggtga ttaggtttat 3660 

atgttttgta ttattgttgt gatttatgag gtgtagtgtt ttggggatat tatttttttg 3720 

agtgtgattt atatgatatt ttgtgatatt gaagtatagg gtttttgtat ttttaaggta 3780 

ggtttggtgt tttttttatt ttagtttagt attagttttt ggtgatagtt ttagtatggt 3840 

tattgttagg tgggtttatt ttaggaattt tggttattta gtttttaatg ttattatatt 3900 

gattgttttt gtttggatgg ggggtttaga gtataggtag ggttggtttg tttatttaga 3960 

gttttagttt agtggggaag ataaattagg atttgttaga atgttggagg atttagtgtt 4020 

tgtagggaga gggggtagtg tgggtgtttt tgagaggtgt gattgtgttt tgttgtgggg 4080 

ttggagaggg tattgtggag ttttttgggt gtaggattag ttgatagagt ttagttgtgt 4140 

gttaggtagt gtgtgttttt tgtgtgtttg gtggtagggg ttttagtatt ttagagttta 4200 

gtttttattt ttattttgta ttttttgttt agggaatgat atttattatt aatttgttat 4260 

tggtgttgaa ggatgaggtt gtttggaaga agtttttttg tttttatttt gaatattttt 4320 

tggatgttta gggttatttt gtgaagttgg aggttttttt gtttttttta gtaggtgttt 4380 

gtggggagtt tggttttttg tttttttttg tggagttt 4418 

<210> 382 
<211> 4398 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 382 

ttgttttaaa aaatatttta aaaaaataaa taaaagaata tttattatta ttaatagaat 60 

aaattgtgat atatttataa aatgaaattt tatagaagaa taagaggata aaaggaatta 120 

attattgata tgtataatat ggattaattt taaaaatatt ttttgagtag ttagatatgg 180 

tggtttatat ttgtaatttt aatattttta gaggtaaaag taggatgatt atttgagttt 240 

aggagtgagt ttatagtgaa ttgtgattgt gttattgtat tttagtttgg gttagagttg 300 

gttttattta tatgaagttt aaaaaaaggt aaaattattt atttatgatg ataaaagtta 360 

gaataagaag gtggtaggga ttgatagagg gtataaggga attttttggg atgatgtaat 420 

attttgtata ttgagggagg tgtggttata tagtatatgg gtttgtttaa atttattaaa 480 

H2i a 2HJJ taatatttat atattggaga atatatttta tagaagtaga tgatagaaat 540 

99ttattttt ttatataatt tgtaaaagaa gtgtaatatt gttgattatt ttttttttat 600 

??^H3 tat: * a *!H9 g F tta 9tgagat tatattggtt ttttttttta tttggattat 660 

52??5S5 SJBFI*" H t H t " tt Wtttatgtt tttatatttt aattataatt 720 

t aatt ttt tt tttttataaa ttattattta tgggttgaag atatttaaqt ttattaaatt Z80_ 
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ttatatttaa 
ttttaaattt 
aaaaaatttg 
ttttattatg 
gttttttaaa 
taaatatttg 
taatggagta 
ttattgatat 
ttttggtgaa 
gatatatttt 
taaataattt 
gtttggggtt 
attggtagaa 
taaatatttt 
gttaagtaat 
gtttttttaa 
tatggggttt 
agggagaagg 
tagataagtg 
ttaggttagt 
gttggtgtgg 
gaagtttttt 
ttatgtttga 
gtttttgtta 
ttttttgtat 
gttttgagat 
tttaaaattt 
gaaaggtaaa 
tatgaagttt 
tgtaagtgtt 
agtatgtttt 
tgggtgtggt 
agagagtttg 
ttgaatttta 
agaagttatt 
tttttggtaa 
tttagtattt 
ttgtgtagag 
tatatggttt 
attttttttt 
atttgtaaaa 
gtatttgttt 
tgtaaaagga 
tattatttaa 
aaattgatgg 
ataattttat 
tggagaagtt 
aagatttttt 
gaggaggatt 
tttatgttta 
tttatgtaag 
gaatatgtaa 
attttttttt 
aggttggagt 
gattttttta 
ttaatttttt 
atttttgggt 
tgagttattg 
ttatgataaa 
gttttaggtg 
aagagaaatt 



aaatttagtt 
tttttttttg 
ttttttttat 
ttgttttagg 
gtgttgtgat 
ttttttgggt 
ttttttatat 
tttttatatt 
tagttaatat 
atttaatgtg 
ggttaaggtt 
ttaaaatttt 
tttgtgtgaa 
ataattggga 
ttgttaaatt 
atgtttttat 
tggtagatga 
ttggattttt 
attttggggg 
ttttgttagg 
agaggttttg 
ttggttttaa 
gttggttagg 
atttttgtgt 
ttgttgttaa 
ttagtgtaag 
gtatatttaa 
attagtttta 
tttttttttt 
ttatggatta 
aggaggggtg 
ttgggtaggg 
tgttgtggag 
attgattgta 
atttgttgtt 
tttgggagtt 
agggttgttg 
gaaatatata 
tttagaattg 
tggtttttgt 
tttggatttt 
tagtttaatt 
gtatattagt 
ttataggttt 
atgatattaa 
aaggtaaatg 
aagtaatttg 
attgttaagt 
aaggttgagt 
agagttttaa 
ttagtttaaa 
tttgggttta 
-tttttttttt 
atagtggtgt 
ttttagtttt 
gtatttttag 
ttaagtgatt 
tgttaggttt 
aagtttgatt 
ttagtaaata 
aaatgatt 



gttattaagt 
tttttattgt 
ttgagattag 
ttgattttaa 
ttataggtgt 
aagttgtagt 
tttatttggg 
tttttagagg 
ttatttagaa 
tattaatgtt 
atataattaa 
ttatgtttat 
atggaatatt 
aagtgtaatt 
ttttaattta 
agttttgtgt 
taaatgataa 
gatttgggtt 
ttattttgat 
tgagtaggta 
ggaggtgtaa 
tgttgtgatt 
tgttatttgt 
gaagttatta 
tgtgattgtt 
gatttggtgt 
gttgtattgg 
agagagtatg 
ttatttagat 
agtttggtgg 
ggttttaggt 
gtggggttat 
ttattgtttg 
gttttattta 
gtttagtttg 
gaagtaaata 
ggagatttgt 
tgtagagtga 
gtttttattt 
tttttgtgat 
gagtttgaaa 
taaatatgtt 
tttttatttt 
atataagtat 
tttatatata 
ttattattat 
tttaggttag 
attatgttat 
attttagggt 
gtatttagta 
gttgttattt 
tgtttatatt 
ttttttaaat 
gattttggtt 
ttgggtagtt 
tagagatggg 
tgttaaattt 
attttatgtt 
attttaaaag 
ttgagtgaaa 
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atgatatatt 
aattttttta 
aattatttgt 
atttttgggt 
gagttatggt 
gttttataga 
ttttttatta 
attaggaggg 
ttaattattt 
gttttggatg 
agtggaaaat 
ttttaatata 
ttttagataa 
ttgttgtatg 
gattatagtt 
tttttaattt 
ggaggttgaa 
atatt:gggga 
ttttgtagtg 
tataaaaaga 
ggagaataga 
gtttgttgtt 
atttgtgggt 
aatttgtagt 
ttagagaagg 
tgggattagg 
ggaattaaaa 
taaagttgtt 
ttgggtagtg 
attaagtttg 
ggggtttggg 
attgaagttg 
gttgttgggt 
tttttttttg 
tttaggagta 
agaaaatttg 
tttaataata 
tttaaagttt 
ttatttgagt 
aaatgaggta 
gatttggaat 
tgttaagttt 
ttagggatgg 
attttagatg 
gtattgtttt 
ttttatttta 
ggataagtgg 
aaataaatag 
gagggtgtat 
ttgttaatat 
ttataaagag 
tttttaagga 
tttttttgaa 
tattgtagtt 
ggaattatag 
gttttattat 
aagattttta 
ttgatattga 
aaaatgattt 
gaaaaagttt 



txt 

ataaatttaa 
ttaatttttg 
ttttgttttt 
ttaagtgatt 
attagatttg 
aaaaggaaat 
ttgtagttta 
aatatattat 
gtttaaattg 
aaattgagtt 
gttatgattt 
tgttggagag 
tttaaataat 
taattttttt 
ttttaatttt 
tgagtttaag 
agttagatgt 
tttttataaa 
agttagatta 
agttttagtg 
gatgaatttg 
gaggttaggg 
tataggtttt 
atatgatgtt 
attttgtttt 
gttgtttgaa 
gttagggttt 
gtggtggagt 
ttatgataat 
gtagtgttgg 
ggggagaggg 
gttgggaatt 
tgaaggtgtt 
atttagagta 
tagatttgtt 
gaagtatttg 
taaaagtagt 
aggtattagg 
tttaaggtta 

gggggagggt 

tattattttt 
gtgttgtgtg 
atttaggttt 
tatttatgaa 
aaatttattt 
gaatatagga 
tagagttagg 
atatgaagtt 
attagtattt 
gttgtaaagg 
atattgtgat 
taagtggttt 
atagggtttt 
ttgatttttt 
atgtgtgtta 
gttttttagg 
aaatgttggg 
atagatttag 
attgatatta 
tttatagagt 



tatttataaa 
aaaaaaaaaa 
agagataggg 
tttggttttg 
aaattttttt 
gaagtaatag 
attgtgtatt 
tttagatagt 
ttagttattt 
tataataaat 
gaaaaggtat 
ttaaatttta 
gagtatttgt 
ttttggatgt 
ttgtattatg 
tgaggttgtt 
ttttgaattg 
ttattttttt 
aatttttatg 
attattgtat 
attgtgggta 
ttgttatttt 
tgaagtttat 
tagagtttgg 
tttggttgtg 
gattgtttaa 
ttaggattat 
ttgtaggaaa 
ttaagaaatg 
agttaggtgg 
tggtgattgg 
ttattatttg 
ttttgtttta 
ttttttattt 
ggttaagttt 
ttgaaggttg 
tttttgggtt 
atgtaaatgt 
ttgattattt 
gattagaaat 
agttttggta 
taaggtattg 
tttttagttt 
attatgttgt 
aatttttata 
aaatgatgtt 
tatttggttt 
attttgaaaa 
tggttgtagg 
ttttagtgaa 
ataatggaaa 
gggattagtg 
attttgtttt 
aggttgaggt 
ttatgtttgg 
ttagttttaa 
attataggta 
gaagattttt 
gtttttttga 
ttagattgaa 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4398 



<210> 383 , 
<211> 4398 
<212> DNA 

<213> Artificial Sequence 



*220> 
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<223> chemically treated genomic DNA (Homo sapiens) 
<400> 383 

aattatttaa tttttttttt taatttaaat tttatgaaag gttttttttt ttatttaata 60 

tttgttagtg tttgggattt aagaaggtta atgttaatgg attatttttt tttgaggtag 120 

ttgaattttt tgttgtagaa aaattttttt gaatttattt agtattaaaa tatggggtgq ISO 

gtttggtatg gtggtttatg tttgtaattt tagtattttg ggagttttga gtttggtaga 240 

ttatttgagt ttaggagttt gagattagtt tgggaaatat ggtgaaattt tatttttatt 300 

aaaaatataa aaaattagtt aggtgtggtg gtgtgtattt gtggttttag ttatttggga 360 

gattgaggta ggagaattat tttagtttgg gaagttaagg ttgtaatgag ttgagattat 420 

attattgtat tttagtttgg gatagagtga gattttgttt taaaaagaat ttaaaaaaaa 480 

aaaaaaaaaa aaggaaatta ttagttttaa attatttatt tttggagaaa tatggatata 540 

aatttaaatt gtatgttttt tttattatat tatagtgttt ttttgtaaaa gtggtaattt 600 

tgggttggtt tatatgaatt tgttgaaatt tttataatat attggtggtg ttgaatattt 660 

ggggtttttg aatgtaagtt tgtaattagg atgttggtgt atatttttat tttggaatgt 720 

ttagttttga tttttttttt tttagaatgg ttttatattt atttatttgt aatataqtqt 780 

ttaatagtag gaagttttga gttaaatgtt tagttttgtt atttattttt gatttgagta 840 

a9 H a H5 ag - tttt H ta 2 g tgttattttt ttgtatttta aaatggagat gataatagta 900 

tttattttat aggattgttg tgaagattaa atgagtttag ggtaatgtta tatataggtt 960 

agtattattt attaatttgt aatatgattt tataagtgta tttagaatat gtttgtgtaa 1020 

atttatggtt agatgatgaa gttaagaaaa gtttgagttt atttttggag agtaagaaat 1080 

tagtatattt tttttatata gtattttgta tatagtatag gtttaataga tatatttaaa 1140 

ttgaattgaa gtgaatatta ttagaattga aagtggtagt tttagatttt ttaggtttaa 1200 

ggtttagatt ttataagtgt ttttgattat ttttttttta ttttatttat tgtaaagaqt 1260 

agaggttagg aaaaggatga ataattaatg attttaaaat ttaggtgaga ataaaaatta 1320 

Qttttgagaa attgtgtgat atttatattt tggtatttga gttttagatt attttgtatg 1380 

tgtatttttt ttgtataaag tttaggaggt tatttttata ttattggggt aaatttttta 1440 

atagttttgg gtattaaata gtttttagta ggtgttttta aattttttta tttgttttaa 1500 

tttttaggtt gttaggggaa gtttggttga taagtttata tttttgggta gattgagtaa 1560 

tagtaggtgg tggtttttgg ataagaggtg ttttaggttg gaaggaaata aatgagatta 1620 

tgattagtta aagtttggta aagtaagaga tgtttttaat ttgatgatta gatggtggtt 1680 

H 9 HS tata Satttttttg gatggtggaa tttttaattg gttttgatgt agttttgttt 1740 

" a "HH at ta tgtttatt aattattgtt tttttttttt taggttttgt ttggagtttg 1800 

t HI t 3 ttaa agtatgtttt atttgatttt gatattgttg ggtttagttt gttgggttta 1860 

gtttgtggag tgtttgtgta ttttttgagt tgttgtggta ttgtttaaat ttgaatgggg 1920 

gaaagaggga attttatatt ttttatagat tttgttatag tgattttgtg tgttttttta 1980 

gagttggttt tgttttttgt ggttttgaga attttggttt ttggtttttt aatgtggttt 2040 

aa £SHH2 gttttaagtt aggtgatttt tgggtagttt tagttttagt attgggtttt 2100 

tgtgttgagt tttgggatta tagttgggga ggtggggttt ttttttgggg tggttgtgtt 2160 

ggtagtggat gtgggaagtt ggattttggg tgttatgtat tataagttta gtggttttat 2220 

§HS* ag 55? gtaggagtat gggttttgga ggtttatagt ttgtaggtat gagtggtgtt 2280 

tggttagttt ggatatgaga agtggtggtt ttagttttgg tgataggtag ttgtggtgtt 2340 

ggagttaggg gaggtttttg tttgtggttg agtttgtttt tgtttttttt gtattttttg 2400 

gggttttttt atattagtgt atggtggtta ttgaggtttt tttttgtgta tttgtttatt 2460 

tggtgggggt tggtttggtg tggaagtttg gtttggttta ttgtaggggt taaggtgatt 2520 

tttggggtta tttgtttaaa agggatggtt tgtgggggtt tttagtgtga tttgagttag 2580 

aggJtJS 9 " ttttttttta atttgaagat atttggtttt tggttttttt gttgtttgtt 2640 

atttgttaga gttttatagg tagttttatt tgagtttgag attagagagt gtggggttat 2700 

agagaaa I ta tggtgtaggg ggttgggggg ttatgatttg ggttgggaaa 2760 

tttaataggt tatttaatat gtttagggag ggaagttgta tgtagtgagg ttatattttt 2820 

ttgattgtaa aatatttgat agatgtttgt tgtttgggtt gtttgaaaag tattttattt 2880 

tatgtagatt ttattaattg gagtttgatt ttttagtatg tgttgggggt aaatatgagg 2940 

fi£E: tt 2S afl "ttagatgt gtttttttaa attatggtgt tttttgtttt aattgtgtga 3000 

ttttggttaa gttatttaat ttattgtgag tttagtttta tttagggtgg tgttagtata 3060 

SSSJ^S? ag 3? t9ttaa gtgattagta gtttgggtag gtggttggtt ttggataagt 3120 

gttaattgtt tattagaagt tgtttgagat agtatatttt tttttaattt tttagagaag 3180 

tgtgggaagt gttggtgaga tatgtagtta gattgtagta gtgaggggtt tagatgaggt 3240 

gtaaaggata ttttattatt gttgttttat tttttttttt tgtgggatat tgtagtttgt 3300 

^ aa9 J?i2i a § at9tttaag aaa 9atttta ggtttggtat tgtggtttat gtttgtaaat 3360 

tatagtattt tggaaggttg aggttgaggg ttgtttgagt ttaggagttt gagattagtt 3420 

tggggtaata tagtgagatt ttgtttttaa aaataaaaat agatgatttt aattttagat 3480 

aagaagggta agtttttttt tttttttttg ggggttagtg ggggagttgt ggtgggaatg 3540 

ggaaggaaga gtttagggtt tataaatgtt gagtttgtga tatgttatgt ttggtigtag 3600 

ttagattttt gaatatggag tttaataaat ttgggtgttt ttagtttata ggtggtaatt 3660 

tgtagggaag aaaggttgga ttatagttag agtatagaaa tatgagggaa gggtggaqaa 3720 

agtttggatg agaagggaaa ttagtgtggt tttattgiaa 3780 
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ttatgtgaga aaatggttat ttttattatt tgtttttatg aaatgtattt tttagtgtat 3900 

ggatattaaa tgttatagtt tgatgagttt ggataaattt atgtattgta taattatatt 3960 

ttttttaata tgtaagatgt tatattattt taggaagttt ttttgtattt tttgttaatt 4020 

tttgttattt ttttattttg atttttatta ttataaatag atagttttgt ttttttttga 4080 

attttatgta aatggaatta gttttgattt aggttggagt gtaatggtat agttatagtt 4140 

tattgtagat ttatttttga gtttaagtaa ttgttttgtt tttgtttttg aaaatgttgg 4200 

aattatagat gtgagttgtt gtgtttggtt gtttaaaaaa tatttttgag gttagtttgt 4260 

gttgtatgta ttagtagtta atttttttta tttttttatt tttttgtggg attttatttt 4320 

ataaatatat tatagtttat tttgttggtg gtagtagatg tttttttatt tattttttta 4380 

aaatattttt tgagatag 4398 

<210> 384 
<211> 4471 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 384 

aatttataga ggtatattta tagaggttaa attttattgt atgtaaatta tattgttgta 60 

gtttaatttt tttaaagtgt tattgtatat aaaataatgt gttagatatt gaattatgaa 120 

atttaatgtt taagtatata atttttaagg tttttagatg tattattaaa aatttattat 180 

taaatatttt gagtttagtt tttagtattt tatattttat ttatatttat attttataaa 240 

atgatttaaa gtattattta aaattattat atgaattttg aattattatg tgtttaatat 300 

atatttatag aaaatttttt ttttattttt aaaagtgaat atattaatat ttattaaatg 360 

ttaattatat ggttatgagt tgaatttatt attaaatttt attttttttg aattataaat 420 

atattaagaa atgtaaattg tgattagatt tatttatatt ttttagaata ttatgaaatg 480 

tgttttaaat atgttaattg attttagaaa tttttatgtt ttgagtaaga tttaagtgtt 540 

aataagtatt tgatggtttt taaaataaga atgttttaga aatttattta aaaaaatgtt 600 

tgaatgtaaa atgtaatttt atggtgtata ttttaatgtt tgttgatgaa tagattttta 660 

tagttttttt tttttagaat ttatattaaa atatattttt tttagaaaaa ataggagtta 720 

aagatgttaa ttatgttttt tttatagtta tgttatattt gttttataat atttttagaa 780 

aagttttttt ttttatatgt ttatttttta aattataaat atgttttttt ttattagata 840 

ttgagttatg aaatttagag taaaagtaaa attttttaat ggaatgttta tgtaaatatt 900 

agaaatataa taaaataagg taaagtttgt tttttaaaat gttgtttatt atattaaaat 960 

ataattttgt tttttaaata gtgagtaaaa ttaaattaat tttatttttt taaatagaag 1020 

tttatttaaa gtttttttat aaaaatgata tttatttgat ttaatattgt ttttttaata 1080 

agaataatat gtagggtgtt agagggtagt tgttggattt ttttttagta aataaaggag 1140 

attagttaga gtttagtttg gtttgtttag gaaaggagga atgtagtttg ttgatatggt 1200 

taaggaatgt taattaatat attttaaaat ttttattatt ttgtttgaga tgtgtatatt 1260 

tttttaggtt tttaaagttt aattagaaag tgtatttaat tttattgtta ttttatttat 1320 

agtggggaag tttattgagg aaaattatag ggaaataaaa tgttttttag ttatttatat 1380 

tagtataatt aatttagagt ttatggtata aattaaataa gttaatttgt tagtgttttg 1440 

aatgaagata tttaattaaa gtatgtattt tttaattttt tagtagtttt taggatagtt 1500 

ggttttggtg attgtttttt ggttttttat tgtttttaat atgattggtt ttttattgta 1560 

ggattttaaa taaaatgaga taattaaatt atattttgag tgaaggggtg tttattttgt 1620 

agataaatat attggttttg ttttttttaa aatgtggata tgtgtttttt ttgtattagg 1680 

gggggttttt tggtgtgtgt tttgttgtta tttgttgagg aaagtgagtg tattttttgt 1740 

agtttaggtt ttgggtgtta gttttgtttt gtagttttag agtttgttgt agtttgggtg 1800 

gttttttttt ggtttagtgt ttgttgtttg ttttttgttt tgtaagtttt aagaggtagt 1860 

tattttttgt agtttttgtg tttgtaattg ttttttggtg ggggagtggg tgtttaaaaa 1920 

gttagtagtt ggagaaattg aaaagattat aagtgattta atgataagtt tttttttttt 1980 

tttaaagatt gagaggaggg tagaggggag tagtgtttga gtttatgtga ttgagttagg 2040 

gagtttgatg gttttaggaa tgtttgatgt tgtgtgtgat ttttaagtgg gagtattttt 2100 

gaattgattt ttggtttatt tataaggata gtggtgtata gatggtgttt tttgtagttt 2160 

tagttttaga tttaagaggt ttggagtagg gtttgagaat atgtattttt aattaggttt 2220 

tgggggatgt tgatattgat atagttagtt tggggattat attttgagga ttatgttttt 2280 

agtttttgat ttatataagt gttatttaga atagatgttt gattttaagg agttagtggt 2340 

gaaattagag aggttttggg tttgttaaat ttttagtagt aaatgtaatt ttgggttttg 2400 

gagtggtaaa gttgtggatt agaggtggag ggagtgggtt tftagttttt aagaggttat 2460 

tgggaaggtt ttttgtgtta gattaaagat tttaggtatt tttttgaatt tatttgaagt 2520 

ggtattgggg agatttgtgt ttttggttat ggtgtttttt tttgttggag gtatttgttt 2580 

attttttttt ttgggtgaag gttttttgtg ttttttggtg gtagttttag tttttttagt 2640 

-ttaatggggt gtttttttta tttattttag taggagtttt agggtgtgag attaggatta 2700 

tagttttaat tggttttaag gtagttgttg ttgatgaaaa tgaaaaggaa agtagtatgt 2760> 

gatttatagg ttattgtgag aattttttag tgttatattt gtgtttaatt aattattaaa 2820 
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ttatttattt 
agaggaagat 
ggtgtttagt 
aggtaatgaa 
tgtgaaaagt 
ttttttttag 
gggaagagtg 
gttttttgtt 
tttatggaga 
gtttatattt 
gtttgagatt 
gttggatgtg 
tgtttgaata 
tgggtgatag 

tggtggtata 
taattataag 
attaatatta 
ttagtttagt 
ttgttagtta 
gatagttatt 
tgtaagttta 
ggaaggttaa 
tttaggagag 
aaaatggttt 
gttaagtttt 
aatagtatgt 
gaaattttat 
gatttatttt 



aggtggtata 
gagggggggt 
taaaatagag 
gataggtttt 
ttttattgtt 
gttgaaaggt 
tgttgttggt 
ttggggtttt 
aaatgttaaa 
ataattttag 
agtttgatta 
gtggtatatg 
tgggaggtgg 
agtgagattt 
tatttgttaa 
ttaataggta 
tatttgatgt 
gagatattga 
tttgaagtta 
gtttattgga 
taaggtaaat 
ataatttttt 
aggttaagga 
gattttttgt 
tgttttgtgt 
ttttttttta 
tgtaggaaat 
gtttattaag 



agggatatgg 

atttggaggg 

agatgtaagt 

aaagatggag 

ttattttaag 

agagtttttt 

aaataattgt 

ttaaagtatt 

tgatggttta 

tattttggga 

atatggtgaa 

ttagtaattt 

aggttgtagt 

tgttttaaaa 

taagatattt 

gaaaaagata 

tgtgttaagt 

ttttattttt 

tttagtgaga 

tatttatgtt 

tttattattt 

tatggttata 

tttatttatt 

tttatgtaga 

ttattgtgat 

gagtagttta 

aataagtaga 

aaagataatg 



seq_prot . 
tttttgaggg 
aaagttttat 
gtgttttggg 
aagtatttgt 
tttaattttt 
tgtttttgag 
agtagataat 
tagtttttat: 
aaaaaattag 
ggttgaggta 
attttgtttt 
tagttattta 
gagttaagat 



ttggaaatgg 
ggtgagggtg 
atttgtatag 
ttagtgagga 
aaggaattta 
ttgttttttt 
ttattttata 
tgatttatta 
tttatattat 
atattgtaga 
gtttttttta 
aatatttata 
aaatgaatgg 
g 



txt 
ttgtgattta 
agtattagtt 
ttgataaaga 
tttattagtt 
attaaagttg 
gtagagaagt 
atgttaattt 
atgttttttt 
ttttatggtt 
ggtggattat 
tattaaaaat 
ggaggttgag 
tgtgttattg 
aaaaaaaaat 
aaattgttat 
aagaaaatgt 
tattttattt 
aattgaggtt 
gtaataatag 
ttaataattt 
tttgagaaaa 
gtggttaaga 
gtttttaatt 
taaatttttg 
gaaggtattt 
gaagttgttt 
gagtgagtaa 



gatttttaat 

ttagttgttt 

tgaggtttat 

aaataatagt 

agagttttgg 

tagtttggat 

tagttttttt 

tttttgttat 

aggtatggtg 

gaggttagga 

ataaaaattg 

gtaggagaat 

tattttagtt 

tagttttatt 

gaaggaattt 

atttattaat 

aatttttata 

tatagagttt 

tggttttata 

ttattttata 

tgaatgttta 

aggatttgaa 

aattagaatg 

ttatttgtgt 

tatatatagg 

attatataag 

gttttttttt 



<210> 385 
<211> 4471 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 385 

ttattgtttt 
ttttgtttat 
ttgagttgtt 
tattatagtg 
ttttgtatag 
aggtaaatgg 
atgtgattat 
ggggtaatag 
gggtataaat 
tttttattag 
gaaaagtgga 
tgtttagtat 
ttattttttt 
gagatatttt 
tttttgagat 
tatrtgtaatt 
gggattattg 
ttttattatg 
tttttaaagt 
ttaaattatt 
gggtattttg 
tgtagttatt 
agggagattt 
atttagggtg 
tttttatttt 
tatttgtgtt 
tttttttaaa 
attatgtttt 
attggggggt 



ttttgatgga 
tattttttgt 
ttgaaaagag 
aatatagagt 
agtagagagt 
gtttttggtt 
agggaggttg 
aatttgtttt 
gtttaataag 
gtggttttag 
gttaatattt 
agtgttaggt 
ttgtttgttg 
gttaataagt 
agagttttgt 
tttgtttttt 
gtatgtgtta 
ttggttaggt 
gttgagatta 
atttaatatt 
agagatttta 
tgttagtgat 
tgttttttaa 
gggtagtgga 
tggagtttgt 
tttttatttt 
tgtttttttt 
ttatgttatt 
ttttataatg 



tagggtgagt 
agtgaggttt 
aatatgttgt 
agaagtttgg 
taagttattt 
ttttttttgg 
tttaattttt 
ataggtttgt 
tggtagttgt 

gtagttgata 
tattgagttg 
atgatgttaa 
gtttatggtt 
atatattatt 
tttgttgttt 
atgtttaagt 
ttatgtttag 
tggttttaaa 
taggtgtgag 
ttttttataa 
ggataggaga 
atattttttt 
tttgagagaa 
gattttttat 
ttttattatt 
ggttaaatat 
tgtttttttt 
tgagtagatg 
gtttgtgggt 



tagaaggaaa 
ttttgtataa 
ttttgtgtgt 
tatataggtg 
ttattttgat 
atttaagttt 
ttgaatattt 
gtataaaata 
ttatgaagtt 
gaagttttgt 
gtgtgaggat 
tattgataga 
gaaatttttt 
aaatgaagtt 
aggttggaat 
gatttttttg 
ttaatttttg 
tttttgattt 
ttattatgtt 
ggtggtaagg 
tagggaggtt 
tgtttaaatt 
attaggattt 
agttattgtt 
tgtggatttt 
taaatagttg 
tattagagat 
gtttgatgat 
tatatgttat 



tttatttatt 
taaatagttt 
agaatgtttt 
gtagaagttt 
tgattggagg 
tttttagtta 
atttttttaa 
agaattattg 
attgttgtta 
gagttttaat 
taaatgagat 
tgtatttttt 
tatgatggtt 
gatttttttt 
gtagtggtgt 
ttttagtttt 
tatttttagt 
tgtgatttat 
tggttatgaa 
aggaagagta 
gagattggta 
aatttttttg 
ttagttttaa 
tagttgatga 
atttttatta 
aggttggtat 
ttggattata 
taattaggta 



tttatttatt 

ttgtgggtgt 

ttgaaggaag 

gtttgtagtg 

tatggtatgg 

ttgataggtt 

gtataaaatg 

agagaaagtg 

ttgggttttt 

ttttttattg 

gttgtgtagg 

ttatttttat 

tttattttta 

tttttttttt 

gattttggtt 

ttgagtagtt 

agagatgagg 

ttgttttggt 

gttgattttt 

tatggggatt 

tgttgtttgt 

ttttaaggat 

tgaaaattgg 

agtagatgtt 

atttagagta 

tgtaaaattt 

atttttaaaa 

tagatgtgat 



2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4471 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
17_40- 



seq__prot.txt 

gtaatagttg ttttaaagtt agttaagatt gtggttttag ttttgtattt tggggttttt 1800 

gttggggtgg gtgaggggaa tattttatta agttggggga attggggttg ttattagggg 1860 

gtgtgagggg tttttgtttg agaagagggg tgggtaggtg tttttagtgg agaagggtgt 1920 

tgtggttgga ggtataggtt tttttggtgt tattttaagt gagtttgagg aagtatttgg 1980 

gatttttgat ttaatgtgaa aggttttttt agtgattttt tgagagttga gaatttattt 2040 

tttttatttt tagtttatgg ttttgttatt ttagggtttg aggttatgtt tgttgttggg 2100 

gatttgataa atttaaagtt tttttggttt tattattggt tttttagaat tagatatttg 2160 

ttttgaatga tatttatgtg agttaggggt tgaggatgtg atttttgaag tgtggttttt 2220 

agattggttg tattagtgtt ggtatttttt aggatttggt tggaaatgta tatttttagg 2280 

ttttatttta gattttttaa atttgagatt ggggttgtgg ggagtgttat ttgtgtgtta 2340 

ttatttttgt gggtggatta ggagttggtt tgagggtgtt tttatttaga ggttatgtgt 2400 

ggtgttgggt gtttttgaga ttgttgggtt ttttggtttg gttatgtggg tttaggtatt 2460 

attttttttt attttttttt tggtttttaa aaggaagaag gggtttattg ttaagttgtt 2520 

tgtgattttt ttagtttttt tagttgttgg ttttttggat atttattttt ttgttaggag 2580 

gtagttgtaa gtgtggaggt tgtgagaaat aattgttttt tgaaatttgt agggtgaaga 2640 

gtaggtggtg agtgttgggt tggggaggga ttatttgagt tgtgatgggt tttggggttg 2700 

tggggtaggg ttggtgtttg gagtttgagt tgtaggaggt gtgtttgttt tttttaatag 2760 

gtggtggtgg ggtgtgtgtt gggagatttt ttttaatgtg ggaaaagtat gtgtttgtat 2820 

tttagagaag gtaaggttgg tgtgtttatt tgtaaggtaa gtgttttttt gtttgaggtg 2880 

tggtttaatt gttttatttt gtttgaaatt ttgtggtgag aaattagttg tgttgagaat 2940 

aataaaagat taaaaaatga ttattaaaat taattgtttt gaaagttatt ggaaagttgg 3000 

aaaatgtatg ttttgattaa atgtttttat ttaagatatt ggtaagttaa tttatttagt 3060 

ttgtgttgtg agttttgggt tgattgtgtt aatatzgaata attgaaaaat attttatttt 3120 

tttatggttt tttttgatgg atttttttat tatgggtgaa atgataatgg agttgaatat 3180 

attttttga-t tgaattttga gggtttggga agatgtatat gttttaggta agatgatagg 3240 

ggttttaaaa tgtattaatt ggtatttttt agttatgtta gtaagttgtg tttttttttt 3300 

tttgggtaga ttaagttaag ttttaattgg ttttttttat ttgttgaaga ggagtttaat 3360 

aattgttttt taatattttg tgtgttattt ttattggaag gataatatta agttaagtga 3420 

atgttatttt: tgtgaaaaaa ttttgagtgg atttttattt aggaagataa ggttgattta 3480 

attttatttg ttgtttaaaa agtaggattg tgttttggtg tggtaggtaa tattttggag 3540 

gatagatttt gtrttatttt gttatatttt tagtatttat atgggtattt tattagaaag 3600 

ttttattttt gttttaagtt ttgtaatttg gtgtttagtg aggggaaata tgtttgtaat 3660 

ttaaaaagtg aatatgtgaa aggaaaggtt tttttgagag tgttgtaaaa taaatgtaat 3720 

gtgattatga aaagaatatg attaatattt ttgattttta ttttttttga agaaaatgta 3780 

ttttgatatg agttttagaa gaaggaaatt ataaggattt gtttattaat aggtattaga 3840 

gtatatattg taggattgta ttttatgttt: aagtattttt ttagatgaat titttgaaata 3900 

tttttatttt aaaagttatt agatgtttgt taatatttaa gttttgttta agatatagaa 3960 

gtttttgaaa ttaattaata tgtttaggat atattttgta gtgttttgag ggatgtgaat 4020 

aaatttaatt atagtttata ttttttaatg tatttataat ttagaaaagg tagaatttag 4080 

tagtaaattt aatttataat tatataatta atatttaata gatattgata tgtttatttt 4140 

taagaataag aaggaaattt tttataagtg tatgttgaat atataataat ttaaaattta 4200 

tgtgataatt ttaggtgatg ttttgagttg ttttatagaa tataaatatg gataaaatat 4260 

aaaatattga aggttgaatt taaagtgttt aatgataagt ttttgataat atatttagaa 4320 

attttgagaa ttgtatgttt gaatgttaga ttttataatt tagtgtttag tatattgttt 4380 

tatatgtaat agtattttaa aaaaattagg ttatagtagt ataatttata tatagtaaaa 4440 

tttagttttt gtaaatgtat ttttatgaat t 4471 

<210> 386 
<211> 4479 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA CHomo sapiens) 
<400> 386 

tggtggttta tgtttgtaat tttagtattt tgggaggttg aggtgggtgg attataaggt 60 

taggagattg agattatttt ggttaatatg gtgaaatttt atttttatta aaaatataaa 120 

aaattagttg ggtgtggtgg taggtgtttg tagttttagt tagttggtag gttgaggtag 180 

gagaatggtg tgaatttggg aggtggagtt tgtagtgagt tgagattgtg ttattgtatt 240 

ttagtttggg tgatagattg agattttgtt ttaaaaaaaa aaaaaaaaaa atatggttgg 300 

gtgtggtggt ttatgtttgt aattttagta ttttggaagg atgaggtggg aggatttttt 360 

gaatttagaa gtttagtaaa attttgtttt taaaaaaaaa aaagaattgt gtataaagat 420 

tttagagagt gttaaagatt agtgtatgga taaggaagtt ttgtgaagag ttgaagtgtt 480 

aggtgaagag gtggtatggg ggaggagggg gtggaagggg agaaagggtg ttatgtttta 540 

taatggtttt taaatttttt tgtttaggag gaaatgaagt tatttgtttt tagtaattag 600 

tatgatagtt tttagttaag taatttggag ttatgagagt tgttagggga gtaatatgaa 660 
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ttatgatggt 
ggttgtagta 
tttaaataaa 
agtattgttg 
tttagaagat 
agtaggtagt 
ttattttttt 
ggtttagatt 
gttagtggat 
tttaaaatga 
ttatagaatt 
gtatatataa 
gtttgttttt 
aataaggttt 
tttttttgga 
aaatagtgaa 
agaggtgaga 
tttttattgt 
attttttaaa 
tttgttttta 
tgttaaaaat 
tttttgatag 
tttatttatt 
agtttttata 
gttgtgttga 
ttaggtaagt 
aggttttaga 
tggggatttt 
tttataagtt 
gtttttttta 
gtaagatagg 
atagagtagg 
agggtttatt 
ttgtggtttt 
tttttgtttt 
ttttgttttg 
gaggagagag 
ttgtttggtt 
gttgttgttt 
tgtgttttgg 
aaggattttt 
gattttattt 
tttgagaatt 
ttagtttgtg 
aaatttatgg 
aagagtgggt 
atttttgatg 
tatgtgagta 
gatatttttt 
tttgtttatt 
tgtgtttaat 
gtagtattgt 
gggagggatg 
atatttttat 
tataaatgtg 
atatttttaa 



ttttgggaat 
tttttgttta 
attgattata 
atttagtttg 
agtgggttat 
tggtattttg 
ggtatttttt 
tgtaaggtag 
tgttgggatt 
ttttaagttt 
tttgttatgt 
atgtttgttt 
ttggaggata 
tatgatgaag 
atgggttata 
aaataaattt 
atagaatttt 
ttttattttt 
agatttgtta 
taattttgta 
ggattttaaa 
tattattaaa 
ggttgtgttg 
ggtaggggtt 
tgaagatttt 

tggggtttag 

gttgggtgag 
ttgtattggg 
ttttgatttt 
aatttttgtg 
ggagggagtt 
tgttgtgggt 
tgtttttgta 
gagatttttg 
gggttgtttg 
ggtgtttttt 
atagttgggg 
tgtttttttg 
agtgttttga 
ttgtgagtga 
gattagtatt 
atagtagggt 
atgggggttg 
ggtatttggt 
ttttttattt 
tttgggattg 
aagttttttt 
tttattttta 



tttttgataa 
tgttttggtt 
gttttttaga 
ttttttggaa 
ttgtttttta 
agtttttggt 
ttgttttttt 
gaaaatgttg 
tagttttttt 
tggtttttga 
ttttttaagt 
atttttaggt 
attttatttg 
taaaatatta 



tttgagaggt 
tatggtgtaa 
ttgtagtttt 
gattttagtg 
ttttaaaaaa 
ttttttgttt 
gaggaaaggt 
ttttttagta 

<210> 387 
<211> 4479 
<212> DNA 



ttttgttttt 
ttatttttgt 
aagtattttt 
gaaataggta 
ggtgggaggt 
ttttatgtga 
aggtggggtt 
aatgtttgtt 
taaggtgagt 
ttttgttttt 
agttatttgg 
agttgagatt 
aaaaaataaa 
ttttgtagat 
tagtattgat 
aatgtattat 



tagagatgtg 
tttttttatt 
tatatttttt 
atttttgttt 
atattttaat 
atattatatt 
ttaagttgta 
taaatttttt 
gtgatatttg 
tttttgtttg 
atagttttat 
tggggagaag 
gtagagagtt 
gtaggttttt 
ttagtttaga 
agtgttgtgg 
gggtgggaga 
tgtagtatag 
gtgttggttt 
ggttggatgt 
ggttttttgg 
gtgggtgttg 
agtttggtta 
tttttttgtt 
ttttgttatt 
ttttaaagtg 
aggggagagg 
tagatttttt 
gtattggttg 
agatgaagtt 
gagaattaga 
gggttagttt 
ttgaattttt 
taatagttgg 
tgttgtaggg 
tttttttagt 
ttaatattta 
tggggatttt 
aagagtaggt 
taagtataaa 
ggaaggagat 
ggtagttggg 
ggattatttg 
attaaaaatg 
gaggttgagg 
atattattgt 
agttagttgg 
aagttaaaaa 
attatgggtt 
gtttatagat 



seq_prot . 
ttaatttggg 
atgtttattt 
ggaagtagta 
ggttggtagt 
aaagttgaaa 
tggggtagag 
tttattattt 
taggtttagg 
ttttttatta 
tattagggga 
gtgtttagag 
aggaagggtg 
ataaataatt 
aatatatatg 
ttttttaaaa 
ttgtttttta 
agaattttta 
gtataaataa 
aggaagtatt 
atgatattta 
aatttttagt 
tattgttgaa 
gggattgtta 
tgtttttggg 
tagttgtggg 
ttgtagaaga 
ttttgtgtgt 
ggttaggtgt 
tgttgttggg 
aagtatgtgg 
gggaggggtg 
tgtggagatt 
gtgttttttt 
gtggtggttt 
gttttttggt 
atttaggttg 
ttgtgggtat 
ttttgttttg 
atgagagttt 
agtgtgtttt 
aggggttgtt 
ttaggaaatg 
gtaagggagg 
tgtttgggtt 
tatgaatagg 
atttttattt 
tttagggtag 
ttgtatagat 
ttgtgttatt 
gaattgtgat 
ttggtgtaat 
tattgtaatt 
aggagttagg 
tatatatata 
tttgtttagt 
tatggtggtt 
aggttaggag 
taaaaattag 
taggagaatt 
attttaattg 
aatgtttttt 
tttgttatga 
ggttttgttt 
ttgatgttt 



txt 

agttttgggg 

ataattaatg 

aggttggttt 

ttagtaagta 

ggttaatttg 

taaaggagtt 

ttaggtggat 

ttgggaaagg 

agagagtgag 

aagagatggg 

atgtgtgtgt 

gatgtagtta 

gtttttattt 

tattaaaaaa 

tgttaagagt 

gttttttaat 

tagttagaat 

atgaattgtt 

aatagtttat 

atttaatttt 

aatttggttt 

agtaataagt 

ttgaatagat 

agagtatgtt 

taagggggtt 

attgattaga 

tttttttttt 

atgggaggaa 

ttttttttgt 

ggtttgggtt 

gtgttggggt 

gtagttttgg 

gttgtagtta 

tagtttttga 

ggttgtatgg 

tttggagttt 

tttttttgtg 

attttttttt 

tgttggtgtg 

tgttttgatt 

tagggagttg 

ggatagggtg 

aagaggttgt 

aatttatttt 

gtgaggtgag 

tggagttttt 

taatgaattt 

aagttgggaa 

ttttattatt 



tgtttaaatg 
tgtttaaaga 
gggagagaaa 
agttggaggt 
tgtttatgaa 
tttttggtag 
tatgtttgta 
tttaagatta 
tttggtatgg 
gtttgaattt 
ggtgatagag 
ttttttttat 
ggggaatggt 
gttttggaat 



taagttttta 

ggtttttaaa 

tgaagtttat 

tagaagtttt 

tatttttttt 

tttttttttt 

ttagatttaa 

gtttaaagtt 

ttttattggg 

ggtgatagaa 

gtgtgtgtgt 

tttatatatg 

gaatagaata 

tgtataatta 

gtaaaaggat 

ttttttattt 

tgtttatatg 

tattatggaa 

gttttaggat 

tattaagttt 

tgtttttttt 

ttggttttgt 

gtatagaggg 

ttgtatattt 

tgaggtagtt 

ggattttagg 

ttttgtaatt 

gtatggagaa 

tggagattgt 

gttgagtgtt 

gggttttgat 

agtttgggtt 

ttggtgagtg 

gtgtttgttt 

agttaaggtg 

ggagtttaga 

ttgtagttgt 

tttttgtaga 

tttgtttttt 

gatgttgttt 

gggttttttg 

gtagtggagg 

tgggattgtt 

ttttggttgg 

agggagaggg 

ggagtatggg 

tattaagttt 

ggttttaggg 

tttatttttt 



gaggaatatg 
aatttggagg 
tttggggata 

agggtgggtg 
atagtggtta 

gtttgaaatg 
attttagtat 
gtttggataa 
tagtggatgt 
gggaggtaga 
taagatttta 
atttttttat 
tatttttatt 
tttgttattg 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4479 
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seq_prot.txt 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna CHomo sapiens) 
<400> 387 

gagtattaga tttgtgggta tggtatgttt gttgaaggat agtggtagag ttttagggta 60 

ggtagggttg gtttatggtg ttagtgttgg tttttttttg gtaaaagtga ttattttttt 120 

tatagtaggt ttttgattta tttataaggg gataggagga tgagagaata tgagaaagag 180 

aagaatattt taattaattt ttattttttt tttttgagat ggagttttgt tttgttgttt 240 

agttggagtg tagtggtgtg attttagttt attgagattt ttgtttttta ggtttaagta 300 

atttttttgt tttagttttt taagtagttg ggattatagg tatttattat tatgttaggt 360 

tgatttttgt atttttagta gaggtagggt tatattatgt tgtttaggtt ggttttgaat 420 

ttttgatttt aaatgattta tttattttga ttttttaaaa tgttgggatt ataagtgtga 480 

gttattatgt ttggttgtta atggatattt taaagatgtt gttttagatt tgttagaaga 540 

ttggataggg ttttttttta attttatttt atatttgtgt ggttattgtt ttgtgagtgt 600 

gtgtgtgtgt ttatgtttgt tatataggga tgaagatgtt atttatttta tttttagttt 660 

ttaatttttg tttgtttttg ttttttattt attttttttt gtttttaaat ttttttttta 720 

gttgtagtgg agatttttat atttatttta tagtgttgtt ttttgaattt ttttgggtag 780 

ttgtattagt gaatgttgga gaagtatttg ttggatatat atgttttttt atttagatag 840 

ttatagtttg ttggagagaa taaaggtggg gtaagtgagg gggagtggaa gtggtaaggg 900 

gtggtgtagt tttgtagtag agggtaggga ggggatgttt tttgaagttt ttttaatttg 960 

tttgtgtagt taattgttgt aggggtggat atttatatgg aatttgatga agtttattgt 1020 

tgttttggaa gagatttggg aggaggtttt attaaaggtt ttatgtttta gggattttag 1080 

ggtgagggta aattggtttt aattttaaaa tttatttttt tttttttttt ttgttttatt 1140 

ttgtttgtat ttagttttta aatggaagat tatgggtttt tagttaggag aaatggattg 1200 

atttaagtaa gttttattta ttagatgttt gtaggttggg gtagttttgg tggttttttt 1260 

tttttttgtt agttagtgtt aatttttgtg gtttttaagt ttttgttgtt attttgtttt 1320 

attttttggg gagagtttgg ttttgttgtg gatggaattt ggaggatttt agttttttga 1380 

gtagtttttt tttttttttg gtgttgatta gaggtttttg ggtagtatta gttaaagtaa 1440 

gagtgtattt attttggagt tgtttatgat taggatgtag agaagtaggt gtgttagtag 1500 

ggtttttatg gtggtgaggt tggggtgtta gatggtggtt ttgtaaagga aggagaagtt 1560 

agggtaagag gtggaggaat gggaaggtag gttaggtggg tgattgtagt gtaggggaga 1620 

tgtttgtggt gattaggttt tttagttgtt tttttttttt ttgggttttg gattttgggt 1680 

agtttggatt ggtatttgtg ggggatgttt gggatggggt gttttgattt tgtgtagttg 1740 

ttggggagtt tagggagttt gggtagttta gggtggggga ggtagatgtt tgggagttgg 1800 

ggttgttgtg tatttggttt ggggatttta ggattgtggt atttattggt ggttgtggta 1860 

ggagggtgtg agttggtgtt gtggggatag gtggattttg gtttgggttt tggggttgtg 1920 

gtttttgtat tgtgttgtga tttgtggtgt ttgttttata ttagggtttg ttttggtgtt 1980 

gttttttttt tttttgtttg gttttttttt ttgttttgta gtgtttagtg atttggattt 2040 

tgtgtgtttt tgtaatgttt ataaagattt gggggaagtg tgatttttag tggaggggat 2100 

ttaatagtgt ttggattgag gaattgagag gtttgtaaat tttttgtgtt tttttttatg 2160 

tatttggttg ggggtttgtt ttagtgtaag gagtttttga attgtagaga ggagagaagg 2220 

tgtataggag attttttatt ttgtttagtt ttgaagtttt ttggggtttt ttaattagtt 2280 

tttttgtaat ttttttttgt tgggttttaa tttgtttaag attgttttag atttttttgt 2340 

ttgtagttga tggagttgtg aagtttttat taatgtgata aatgtatgag atatattttt 2400 

ttagaagtat agatagaaaa atttttgttt gtaggggttt tttttgtgtg tttgtttagt 2460 

ggtagttttt agatattatt aatataatta gtggatggaa taaagttggg tttattgttt 2520 

ttggtagtaa gggggtttgt ttgatggtgt tattagaggg ggaaaggtaa ggttagatta 2580 

ttgaaaattt gtagtttggt ttaaagtttg tttttgatag ggtttgataa ggattgggtt 2640 

aggtgttgtg atatgatgtt ataggattgt gggaataaag ttttagggta taaattgttg 2700 

gtgtttttta ttgaagtgtt aatgggtttt ttgggaagtt tttataatga gtaatttatt 2760 

tatttgtgta ggtaagaata aaagtaaaga taatggaaat atgtagatag ttttaattgt 2820 

ggaggttttg gagggtgtgg aagttttgtt tttatttttg agtagaggaa ttgggagatt 2880 

ggaggataaa ataagaggaa gatttatttt ttattgtttg tttttttata tttttaatat 2940 

tttaaaaagt atatttttgt atagtttatt ttaaaaagat aattatgtat tttttaatgt 3000 

atgtgtattt agtgttttat tttattatag agttttgttt attttattta gatagaaata 3060 

attgtttatt aaataaaatfc gttttttaga aaaatagatt atgtgtaaat gattgtattt 3120 

attttttttg tttgaggata agtagatatt tgtgtatata tatatatata tatatgtatt 3180 

tttgggtata tttggaggaa tatagtaggg attttgtgat tttgttattt ttattttttt 3240 

tttttagtgt taggaattag ggtttggggt tattttgaat ttagtaggat ttgttttttt 3300 

agtgggaagg aggaggttga gttttagtaa tttattagtg gttttgggtt tttttttagt 3360 

tjaggtttgt agtatttttt tgttttgtaa atttggattt tgggtttggg tttatttgag 3420 

agtgatagaa ggaaggtagg gagagtgtta ggaaggtagg aaggaggaag gtttttttgt 3480 

tttgttttag ttgagggttt agggtgttag ttgtttgttg gggaaagtat aagttagttt 3540 

tttagttttt gggaggtagg tgatttattg ttttttggag agatttttgt gtttgttgag 3600 

ttgttagttt tttaggggat agattgagtt agtgatgtta taggttttag agttaatttt. 3660 
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gttatttttt 
aataaatgtg 
ttaggttagt 
tttttatgat 
ttttattttt 
tttttttgtt 
atttttttat 
ttttttttaa 
tttttaaagt 
tttttttgag 
tttattgtaa 
ttgggattat 
gggttttatt 
ggttttttaa 



ttagaagatt 
attagaatat 
tattaggaaa 
tttagattat 
ttttggataa 
tttttttttt 
ttatgtgtta 
agataaggtt 
gttgggatta 
atggagtttt 
gttttgtttt 
aggtgtttgt 
gtgttagtta 
agtgttggga 



gtggttagtt 
aaatagagat 
tttttaggga 
ttggttggag 
agaggtttgg 
ttgtgttatt 
atttttagta 
ttgttgggtt 
taggtatgag 

ggtttgttat 

ttgggtttat 
tattatattt 
ggatggtttt 
ttataggtgt 



seq_prot . 
ttgtttggat 
gttgtagttt 
ttgttatgat 
gttgttatgt 
agattgttat 
tttttattta 
ttttttgaaa 
tttgggttta 
ttattatgtt 
ttaggttgga 
gttatttttt 
ggttaatttt 
gattttttga 
gagttattg 



txt 
ttgagaattt 
gaggatttat 
ttatgttgtt 
tgattgttga 
gaagtgtgat 
gtattttaat 
tttttgtgta 

aggggttttt 
tagttatatt 
gtgtagtggt 
tgttttagtt 
ttgtattttt 
ttttgtgatt 



attaattgta 
tttgaaattt 
tttttagtag 
gagtagatga 
attttttttt 
tttttataga 
tagttttttt 
ttattttatt 
tttttttttt 
ataattttgg 
tattgattag 
agtagagatg 
tgtttatttt 



<210> 388 
<211> 4492 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 388 

ttgtaggttg 
gttttagggg 
ataggtggta 
gtttttttta 
gttgtggtta 
tttttgtttt 
tatggttgtg 
tttggtatta 
tttttttagt 
gaatggtaga 
tattatatta 
tttttttttt 
aggtagagtt 
attttgattg 
aaggggtttt 
ttgttttaga 
tgtttaaata 
aattggtatt 
gttagggtaa 
ggggttaggg 
attgttttat 
tgtttttggt 
atgttattga 
aagtgttagt 
ggttgggtat 
tgtttaggat 
agggttttgg 
tttttagatg 
tttgatagtt 
ggatgagttt 
ttagagtgga 
tatttgtttt 
tgttgaaggg 
gaagggtttt 

gggttttttt 

ggaggttaga 
ttttgttatt 
ttattgtaga 
tttgggtttt 
ggtagtaaga 
agtggttttt 
atttttgttt 
ttagagtaat 



tgtttagttt 

tggtagtggt 

ggtataagtt 

ttggagaata 

tggttgtggt 

ttggttaagg 

tttatttgtt 

gttgtggtag 

ggggttatta 

gttaggtttt 

ttggtgtttt 

tgttggttgt 

gagttttttt 

atagttttat 

agtttttttg 

agagaagttt 

tagttgatag 

tttagtattg 

gattggagat 

gtgggtatat 

tgtttgttta 

tattgtagtt 

agttgttaga 

attgattatt 

agaggtagag 

ttggagtttt 

gtttagtttt 

ttttttattt 

gatgtgggtg 

ggggtattta 
ggagatggtg 
ggatttattt 
ttttttattg 
aggttttgag 
tttaggtttt 
ggttagggag 
gttatagatt 
gagtggatgg 
gaatatattt 
gtagagtttg 
gtatgatttt 
tttttatatt 



ttgtgtatta 
ttttattttt 
agtttatttt 
aggatagtta 
tttgttgtga 
aggggtgggg 
ttggtatatt 
ggaaggggtg 
gagtttattt 
tgatttttag 
gattttattt 
gtttttagtt 
taaaaggtta 
aattatagtt 
agggagaagt 
gaagatgttt 
ggagttgttt 
tttttttgag 
tttagaggtt 
gttgaggttt 
gttttttatt 
atgggtaggg 
gatattttgg 
ttaagaggtg 
ttttttgttt 

ggggttgttt 

tttgtttatt 
tggtttatag 
aaggtggtat 
ggttttattg 
tgagagtagg 
tggaaggtaa 
tatagttttt 
tttatttttt 
agaataggtg 
gggtttttta 
gtgtatttat 
tagtgagttt 
tggggtttgt 

ttatattggt 
atgaggaagg 
gatttagatg 

aggg_ttag_ta_. 



gggatagtaa ggaaaattta 
agagaagaag aagatatttt 
gtagttatta tagttgttgg 
tgtggtgtgg gatggttggt 
atggtagttt ttgtggttgt 
tgttatgttt gagatgtaga 
gtgttagttt tggttaatga 
aatgagaggt tttgggggtt 
gttgtatttt gtttaaagat 
tttaattttt ttttttttgt 
ttttaggttt tttaaatttt 
gtgttaggta aagtttaagg 
atagaagtaa agtttggttt 
ttttttagtt ggtttggttt 
aagatttgtt ttttttttat 
ttttagttgt tttgttgtat 
ttatattttt tttgattggg 
gagtagggtg gtgggtatta 
gttgaagtta ggatttttta 
gtttttagtg aaatgtgaaa 
attttttgag tatttttgaa 
agggttaagg ttgtatttat 
gggtaattag ggtttaggat 
gggaaagtga aaaggggtat 
taaggtagtt gtttttaggt 
tgtgtttata gagtttttat 
ttgtttttgg gagtaatagt 
tttttggtat tttgaagagg 
tgatggggtt tttgatgttt 
tttttttgtt tagtttatag 
gatgtttttg gggtagattt 
agagggaatt gtttttgagg 
tttttgttgg gggttgaggt 
taggtttaga ggtgtttatt 
tttgtttagg attttttttg 
tttgtttatg gatagtggta 
tttttggtat agtgaatata 
ttagtagtag ggggtatttt 
atttagttgt atagtttgtg 
attttttgtt tatgttgaag 
agtgttagtt ggtgttatta 
gttattaata tttttttgta 
tgagaagggg ^t-tttaggggt 
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agttagatta 
ggatgggttt 
tttttaagtt 
gggagttttg 
gatgtttaaa 
tgtggttagt 
attggggttg 
tgaattttgt 
atatagtagt 
gttttttttt 
ttagttatag 
tgtggggttt 
gagtaaatag 
tattaatatt 
attagaggtt 
tatggtaaag 
ttggtgttgg 
ggagggttgg 
gtatgaggtg 
ggaatttaag 
gaagtagatt 
gttgtttggg 
attattttta 
ggaggttgtt 
ttaggtttat 
taggttttgt 
ttttaaattt 
taggtttttt 
tagatatttt 
tagtattgtt 
gtatttttag 
ttggtgaagg 
ttttttggat 
ttaggtgtag 
gattgggttt 
ttgttttttt 
tttttgtgag 
agtttattta 
tagtagtttt 
tttttagtgt 
agtttagatt 
ttttggttag 
xjttatggtag - 



3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4479 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2-5S0- 



seq__prot.txt 

ggtttttagt ttagtgtttg gtaagttggg ttttggtttt tttgaagttt ttgttttgat 2640 

gtttgaggaa gggagggaga ggtagagata gggaaggagg gtattggaga agaggaattg 2700 

ttttttttgt tgttttgttt gtttaatttt tagtttattt tttttgtttt ttttagtggt 2760 

tgttgtagta aaggttatat ttttggaata ttgggtttgg gtgagttggg agataagatt 2820 

tttgttaagt ttagaattat taggttattg gaaggggaat tagtattgtg gatttggagg 2880 

gtagaaagaa ggtttattgg gtaatagttt tatttttttg agttttagtt tatttttagt 2940 

aaaataggat taataaagtt ttgttttatg gggtgttggg agattatatg aattggatta 3000 

gataaaatgt ttagtagttg gagtagttat aaattttttt tttaaatata gttattgatt 3060 

tatgattgtt tgattagata tatttttggt tgggtgttta tttgttatta tttatttagg 3120 

gtaggaaaaa gggagtggga ggagagattg taagtatttt ggggaatttt attttttagt 3180 

ataaaagaat aaagttttat ttttgggttt tttttttggg tgtattaatt tttttaggtt 3240 

ggaaatttgg gtaaatatta atttattgga ttggatttaa tttttgagtt ttttttgagt 3300 

aggttatttt ttgtttttag gtttagtttt tttatttgta aagagtgggt ttgtttaata 3360 

tatttttaga tttttgagag aaatatgatt tttttatttg gaaaatgttt attggtatga 3420 

aatatttgat tttgtgttta attttaggtg tttatagaat ttttgaaatt tttttattat 3480 

ttgaaaaaag tgattttgag attttagttg attttttttg ttttgtaatt agaggggatt 3540 

gtgatttggg ttaatttttt ttttatgggt tttagtggag ttagttgtgg aatggaatat 3600 

taatttttat tttataggta tgttgtgagg ttttgttaag aaatagttat aaaagggttt 3660 

taagagttat gatgattgtt ttaattattt agtagggagg gggtatttag tggtgttgtt 3720 

tttatgtatt gatgttatgt ggggtgggga ggtggggtgg agaatggaga gtgttttttt 3780 

attttttatt tttttttttt aagtttagtg tagggggagg tggttgtgat tgattgaggg 3840 

gttagggaag gttggttagg ggtgtgggtg tggttgggga tggtttgggg gtggggttgt 3900 

gtaggaggtt ggaggagttt tgtgggattt agaggtgggg tgttggtttg gggattattg 3960 

tttttttggg gtgtggttgt aggaggttgg aggagttttg tgggatttag aggtggggtg 4020 

ttggttgggg attgttgttt ttttggggtg tggttgttgt tgagtatttt ttagtttagt 4080 

tgagtttgtg tttaggttat gttttgtttt agttgttgtt ttttttattt tatggtgttt 4140 

ggagttattt tttgtttgtt tgtttttttt ttttgtttat tttttgtatt tgggtttgta 4200 

ttatttttgt taattgtttt tttgattttg ttgatatttt ttttttaaat ttttgattgg 4260 

tatttttgtt tggtattttt ttttttattt tttttttttt attttttttt tttgattttt 4320 

tttgggtttt ttttttttta aaatttgggt ttttttgtgt ggttttgttt ttaggttggg 4380 

gatgtttttt gtggttttgt gtttatggtt ttgatgttgt ttttttttgt ttataagttg 4440 

tgttgttttt ttgttatgtt gggtttatgg tggggtgttg agtggttggt gg 4492 

<210> 389 
<211> 4492 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 389 

ttgttagttg tttggtgttt tgttgtgggt ttgatatggt gaagaagtga tatagtttgt 60 

aggtggagag aagtagtgtt aggattatgg gtgtggggtt gtgggggata tttttggttt 120 

ggaggtgggg ttatgtggag agatttgggt tttgggaaaa gagggatttg agaggggttg 180 

gggaaagaag atggaggggg aggaatggag agaagggtat taggtaaggg tgttggttag 240 

aagtttgggg gaggagtgtt ggtaggattg ggggagtggt tggtaaaggt gatgtaggtt 300 

tagatgtagg gagtgggtga aaggagagag tgagtaggtg agggatggtt ttggatgttg 360 

tagggtagag gaggtggtgg ttggaataaa gtgtggtttg ggtatgggtt tggttgagtt 420 

gggagatgtt tggtagtagt tatgttttgg aggagtagtg gtttttggtt gatgttttgt 480 

ttttgggttt tgtggggttt ttttagtttt ttgtagttat gttttggagg agtagtggtt 540 

tttgggttga tgttttgttt ttgggttttg tggggttttt ttagtttttt gtgtagtttt 600 

atttttaagt tgtttttagt tatgtttatg tttttggtta gtttttttta gttttttagt 660 

tagttgtagt tatttttttt tgtgttggat ttggaggaag ggggtggaga atgagaggat 720 

gttttttgtt ttttgttttg tttttttgtt ttatgtgata ttggtatgtg ggagtaatat 780 

tgttggatgt ttttttttta ttaaatggtt aaaataatta ttataatttt taaggttttt 840 

ttatggttgt tttttagtga aattttataa tatgtttatg gggtgggaat tgatgtttta 900 

ttttatagtt gattttattg aagtttatga ggaagaaatt ggtttaaatt atagtttttt 960 

ttgattatag aatagaaaaa gttagttaaa attttaggat tatttttttt aggtgatggg 1020 

aggattttag aagttttgtg gatatttgaa attgggtata aaattaggtg ttttatgtta 1080 

gtgggtattt tttaggtaga gggattatat tttttttgag agtttaaaag tgtgttgaat 1140 

aagtttattt tttatagatg gggagattga gtttggggat agggagtggt ttgtttagaa 1200 

aagatttaga aattaaattt agtttagtgg gttgatattt atttaaattt ttagtttggg 1260 

gagattgatg tatttaagag aagaatttag aaatgaaatt ttgttttttt atgttaaaaa 1320 

ataaaatttt ttagagtgtt tataattttt ttttttattt tttttttttt gttttaaata 1380 

aataatggtg a&tgagtatt tagttaggga tgtgtttgat 1 taaataatta tggattaata 1440 

gttaitgtttg gagaaggaat ttgtggttgt tttagttatt gggtattttg tttggtttag 1 1500 
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tttatgtaat 
tgaattaaga 
tttatggtgt 
tttttagttt 
agggggtaga 
tttttttagt 
ggagatttta 
gtatttttga 
ggtatagggg 
tttggtggta 
ggttttaatg 
tgtatagatt 
ttggggtgtt 
gatgtgttta 
aatgttattg 
tttaaaggga 
gaggtgggtg 
agattttagt 
gattttaaga 
gtgggtttgt 
tgttatgaat 
tggggtattg 
tattttttta 
gagttattgt 
tggtgggagt 
ggatttggga 
ttatattttt 
gtgttttgga 
atgtggatgt 
tttttaaggg 
ttttttatat 
gttgggagat 
tttaatgttt 
agtttagtta 
tgatgtagtg 
gtatggagag 
tagagttagg 
ttaggttaga 
tattttggat 
gaggagttta 
gtataggggg 
gtgtttttgg 
tggattttta 
gtttattagt 
tatttatatt 
ttgtaattgt 
ttgttggtta 
agttaataat 
ttagagtgtt 
tttgggtttt 



tttttaatat 
tttagagaga 
taattttttt 
gtttaggttt 
ggggatgggt 
gttttttttt 
gggagattag 
gatttttttt 
agtgttggtg 
ttgattgata 
tggataagaa 
atatggttga 
ttttgttgtt 
ttatgttgga 
tttatgaata 
attttgagta 
tttttaagtt 
ttttagtaga 
atggtttttt 
tttaggagtg 
tggatgaaaa 
agggttttat 
aggtgttagg 
ttttagggat 
tttgtgagta 
atggttgttt 
ttttattttt 
ttttgattat 
agttttggtt 
tatttagggg 
tttattgggg 
tttaatttta 
attattttgt 
ggaggagtgt 
ggatagttgg 
agggtaagtt 
rtrggttggag 
ttttgttttt 
tttgtttagt 

gggggtttgg 

aagggaattg 
gtaaggtgta 
agatttttta 
tgggattggt 
ttaggtatgg 
aaaggttatt 
ttttgtgtta 
tatgatgatt 
tttttttttt: 
ttttgttgtt 



tttatgaagt 

taaagttgtt 

tttgatgatt 

agtgttttag 

tggaggttga 

tttgtttttg 

agtttagttt 

tatattggtt 

gttatttggg 

tttttttttt 

gtgttagtgt 

gtgtaagttt 

ggagatttat 

ggatgagtat 

ggtggggggt 

gatatttgtt 

tgaggggata 

ggaggagttg 

ttttgttttt 

tttttgtttt 

ggtagtgagg 

tgatgttgtt 

ggttgtgggt 

agggtggata 

tagggtagtt 

tagggtaagg 

tttatttttt 

ttttgaaata 

ttttttgttt 

gtggtgggag 

ataggtttta 

atagtttttg 

tttttaggga 

gaaggtagtt 

gaggatattt 

ttgttttttt 

ggggttgtgg 

gttgattttt 
atagttgaga 

gggggtgggg 

gattgagagt 
gtagatgaat 
tttatttttt 
atggtgtgtt 
tattttgttt 
gtttataatg 
tgtggttgtt 
ataggatgga 
ttggaggtag 
tttggtgtat 



seq_prot . 
aaggttttgt 
gtttaatgag 
taatgatttt 
gaatgtgatt 
gtaaatagag 
tttttttttt 
gttaggtatt 
ttgttggtat 
ttaatgtagg 
atagagttat 
gatgagtttt 
taaggtgtgt 
tattatttat 
atggtttatg 
ttttttttga 
ttgaaatttg 
gatttaaggt 
tgtagtggga 
tgaggtgaat 
tatattattt 
tttgggtatt 
tttatttgta 
tagggtagaa 
gggaagttgg 
ttaagatttt 
gattttgttt 
agggtagtta 
tttttgatgg 
atagttatag 
attgagtagg 
gtatgtgttt 
ggatttttag 
aatagtattg 
ttttgttggt 
ttgagttttt 
tttaaaaggg 
ttgtggagtt 
tggggaaagt 
gtatagttag 
ttgagatatt 
taaaggtttg 
tttaatggtt 
ttttgttata 
agggtaagtg 
ttttttggtt 
aggttatagt 
tttgtttttt 
ttggtttgtg 
gagttgttgt 
aagggttgaa 



txt 

taattttatt 

tttttttttt 

gagtttggta 

tttgttgtag 

tagtagaaaa 

ttttttttag 

gagttagaag 

gggttgtttt 

gagggtgagg 

gtaagggttg 

gtttttatta 

ttagagttta 

tttttgtagt 

atgatggtga 

tttttgattt 

aggaagaggt 

ttgagatttt 

agttttttga 

ttagggtagg 

tttttatttt 

ttaagtttat 

ttaattgtta 

agtatttagg 

atttagggtt 

aggttttggg 

ttgtgtttag 

gtattggtgt 

ttttgatggt 

tggttgggag 

tagtggagta 

atttttgatt 

ttttgttttg 

ggagtattag 

tgtgtttgaa 

tttttggggt 

ttgaaatttt 

attgattaaa 

ttagttttat 

tagagggagg 

agtgatatgg 

gttttgttat 

ttgttggaag 

gttggtatta 

gatgtagtta 

aagaaataaa 

tgtggttgta 

agtgaggaga 

tttgttattt 

tatttttgaa 

tgtagtttgt 



ttattgaaaa 
gttttttaga 
aaggttttat 
tagttgttgg 
ggtagttttt 
gtattagagt 
ttttgttatg 
ggttgattaa 
gtggtttggg 
ttgtattgag 
ttagagttgt 
ggtgggtggg 
gatttgtggg 
ggagaaaaat 
ttaggtttag 
ttttgtgttt 
ttatttaggg 
tgtttttatt 
tattggggat 
agggtagtat 
ttgagatgtt 
ggggaagatt 
gagggtttga 
ttgtaggatt 
tagtgaattt 
ttagtggttt 
tttgaggatg 
attttggata 
tgggtttatt 
gttttggatt 
tttattttat 
gtttagtttt 
ttttagtatt 
tattttgtta 
agaatttttg 
ttggtattgg 
gtttgtttgt 
ttggatttta 
ggttgtggtt 
tggagggaaa 
ttgttgttgt 
gggtaagatt 
gatagtttta 
tggttggttg 
ggtttaggta 
attagaattt 
gtaatttggg 
gtgaatttat 
gttggtttag 



<210> 390 
<211> 4448 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA CHomo sapiens) 
<400> 390 

ttagtatttt 
ggttgggggg 
aatgttaggg 
ttttgtattt 
tggtggtagt 
agattttgga 
gatggggata 



1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4492 



aggagttgat 
aatttgtaag 
agaggtttgt 
tgaaggttag 
atagattttt 
agggattttt 
gagatgtgtt 



gagtttttat 
gaatatgtgg 
ttaattggtt 
tagagaggag 
agatgtttaa 
atggtagaat 
taaggtaaaa 



ttgggtatta tttttttttg 
aattttatat ggtgtttttg 
agggttttga aattttgatt 
gaattttgag ttttttagga 
atggagttgt agagagaagg 
ttttatttta taagtaggga 
Jta^^atgg„.agg.tagggiia-. 
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ggtttgagaa 
atattgtggt 
atttttttat 
ggtgggtagg 
tttttttggt 
tagaggttta 
gggagaggtt- 



60 
120 
180 
240 
300 
360 
—420 



seq_prot.txt 

aggttttaga tttggtttta gtattttttt tatatatata ttgatataga agttggttta 480 

gatttataat ttaatggttg tgtagatttt agttttttat ttttgttgtt tatgtgttta 540 

ttggtttggg gttagggttt tttgtttaaa ggtatggatg tgtgggagtt tttttgttag 600 

gtatgtgttt attagtttgg ggtgagagat ggggttagga aaaggttagt ggtgattgta 660 

tattttgagg ttagggtgat ttaagagatt ggtgttttta aggttgtagt ttttaggttg 720 

tgggaatatg tatttggaga gagggtttat attgaaatat ttgaagtagg ggttatgaag 780 

gtgggatttt taaggttttt gagttatgtt agggggagtg tgggagttag gttgtagggg 840 

tagggggagt ggttggggtt gtggtatagg agttaggtaa ggtgttttta gggttttatt 900 

tgtgtttttg aagtagtggt ttttttttga atttggttga tattattagg atttggaagt 960 

tataggagta atggttgagg gttgtgtttt ttatttttta ttgagtggaa gaatatgaat 1020 

ggtgttgaat ttgttttttg agtttttttt tttttgtttg gtggtttaat ttattatatg 1080 

ttttgttttt aggttttgtt gtagtttttt gtggaggtat ttggtgttga gttttatggt 1140 

ggtagtttag ttggaatggt agtttagtag ggatataatt ttagtttggg tgttggttat 1200 

gttattttgt tgttttatag gagttatttt tgttggaaaa tttatttttg tttttattaa 1260 

ggtgtatgtt aatgtagtat ttttgttttt aagtagttgg gttttttagt ggtttaggtt 1320 

ttttgtgttt gttatgtgtt agttaattag agtttgagga aggtgggatt tgggtggagt 1380 

tgatgttgaa tggtatgtgt ttgttgattg gatagtagtt tggtttttgt ggttggattt 1440 

ttatatttgt ttttagatag gagaggggta tgtattggtg ttatggtttt ttgtaggttg 1500 

tttttggata gtttttgaga gtttgttttg aagtaagtat tttgtagttt tagtgattta 1560 

gttttttttt ggattttagg ttatggtaat taattttttg ttgtggtttt tgaattttaa 1620 

ggtttgatgg gttttgtggg ggttgtggtg aggttagggt gtttttttgg tgtttatttg 1680 

ttttattgtt tgaatttttg gtattggggt tgtgtttatg tggggtttat tgttaatatt 1740 

gtttgttggg tttagtagta taatgggggt tgtaaaaaag gtggggtttg gtggattagt 1800 

gggtgagggt tttggtgaga gggggaggtt ttttgtgttg ggaggaaggt gttttagagg 1860 

aggtggattt tgtggggtat aggtttgttg agaaggattg gttaggattg tgatagaggt 1920 

ggggtgtttg tggtggggat ggggtttgtt gtaggggagg agtgtgatgg ggaggtggtg 1980 

tttggggtat ttttgtggtg gaatttaggg aaggagttag ttggggttgg ggtgatgatt 2040 

taggttgggt tttagtatag ggtttttggt gggtatgttg gggttgggtg gaatgtagga 2100 

gagagaggag gtgggatagg tgggtatttg ggttggaggt agggtttgag gtgggtaggt 2160 

gtagagggtt gtattttttg gttgaagttg ggaatgagga ttttgttttt gggtgggatt 2220 

ggaggggatt tgtggttgag gtgttgggtt gtgataggga tattattgtt ttttttttta 2280 

gggagatggt gtttgttgtg tatggttttt tgttttttgt ggttgttttt tgggtaagga 2340 

aggagattgg gtagtggtgg ttgggtgagg gtttgggttt tgttttttgt tttgagttgt 2400 

tttgatgttt ttgtatgttg tttgtagttt gagtgggttt ggtagtattt gtatgttttg 2460 

tgttgtttgg tgtatgtggg gttttgtttg tgaggagaga tagtttttgt tttttatttg 2520 

tgtgtgttgg tgtatatgtt gtgtgtattg ttttgggttt gttttttgtt tatgttgtgt 2580 

tgggtgtgtt tagatttttg ttaggatttt tgttttttgt tgttgttgta tgtgtttttg 2640 

gtttttgggt ttttattgtt ttaggttttg gttttaaggt gttgtgtagg tttttttgat 2700 

gatgtggagt ttgagttaga ttagggggat ttaggggttt gttgtttttt gtgtgtttag 2760 

attatttagg tgaggttttt ttgaggaatg gatggtttta gagtttagat gattttggat 2820 

ttagtttttt tttgagggaa atttattgat atgttttggt tattttattt ttatttttaa 2880 

gatgttgatg ttataggatt tatggttatt tttgagtaaa gtaggttttg tttagggttt 2940 

ttattatttt ttttttaaag aaaaggtttt tttttaggtt attagtattt tgttgtattt 3000 

tagtgtggtt gatagagtat gatttttttt tttttttttt ttttttttga gatggagttt 3060 

agttttgttg tttaggttag agtgtagtgg tgtgattttt gtttattgta agttttgttt 3120 

tttgggttta tgttattttt ttgttttagt tttttgagta gttgggatta taggtgtttg 3180 

ttattatgtt tggttaattt tttgtatttt tagtagagat ggggttttat tgtgttagtt 3240 

aggatggttt tgatgtttta gttttgtgat ttgtttgttt tagtttttta aagtgttggg 3300 

attataggta tgagttattg tgtttggttt tttttttttt tttagataga gttttgtttt 3360 

gttgtggagg ttggagtgta gtgattttag tttattggaa gttttgtttt ttgggtttaa 3420 

gtgatttttt tattttaatt ttttaagtgt gttattatat ttgttgaatg tttgtgtttt 3480 

tagtagggat ggggtttgtt atgttgggta ggttggtttt aaatttttga ttttaagtga 3540 

tttgtttgtt ttggtttttt agagtgttgg gattataggt gtgagttatt atgtttggat 3600 

atgattttgt ttttaaaata atgtaatgaa ttgtagttta tttttgtgtg tatttttttg 3660 

tatttttttt tttttaatat ttattattgt ttggtatgtt atgtgttttt ttgattttat 3720 

tagttttttt tattttaatt gttatattgt attattagag ggtagtattt attattgttg 3780 

tagaataaag ttttgtattg agttgatggt ttagaaattt tttttttatt tatttattta 3840 

tttttttttt tgagatagag ttttattttg ttgtttaggt tggagtgtag tggtgtgatt 3900 

ttagtttatt gtaatttttg ttttttggtt ttaagtaatt tttttttttt agtttttaga 3960 

gtagttggga ttatatgtgt ttgttattgt gtttggttka tttttgtatt tttagtagag 4020 

atggggtttt aattatgttg gttaggttgg ttttaaattt ttgattttag gtgattttgt 4080 

ttatttaagt tttttaaaatf gttggtatta taggtatgag ttattgtgtt tggtttaaat 4140 

atttaataaa ataatggatg atgggtgttt tttattgagt' ttttggtaat tgtgagtgag 4200 

tagaggattt gttttgggga tatttagtga tttgttgggt gttgttgagt tgtgaggaag 4260 

tttaggtttg gttgtagtgg tgaggttgtg atttaattaa ttattgttga tgtttttagg 4320 

atttgtatta gtttagtttt aggggtaagg attttaattg tttattttag tttttttatt 4380 

tgtaagatgt aaataata^t Jtatttttgtt ttatg^gatg gagttgtgta atgtttgtaa 4440 

tagtgttt 1 * ri ' 2 f / J * - : - 4448 
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<210> 391 
<211> 4448 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 391 

aggtattgtt gtgggtatta tatagtttta ttttatgagg taggggtgat tgttatttgt 60 

attttataaa tgaagaaatt gaggtggata gttaaaattt ttgtttttag gattaagttg 120 

gtgtaggttt tgggagtatt agtagtgatt gattgagtta tagttttatt attgtagtta 180 

gatttgaatt tttttatagt ttagtaatat ttagtaggtt attgaatgtt tttagggtaa 240 

gttttttatt tatttataat tattgggagt ttagtagaag gtatttattg tttattattt 300 

tattaaatat ttaggttggg tatggtggtt tatgtttgta atattagtat tttgggaggt 360 

ttaggtgggt aggattattt gaggttagga gtttgagatt agtttggtta atatggttga 420 

aattttattt ttattaaaaa tataaaaatt agttaggtgt ggtggtaggt gtgtgtaatt 480 

ttagttattt tggaggttga ggaaggagaa ttgtttgaaa ttaggaggta gaggttgtag 540 

tgagttgaga ttgtgttatt atattttagt ttgagtgata gagtaagatt ttgttttaaa 600 

aaaaaaaata aataaataaa taaaaagaaa atttttgggt tattggttta gtgtaggatt 660 

ttgttttata gtagtgataa gtgttgtttt ttgatagtgt agtgtggtag ttggggtggg 720 

agaggttaat gagattaaga aaatatgtag tatgttagat aatagtaagt gttaagggaa 780 

agggggtgta gaaaagtgta tataagaatg agttgtagtt tattatattg ttttagagat 840 

agggttatgt ttaggtatgg tggtttatgt ttataatttt agtattttgg gaggttgagg 900 

tgggtagatt atttgaggtt aggagtttga gattagtttg tttaatatgg tgaattttat 960 

ttttattaaa aatataaata tttggtaggt gtggtggtat atttgggagg ttgagatagg 1020 

agaattgttt gaatttagga ggtggagttt ttagtgagtt gagattattg tattttagtt 1080 

tttgtaatag agtaagattt tgtttaaaaa aaaaaaaagg gttgggtgtg gtggtttatg 1140 

t H g HH tt ta 9*attttg ggaggttgag gtgggtggat tatgaggtta ggatattgag 1200 

attattttgg ttaatatggt gaaattttgt ttttattaaa aatataaaaa attaattagg 1260 

tgtggtggtg ggtgtttgta gttttagtta tttgggaggt tgaggtagga gaatggtgtg 1320 

aatttaggag gtggagtttg tagtgagtag agattgtgtt attgtatttt agtttgggtg 1380 

atagagttag attttatttt aaaaaaaaaa aaaaaaaaaa agagggttat gttttgttga 1440 

ttatattgga gtgtagtgga gtattgatgg tttagggaag aatttttttt ttgaagggga 1500 

aatggtgaag gttttgagta gggtttgttt tgtttaggaa tggttgtaag ttttatagta 1560 

ttagtatttt agagatgggg atagggtggt taaagtgtgt tagtgggttt tttttaagaa 1620 

agagttggat ttgaagttgt ttggatttta gagttattta ttttttgggg ggattttatt 1680 

tggatggttt gggtgtatag ggagtagtag gtttttggat ttttttggtt tggtttaggt 1740 

tttgtgttat taaagggatt tgtgtggtgt tttggggttg gggtttgggg tggtggaggt 1800 

ttgaaggtta gaggtatgtg tggtggtggt gggaggtaga ggttttggtg gaggtttggg 1860 

tgtatttagt gtagtgtgag tgggaggtga gtttagggtg gtgtgtgtag tatgtgtgtt 1920 

agtatgtgta ggtggaaagt gaaggttgtt tttttttgta ggtgaggttt tatgtgtgtt 1980 

aggtggtgtg aggtgtgtgg gtgttgttaa gtttatttaa attgtgggtg gtgtataggg 2040 

gtattagggt ggtttggagt gaggggtgga atttaagttt ttatttggtt gttgttgttt 2100 

ggtttttttt tttatttgaa gggtggttat ggaggatggg aagttgtgta tgatgagtat 2160 

tatttttttg aggaggagaa tggtgatgtt tttgttgtag tttagtgttt tagttgtggg 2220 

ttttttttaa ttttatttga gagtggggtt tttatttttg gttttagttg agaagtgtag 2280 

ttttttgtat ttatttattt taggttttgt ttttagttta ggtatttatt tgttttattt 2340 

tttttttttt ttgtatttta tttgatttta gtgtgtttat taagagtttt atgttggaat 2400 

ttagtttgga ttattatttt ggttttagtt agtttttttt ttgagttttg ttgtggagat 2460 

gttttgggta ttgttttttt gttatgtttt ttttttgtga taggttttgt ttttattata 2520 

gatgttttgt ttttgttata gttttggttg gtttttttta ataggtttgt gttttgtggg 2580 

gtttgttttt tttgggatgt tttttttttg gtatagaagg tttttttttt ttattagggt 2640 

ttttgtttgt tggtttgtta ggttttgttt tttttgtggt ttttattgtg ttgttggatt 2700 

tgatgagtag tgttgatagt gaattttatg tagatgtggt tttggtattg ggggtttagg 2760 

tagtagagta ggtagatgtt gaaaaagtgt tttggttttg ttgtgatttt tgtgggattt 2820 

attgggtttt ggggtttggg aattatagta gggggttgat tgttgtgatt tagggtttag 2880 

gggagggttg gattgttagg gttgtagggt gtttgttttg gaataagttt ttggggatta 2940 

tttggaggta gtttgtagga agttgtagtg ttggtatgtg tttttttttt gtttggaggt 3000 

gggtgtagaa gtttgattgt ggaagttaga ttgttgttta gttggtgagt gtgtattatt 3060 

tagtattggt tttgtttgag ttttattttt tttaggtttt gattggttga tatatggtaa 3120 

gtgtgaaaag tttggattgt tggaaagttt ggttgtttga gggtggaagt attgtgttga 3180 

tgtatgtttt agtaagggtg gaagtgagtt ttttagtgga agtggttttt gtaaggtagt 3240 

aaggtagtgt ggttggtgtt tgagttgggg ttgtgttttt gttgggttgt tgttttagtt 3300 

ggattgttgt tatggaattt agtgttgaat atttttgtga gaagttgtag tgggatttgg 3360 

aggtggaata_tg_tgg.tgagt-- tgagttgtng_ gataagggga aaagagtttg gggggtgggt- 3420 - 
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ttggtattat ttatattttt ttgtttggta ggaggtggag gatatgattt ttaattgttg 3480 

tttttgtagt ttttgagttt tggtggtgtt ggttaagttt gaggggaaat tgttgtttta 3540 

gagatatagg taaggttttg ggagtatttt gtttagtttt tgtattataa ttttagttat 3600 

tttttttatt tttgtaattt gatttttgta ttttttttgg tgtggtttag ggattttgag 3660 

gattttgttt ttgtgatttt tgttttgagt attttgatgt agattttttt tttaagtatg 3720 

tgtttttatg gtttggggat tataattttg gggatattag ttttttgggt tattttagtt 3780 

ttggggtgtg tgattattgt tggttttttt ttaattttat tttttgtttt aggttggtga 3840 

atgtgtgttt agtagaagag tttttgtata tttatgtttt tgaatagaaa attttgattt 3900 

tagattagtg ggtatgtgag tgatagaaat gagggattgg gatttgtata gttattaaat 3960 

tataaatttg gattagtttt tgtgttgatg tgtgtgtggg aggggtgttg aggttgagtt 4020 

taggatttgg tttttttttg ttttgttttt gtttattgtt ttattttgga tatgtttttg 4080 

tttttattta ggtttttatt tttgtttgta aaatgaaagt tttattgtaa aagttttttt 4140 

tagggtttat tgagaaagtt ttttttttgt aattttgttt agatatttag gagtttgtat 4200 

tgttattatt tatttatttt ttagagagtt tagggttttt tttttttgtt agtttttaag 4260 

atgtagagat gagaaagtga ttggggtttt agggttttga ttagttgaat aaattttttt 4320 

ttgatattat tgtagtatta gaggtattat atagaatttt atatattttt tgtaggtttt 4380 

ttttggtttt tttaggttta agagggagtg gtgtttaggt aggggtttat taatttttgg 4440 

aatgttga 4448 

<210> 392 
<211> 4001 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 392 

gagtttaatg tttgtgattt tttttttttt aaaaattttt gaggtaatat ttatatttag 60 

tgaaatgtat agattttaag tggattattt tattagtttg ataaatgttt gtaaggtgga 120 

atttaagttt ttattagtat ttgttaaatt tatgttagag gttgattaga gaagtatagt 180 

ttttaagatt tttattttgt tttgtggaga atggtaaaaa tttgtgaaat attttagtag 240 

agttaggaga gagtaatatt ggtaatgtga agtatgttaa gttaagaaag gaagtattga 300 

aattaattgt tttgaatgga atttgataag aatttgttat tttatttttt taagattgga 360 

tttaagttgg gagagaatat tgttttttgt tttgtttaag ttaatattat tgtttagatt 420 

tataaatttt tttatttttt aatgtttata tattaatttg tattataagt taaaataata 480 

tagagaattt ggaaaaagaa gaaggggaat gttttttatg aaaagtatat tttttaaagt 540 

gtttggtttg agagttatat agtatgtata aaatttgata ataagtatat tatgatttat 600 

attgtatata ataaaaagta gggtattttt gaagggtttt gagaaaaagt tttgggtttt 660 

taaaatttgt ttaagaaggt gattaatggt atattatttt taaatgtagt attgatttgt 720 

atgttttata gttattaata gtgaaaataa tagtttttaa taagagttgg aaatattgag 780 

ttgagggttt tttttttttt tttaaagttt gaggttaaaa ttgatattta tttttttgaa 840 

ttatgtttat attttttttt tggttaatat gtatgtggtg gtataatttt gaaaattatg 900 

taatgttata gaattattat ttagaaatga attttgttaa taaatttttt gtatttaaat 960 

tttttttatt gtatagtttt tgataatgtt ttttagataa tttttttttt agtaattagt 1020 

atttttaaat aaaaattata gagaatagta agtttttttt tttttttttg gtagatttga 1080 

tttagaaatt gttatgggaa gaaagtgtta attatattaa aaaatagttt gatagaaagt 1140 

atttaaaaag agaaagggag aatattatgt ttttattttg gtgaattagt aataaagaaa 1200 

aagattagta tggatgggta ttttttaaaa atatattttt ttttttttgg tttttgttag 1260 

ggtggaggaa gttgtttttt tgttagagat agggtggaag agagtgaaag gataaatgat 1320 

tgagaggttg ttttttttta ttggtgtagg tgtgtggggg ttggatgggg ggttgtggag 1380 

gggggaggtg gttagtaggt gtttgggttt taggttttta tttttatttt tgtttttatt 1440 

tttatttata tttagggatt ggttttgttt tttggtgggt gagtggtagg tgttgaagta 1500 

ttggggtggg gggtgtaaat tttgtgggta gtggaaaaga ggttgtgggg ggttttttag 1560 

tgttggtaga tattgtgagg ttggtagttg ttggtatgta tatttagttt gtagttttga 1620 

ggaatatgtt tgtagttagg gtgtggagta gagttttggg taggagaatt aagggagggt 1680 

gtgtgttgtg gtggtggtgg tagtggtagt ggagttgtta gttttttttt ttttgagtga 1740 

gagaatgtta tatttaggaa tatagttaat tagggaagtg aaagattttt ataatgagaa 1800 

ttattaaata ttgtttaaag aaagtataga tgatattaat taatggaaaa atattttatg 1860 

attatggata ggagagagta atattattaa aatggttata ttgtttagag taatttatgg 1920 

atttaatgtt atttttatta aattattaat gatattgttt atagaattag aagaaattat 1980 

tttaaaattt atgggttgtg tttggggtgt tgtttttgta gtttgtggtg tttttgtttg 2040 

tgtttttgta gggttttttt tgtgttgttg ttgttggggg tttgggtgtt tgtggtttgg 2100 

tgatggtagt ggttgtagtg ggtatggttg aaggattggg aggtggtggt ttggtttgaa 2160 

ttgttgtgaa gaaagtgtaa ttatgttttg ttttgtgggt taagaaattg gagaaatttg 2220 

gagtgtattt tgtttgtaag gtatgtgttt gttgtttttg gattgtggat gggtgttagg 2280 

gytttagttt gtgggatttt tttttttttt ttgtttttat tt { ttgtt : ttt tgttttttgt 2340 
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tttttgtttt 

tagttttttt 

agttttttat 

tggggttttg 

tttttatgtg 

ttttgtaatt 

ggagaagatt 

ggaaggtggg 

ggaggtattg 

aaatgttgtt 

gtgggtgtta 

ttaggtattt 

gaggattgtt 

gatagggtga 

tgaggtaggg 

tgtaaagagg 

attatttttg 

tttttggtat 

atattatatg 

ttgttgagtg 

ttttggggaa 

agtttgtatt 

tttagatttg 

ataataatgt 

tgtttttaag 

taggatattg 

tgggtatagg 

agaatttatg 



ttgtttgggg 

ttggataaga 

tttttttgtt 

ttgttttgta 

gtgggtgatt 

gatttttttt 

tttggatttt 

tagagtttga 

ttgtgtgtgg 

rtagttgttt 

ttgtttagtt 

tgttattttt 

tgaggttgtg 

tagttgaggg 

aagagatata 

gttaagtttt 

aattttgata 

ttgtaggagt 

tttaggttta 

aatgtaggat 

gttggttaaa 

tttgataaat 

gataatatat 

tatgtggtta 

aaaggtgtaa 

attgtagtag 

gtatattgtt 

taattaaata 



ttgttgtatt 

gttttgttgg 

tgaaaggagt 

ggagtttgtt 

taggggtaag 

tgtgttttat 

tggtgttttg 

ggtgagggtt 

tgatagtttt 

gaggtgtaaa 

tgaggatatt 

agttgtagat 

gggtaggttg 

gggtggggat 

tttatatata 

tagattttag 

taattataat 

tttttagtta 

tgttaagtat 

ttagtttagt 

atgttgatta 

ttttaagtga 

tatttgtttt 

tttttgaaga 

gtttaggagt 

gtagaagggt 

tgggtgatga 

ttatttgttt 



seq_prot . 
gtggaagtgt 
gtttgaagaa 
agttttggta 
ggagttattt 
gaaaaatttt 
tgggaggaat 
ggaaagtgaa 
ttgtggtgtt 
gttattattg 
taggatttgg 
tattttttgt 
ttagatgtgg 
gttttgtgtt 
ttggttttgt 
tttatatttt 
atttttgatt 
ttattttatt 
taatggaatg 
ttttgtattt 
gttttttaaa 
tgaattaggt 
tgttatgtta 
ttttgatttt 
aggaatgtaa 
tgattatgaa 
gagttatgta 
ttatattaaa 
tataaaaatt 



txt 

ttttgttgtt 

gggggattat 

tttgagtttt 

ttttgggagt 

ttgggggatt 

"tgggtgtaag 

ggggaggaaa 

tggtgttttt 

tattattatt 

taaagtgata 

tatttatttt 

aggttgtgtt 

gttagtagtg 

tagtttttgt 

tttgtatttt 

gttataatta 

agaatattta 

tttatgtttg 

tatattggtt 

ttttaatgtg 

ggtttgaggt 

gtttatagat 

agtagaaaag 

aaaagatgaa 

gggtattaga 

tagaattaag 

atttztataaa 

t 



tgtgtttaat 

taagatggag 

tgggttattt 

ggagtggtgt 

gagtggtttt 

gagtgggggt 

gtgtggggtg 

ggttgggggt 

tgtttagtta 

gtttagtgtg 

ttatgttagt 

tttgaggtgg 

gttttgtttt 

ggtttgtggt 

ttttgtttgt 

agttgtagtg 

ggttatattt 

tttttttttt 

taataaatat 

tatgggaatt 

ggaagttgag 

tatattttaa 

atagaaataa 

gtggggagtg 

tgaatgagaa 

aataaggtta 

ttattattaa 



<210> 393 
<211> 4001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 393 

aggtttttgt 
ttttggtgta 
gtatatagtt 
ttttatgatt 
tttgtatttt 
taaaattaaa 
ttagtatagt 
tatttgattt 
gtttgaggag 
aggatataga 
atattttatt 
tagtagaata 
taattgggaa 
ggaggtgtga 
gataaggtta 
tagtgtaggg 
tttatatttg 
agtaagaaat 
gttaaatttt 
ataataatga 
aggtattata 
ttttgttttt 
agtttttttt 
aaagattttt 
aaagtgattt . 
atgttgaagt 
tttttttaaa 
agtatttttg 
ggtggaagtg 



ggaataggtg 
gttattattt 
tgtttttttg 
aatttttggg 
ttttttagaa 

ggggataaat 

attatttggg 
ataattagta 
tattgggtta 
ggtgtttaat 
gtgattgaga 
aattatagtt 
tttggaattt 
gtgtgtgtga 
agtttttgtt 
ttagtttgtt 
gatttgtagt 

gggtgttttt 

gtttatattt 
tagggttgtt 
agatttttgt 
ttaaagtgtt 
agtgggatat 
ttttgttttt 
tagtgagttt 
tatttttttt 
ftttagtagga 
tggtgtagtg * 

gga_ggggg_ag__ 



gtgtttgatt 
aagtagtatg 
tttattatag 
tttgtatttt 
ataattatat 
aatatattgt 
agtttgttag 
ttttaattag 
gattttatat 
ataaatttga 
agtttttgta 
atgttaaaat 
gggaatttga 
gtgtgttttt 
ttttttagtt 
ttgtggtttt 
tggaggtgat 
aaattgagta 
tgagtagttg 
attgtgtgtg 
tttggatttt 
aggagtttgg 
aggaagggat 
gggttatttg 
ttatagagta 
aagtggaagg 
tttttattta 
gttttaggtg 
gssssttttg. 



atataagttt tttaatggtg 
ttttgtattt atgattttgt 
ttggtgtttt gtttttattt 
ttttagaagt atatttttta 
agtattatta tttgtttttg 
ttaggtttga attggaatgt 
aaatgtaaat ttttagtttt 
tttttttagg agatttttat 
ttatttaata aatatttatt 
gtatatggtg tggaaaaaga 
agtattaaga agagtataat 
ttagaagtgg ttattgtagt 
ttttttttat agtaggtagg 
tttttgtttt agttataggt 
gttattttgt taaaataaag 
gggtaatttt tttattttgg 
agggtgttta aattggtatg 
gtagtgtttg ttgtgttggg 
ggatagtgtt ttaattaaat 
gtagtgtttt tgtttttagt 
gtttattttt ttattttgtg 
gagttttttt tatttttatt 
tgattgtaga agggattatt 
ttatatggag agtattgttt 
ataaaatttt aaggtagttt 
aatgaagggt tttttgtttt 
agaagaagtt aattgggtgt 
ggaggtgaga ggtaggaggt 
_tgggttgggi:-t-tt-tagtat^ 
Seite 360 



atttgtgaga 

ttttaatttt 

gtttaatgtt 

ttttattttt 

ttttttttgt 

ggtttgtgga 

tattttagat 

gtatattaaa 

gaattagtgt 

gtaagtatag 

ttagatgttt 

ttggttgtga 

aagaatatag 

tgtaggagtt 

ttgttgttga 

gaatatagtt 

gagagtgggt 

ttgttatttt 

aagtaataat 

tgagggtgtt 

tttttttttt 

ttttgtattt 

tagtttttta 

tatttttggg 

gggagtttgg 

agtgattttt 

gggtgataag 

gggaggtgga 

tatt-tgtggt 



2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4001 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
.1440 
1500 
1560 
1620 
1680 
1-740- 



seq_prot .txt 

ttgagagtgg tgagtgtgta ttttgtaggt ggagtatatt ttgagttttt ttagtttttt 1800 

ggtttgtgga gtggagtgta gttgtgtttt ttttatggtg atttgggttg agttattgtt 1860 

ttttggtttt ttggttgtgt ttgttgtagt tattgttgtt gttggattgt aggtgtttga 1920 

gtttttggtg gtagtggtgt agggggagtt ttgtgggggt gtgggtggaa gtgttgtagg 1980 

ttgtgggggt agtgttttgg gtatggttta tgaattttaa aatagttttt tttagttttg 2040 

tgaataatgt tattggtagt ttaataggaa taatattgaa tttataaatt attttgggta 2100 

gtatggttat tttaatgata ttgttttttt ttatttatga ttatggaatg tttttttatt 2160 

agttgatgtt gtttgtgttt tttttgagta gtgtttagta atttttatta tagagatttt 2220 

ttattttttt ggttagttgt atttttaggt atggtatttt tttatttagg agggagggga 2280 

ttagtggttt tgttgttgtt gttgttgttg ttatagtata tgtttttttt tggttttttt 2340 

gtttgggatt ttgttttgtg ttttggttgt ggatatgttt tttgggattg tggattaggt 2400 

gtgtgtgttg gtggttgtta gttttatggt gtttgttagt gttgggaggt tttttatggt 2460 

tttttttttg ttgtttgtgg ggtttgtatt ttttatttta gtgttttgat atttattgtt 2520 

tgtttgttag ggggtgaggt taatttttgg gtgtgggtga gggtggggat gagggtgggg 2580 

gtaggggttt ggagtttagg tatttgttgg ttattttttt tttttgtggt tttttatttg 2640 

atttttgtat gtttgtatta gtgggagggg gtagtttttt aattatttgt ttttttattt 2700 

ttttttattt tgtttttgat agagagataa ttttttttat tttagtggga gttagaggaa 2760 

aaaaaatata tttttgaaaa gtatttgttt atgttaattt ttttttttgt tgttaattta 2820 

ttaaaataaa gatgtgatgt tttttttttt tttttttaaa tattttttgt taaattattt 2880 

tttaatataa ttaatatttt ttttttatag taatttttgg gttaaatttg ttggaggagg 2940 

ggaaaaaaat ttattatttt ttgtaatttt tgtttgaaag tgttggttgt taaaagaaag 3000 

attgtttaaa aagtattgtt aaaaattatg tagtggaaaa agtttgaata taaaaagttt 3060 

gttaatagga tttgtttttg ggtggtggtt ttataatgtt atgtagtttt taaaattgtg 3120 

ttattgtgtg tgtattaatt aggaaaaaga tataagtata atttaaaaga ataaatattg 3180 

attttggttt taaattttaa aaagaaaaaa aaaattttta atttagtgtt tttaattttt 3240 

gttgaaaatt gttgttttta ttgttgatga ttgtggagta tataagttag tgttgtgttt 3300 

gaaagtggtg tattgttgat tgttttttta agtaggtttt aaaagtttag ggtttttttt 3360 

taggattttt tagaagtgtt ttattttttg ttatatgtaa tgtgggttat aatgtgttta 3420 

ttattagatt ttatgtatgt tgtatagttt ttaggttaga tgttttaagg ggtgtgtttt 3480 

ttgtgggaaa tatttttttt tttttttttt tagatttttt gtgttatttt gatttgtaat 3540 

gtaggttaat atataaatat tgggaggtgg gggagtttgt agatttaaat agtgatatta 3600 

gtttagatag ggtaaaaggt aatatttttt tttaatttga gtttagtttt gaggggatga 3660 

aatggtagat ttttattagg ttttatttag gataattaat tttagtgttt ttttttttag 3720 

tttgatgtgt tttatgttat tagtgttatt tttttttggt tttgttgaga tgttttgtag 3780 

atttttgtta ttttttgtaa agtagggtga gagttttaaa ggttgtgttt ttttgattaa 3840 

tttttaatat ggatttaata gatattaata agagtttggg ttttattttg taggtatttg 3900 

ttaaattgat ggaatgattt atttgagatt tgtatatttt attggatata aatgttgttt 3960 

taaaaatttt taaaaaaaga aaaattataa gtattgaatt t 4001 

<210> 394 
<211> 4322 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 394 

ggtaagttgt tttttttttt tttgagtttt agtttgttta ttattaaaat tggtattaat 60 

agtagttttt attttttgag aattgtttag attgtttata tttagtattt agtatatagt 120 

ttggtgttgg gtagttttta ttagtgatag ttatgattat ttattttttt tttttatggt 180 

gaattatatg gagtgggaat aaaggtttgg tggttggatt tggagttttt taagggttag 240 

tttttttttt ggttgtagtt tttggtatag ggtgggttta ttgtttatgt tgagggagag 300 

attgggttgg tttattgtgg atttgaagga ggagttagaa ggaaatttga ttttaagtgg 360 

gtaaaggtta tttgaggttt attttgttta tttttttttt tagtttaatt ttagggaagt 420 

tatggggtat gatgggtggg gtgggatggg attaggatat agaatatgtt aatgaatatg 480 

gttttttatt tagggaagtg ttgtgtatta taattgggta ttagggagga gggatatttg 540 

aatggatata attagggtag gtagaggttt gggggtgggg tgggaaatgg tatttaggtt 600 

tagagaatag tatgtttaag gtttgggtgt gaagagaaag ttttttttaa aggatagttt 660 

ttgaatttat ttggtttggt tttttttttt tatttatttt ttgtaatttt aattagggtt 720 

tttttatttg aataatgtgg tagttttttt gttttttatt tagttttttt tttgttttgt 780 

tgtttttttg ttttatttat tttgtaggag tttttttgag tttgttattt tgtttggttg 840 

taatagtttt tattattttt ttattgtttt gtgagaagat taaagtttaa tggtttgttt 900 

ttgttataag tttttttttt agtttaggtt attttaggga ttgtttttta tttttatttt 960 

ttgtgtttta gttgggaaga gttgtttaga ttttttttat tttttttagg tggtttttta 1020 

ggatttggka tttttttttt tgttttttgg gaagtgatat ttttttggat gagtatwgg 1080 

ttaatgtttt ggttttttga gttttgggga taggggaatg gattttttag aattttaatt 1X40 
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seq_prot . txt 

ggtttttaga ttttgagttt aggggatatt tggtgaatag atgaatgagt gaatgaatga 1200 

atgaaggagt tatggttgat aattaagttt tatgtttgga aaggaggttt aggtgtttgt 1260 

taggagtaat agagaggttt taaaggtggg ggtaggaaag tagaagattg ttaattgggg 1320 

taggggtaat tgttagtttg agtaggggta gtttttagaa ttgtttgagg agtagtgtag 1380 

tgtttgatta gtgagaattt ttgattttgg agtttaatta gtttagtttt tggggtagag 1440 

ttgggatagt gggtttggtg gtgggtattg tgtttggttg tgtttgagtt ttagttaata 1500 

ggaagagtag gagtggggtt aggagaggtg ggttagttgg ttagtgggga gttttgggtg 1560 

tgggtgtggg gagtggggga atttttgttg gtttttttta ggttttagtg ggtggaggga 1620 

gtttttttat tgggttttag tgtttttttt tgtaaaatgg gttttttttg agggttggtt 1680 

gtgtgttaat ggtgggagga tgtgaagagg gttaggtagg gtagagggat tgggagattt 1740 

ggtgttttgt gtaggtagag gtggagggat gttgtttttg attgtagttg gtggggtgag 1800 

gtttggtagg gtgaggtttg gtgggggtgg ggtttggtgg gggtggggtt ggtggtattt 1860 

gggtgaggtt gtttggtttt gtttttttgt atttttgtgt gttgtggtgt tggagtttga 1920 

ggtggttgta gtttatattt tttgagtgat ttttggtgtt tgtttgttgt gtggaggttt 1980 

gggttatatt ttattggttg tttggtttat tttagttagt gttgtgttga attttgtttg 2040 

tgtgtgttgg ggagtggtgt ttttgttgtt gttgttgtga tttttggtgt ttgtttttgt 2100 

aatgggaggt aagtgagggt tgggtttggg tgtgggattg gggttatttt gagggtgtgg 2160 

tgtttttttt tttttttttt tgttttagtt tttgtttttg aaattgagaa attgagttgt 2220 

ggttagtgaa agttttgttg tgttgagatt tgttggtgat tttgtttttt tgtttgtttg 2280 

tgtgtttttt tgttagatag tgggggttga ggtttttttg tttgtttgtt tttggttttt 2340 

ttatttgttt gtgtgttttt tgtttttttg aggggtttgt ttgttttttt tgattttttt 2400 

ttgggttttg tgtttgtttg tttgttgtta tttattttgt gttttttttt tgtgtgtttt 2460 

tttgagattt gatttttttg ttttttgtta tttgatttgt ttgttttttg ttttttgtgt 2520 

gttttattgt tttattttta tttatatttt tatttttgtt tgttgttttt ttgattttgt 2580 

ttttttggag ttgtgttttt ggtttatgtt tgttgttttt tgttagttta gttgtttttt 2640 

tgttttattg tggggtttag gttaggttgt ttgttgggtt ttgttttttg tttttgggtt 2700 

tttggaggtt tgatttgtgt ggtgtttttt ttttgttgtt tttttttttg gttttaagag 2760 

ttttttttgg tttttttatt tgtttttggg ttgggtgtgg ttttagtttt tattattttt 2820 

ttttttattt tatttttttg tttttttgag atttttttta gggtggtttt tgtttgatat 2880 

ggtttgtatt tgtttttaat ttagatgtta tttttgttgt gggggtttta ttagatgagg 2940 

ggaggaggtt gggggggggg gtttttttgg ttggtggtgg agtggtagaa ggagagtatt 3000 

attggaattg tttttatttt tttagttttt ttttttttgg ggtaggggag agttttattg 3060 

agttttgttt taaatttggt tagaaggatt tgttgtgagt gggtgggggg tttttttttt 3120 

tgatgtttat tttgtatgtg tggaagttgt ttttttgtgt taagttattg gtatttgggt 3180 

gtgtgggtat gttgtgtgtt tgtgttagtt tttgtgttgt ttgttattgt gggatttgtt 3240 

tgtgtatgtg tgtttttttt tgtatgtgtg atttgtttgt gtgtatgtgt gtatgtttgt 3300 

ttgtgttttg aggttttgtg tgtgtttttt gtaattgtgt gtattatttt tggtgtgttt 3360 

gttgttatta gatttgtgtg tgtgtgtgtg tgtgtgtgtg tttttgtttg ttagtgtgtg 3420 

tgtttttgtg tgtttggagg ttttggtgtt gtatattggg tatattatgg tgattaattg 3480 

gttatgagta tgttagattt gttaattgtg gaagtttttg gtgttgggtg tttatttatt 3540 

agttttatgt gtgagtgtat ttgtataagt gtgtgtatgt gtgtgtattt atgtgtggtt 3600 

ttttttttgt tattgagttt ttgtgattag gtttatttgg ttattaaggg aggggattga 3660 

tttttggggt ttttttgttt ttgttgtgga gtgagttggg gtttggtgag taggttagta 3720 

gagggaggtt gtatggtgta tgtgtgatat gtgtggttgt gtgtgtgtgt gtgtgtgtga 3780 

tagagaggtg tggtggtgtt tgggtagttg tatttggttg atgggtggtg tgagggttat 3840 

gtgatgggat agtgttgggg taagtttggg agtgtgtggt tatggtagga gtgtgtgatt 3900 

ggtgggatat agtttatgaa attgttggtg tatgtttggg gagtgattgt gattgtgggg 3960 

aattgagagt gtttgggtgt gggtgttagt ggtaggaggg ggtgtgataa ggtgtgtggt 4020 

gatagtgttg ggagggaagg tgagagttta gtgtgggggt gtgtgtgtga gtgtgtggga 4080 

gattatgagg atgtgggtgt gtatatggag agggttagag ggtaagtttt ggggtttgta 4140 

gtgttgtgat taagattttg ggttttgtat ttaagaatta tttgggtttt aatatttgtt 4200 

tgtgtttgtt aagagtaaga gatttaagtt aggtttttta aggttgagtg tttttatttg 4260 

tagaatggag agagtgagga taaaagtttt tgtgttaaag gttttttaat ataattttgg 4320 

tt: 4322 

<210> 395 
<211> 4322 
<212> DNA 

<213> Artificial Sequence 
<220> ,' 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 395 

- .. .. * 

ggttaagatt gtgttaaagg atttttaata taggaatttt tgtttttatt ttttttattt 60 
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attataaatt ttaagatttg ttttttagtt ttttttatgt gtatatttat atttttatgg 240 

ttttttatat atttatatat atatttttat attgggtttt tatttttttt tttagtattg 300 

ttattatata ttttgttata tttttttttg ttgttaatat ttatgtttag gtatttttgg 360 

ttttttataa ttatagttat tttttaagta tatattggta gttttataga ttgtgtttta 420 

ttagttatat atttttattg tggttatata tttttaggtt tattttagta ttgttttatt 480 

gtatagtttt tatattattt attagttagg tatagttgtt tagatattat tatatttttt 540 

tgttatatat atatgtatat atataattat atatattata tatgtattat atagtttttt 600 

tttgttggtt tgtttattaa gttttagttt attttatagt aggaatgagg aaattttagg 660 

gattagtttt tttttttagt gattaggtgg gtttggttat agaaatttag tggtagaggg 720 

agagttatat gtgagtatat atatatatat atatttatat agatatattt atatatgaga 780 

ttggtgggta ggtatttaat attaggggtt tttataatta atagatttaa tgtatttatg 840 

gttagttggt tattatgata tatttagtgt gtaatattga aatttttaga tatataggga 900 

tatatatatt agtaggtaag gatatatata tatatatata tatatatgag tttggtgata 960 

ataaatatat taagaatggt atatatagtt gtaaaagata tatatagagt tttaggatat 1020 

aaatagatat atatatatat atataagtaa gttatatatg tgaaaaagga tatatatgta 1080 

taggtaagtt ttatagtgat agatagtata gaagttgata tagatatata atatatttat 1140 

atatttagat attagtgatt tagtataagg agatagtttt tatgtatata gggtaggtgt 1200 

tagagaaggg gattttttgt ttatttatag tagatttttt tgattaggtt taaggtgggg 1260 

tttggtgggg tttttttttg ttttaagaga gagggaattg gggaagtggg agtagttttg 1320 

atggtgtttt ttttttgttg ttttattatt agttaggagg attttttttt ttgatttttt 1380 

ttttttattt ggtggagttt ttataatggg agtggtattt gggttggggg tgagtataga 1440 

ttgtattggg taggggttat tttgggaggg gttttagagg ggtggagggg tggggtggga 1500 

agggggatgg tggagattgg agttatgttt agtttagaga tggatgaagg gattgagaaa 1560 

gatttttgag attgaaagaa gagatagtga ggaggaggtg ttgtgtgggt tgggttttta 1620 

gggatttagg ggtgggaggt gaggtttggt gggtggtttg gtttgggttt tgtggtagag 1680 

tggagaggtg gttggattgg tagggggtgg tgagtgtggg ttgggggtgt ggttttggga 1740 

ggatgggatt gagggggtgg tgggtggggg taggggtgtg ggtgggggtg gggtgatgga 1800 

gtatgtggag ggtaagggat ggatggattg ggtggtgagg gatagaagga ttgggttttg 1860 

gagaggtgtg tagggggaga atatagagtg ggtggtgata gatggatgga tatagggttt 1920 

gggagagagt tgggagggat ggatgggttt tttggaggag tgggagatat atgggtgggt 1980 

ggaggggttg ggagtgggta gatagaagga ttttagtttt tattgtttag tggagggata 2040 

tatagatgga tagggggatg gagttattgg taggttttgg tgtggtggga tttttgttga 2100 

ttgtggttta gttttttagt tttggaaatg ggaattgggg tgggaagggg gaaggaaggg 2160 

tgttgtgttt ttggggtagt tttgattttg tgtttggatt tggtttttat ttattttttg 2220 

ttgtaggggt aggtgttaag ggttgtggtg gtagtggtgg gggtgttgtt ttttggtgtg 2280 

tgtggatggg gtttggtgtg gtgttgattg ggatgggtta agtggttagt gaggtgtggt 2340 

ttgggttttt gtgtggtggg tgggtgttgg gggttgtttg ggagatgtgg gttatagttg 2400 

ttttgggttt tggtgttgtg gtatatggaa gtgtgagggg gtggggttga gtgattttat 2460 

ttaggtgttg ttggttttgt ttttgttagg ttttgttttt gttaggtttt gttttgttag 2520 

gttttgtttt gttagttgta attggggata gtgttttttt gtttttgttt gtgtgggata 2580 

ttgagttttt tggttttttt gttttgtttg gttttttttg tatttttttg ttattagtgt 2640 

gtagttagtt tttagaaggg gtttatttta tagaggagga tgttgaggtt tggtggggag 2700 

gttttttttg tttgttgggg tttgggaggg gttggtgggg gtttttttgt tttttatatt 2760 

tatgtttggg attttttatt ggttagttgg tttgtttttt ttggttttat ttttgttttt 2820 

tttgttggtt ggagtttgga tgtggttggg tgtggtgttt gttgttaagt ttgttgtttt 2880 

ggttttgttt taaagattga gttggttaga ttttagggtt ggaggttttt gttggttggg 2940 

tgttgtattg ttttttaggt ggttttggga attgttttta tttggattgg tagttgtttt 3000 

tattttgatt ggtagttttt tgtttttttg tttttatttt tgagattttt ttattgtttt 3060 

tggtaggtgt ttgagttttt tttttagata tggaatttgg ttgttagtta taattttttt 3120 

atttatttat ttatttattt atttatttat tgagtgtttt ttaggtttaa agtttaggga 3180 

ttagttgggg ttttagggag tttatttttt tgtttttaga gtttagaaag ttaagatatt 3240 

agttttatat ttatttaaag aaatgttatt ttttaggaga taaaaaagag ggttttaggt 3300 

tttggggaat tatttagaag ggatagagaa agtttagata gtttttttta gttgagatat 3360 

aagggatgga agtgggggat aatttttgag gtggtttgag ttgagaaagg ggtttgtgat 3420 

agagataggt tattggattt tggttttttt atagagtaat gagaaggtgg tgaaagttat 3480 

tataattagg taggatgata ggtttaaggg ggtttttgtg ggatagatgg ggtagaggga 3540 

tagtagggta gaggagaggt tggatggggg atagaagggt tgttatgttg tttaggtggg 3600 

aggattttgg ttgagattgt ggaagatgga tggggagaaa aggttagatt aggtgggttt 3660 

aaggattatt ttttgggaaa gatttttttt ttatatttag gttttgagta tgttgttttt 3720 

tgggtttgaa tgttattttt tattttattt ttaaattttt atttgttttg gttgtgttta 3780 

tttaaatgtt tttttttttt gatgtttagt tgtaatgtat ggtatttttt tgagtgaagg 3840 

attatattta ttagtatgtt ttgtgttttg attttatttt attttattta ttatgtttta 3900 

tggttttttt ggggttggat tggaggaaag ggtaaataag gtgagtttta agtgattttt 3960 

gtttatttgg gattgggttt ttttttggtt ttttttttag gtttatagta agttaattta 4020 

gttttttttt taatataaat aataggttta ttttgtgtta gggattgtag ttagagggag 4080 

gattgatttt tgaggagttt taggtt;tagt tattagattt ttgtttttat tttgtgtggt 4140 

ttattatgga agaaggaaat gaataattat ggttattgtt gataggggtt atttagtatt 4200 

aggttgtgtg ttgggtgttg gatgtgggta gtttgaatag tttttaagag gtgagggttg 4260 
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ttgttagtgt tagttttggt gatgagtaaa ttgaggttta gagaaaggag aagtgatttg 4320 

tt 4322 

<210> 396 

<211> 8467 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 396 

atgaattaaa tttttttttg gtattaagaa gggatatata gtattttgag ttttgttttt 60 

gtttttttat taagtttttt ttaatagttg gaagtggttg aagggggttg ttttaaggag 120 

attttttgtt ttttttttga tagttgatta aaagaaaata ggttatgttt tttaaattta 180 

tgtttatttt gtttttaatt attgtttttt taaataaagg taatatttgt gtaattttag 240 

ttagtttgaa tttttggagg taaatagagt gtaatttgtt ttggaagaat tttgttatgt 300 

ttaaggtttt atgttgggtg ttggtttttt tgtttttaat tattttttta ttttgatttg 360 

gattgtagaa atttttaatt ttttgttttt tagttttatt tttattttta gatttggggt 420 

tgttaaataa tggaaatttt aggaagatgt aggtgtggtt tttaaagttt tggtttgtga 480 

gtgtttttag gttggttggg gtgttggttt ttttagaaat aaggtttgaa tatgtaggtg 540 

ttagttagga ttttttgttg tttgtttatt gttgtttttt ttttgggatt tgagagaagt 600 

gggagtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgttt tgtgttagga 660 

gaaaggtttg tggttgttgt tttagttggt ttgtttattt tttgttttta ggagaagttt 720 

gtttatttgg gatattagtg atttttttta tttttatttt attggtttta tatttttttt 780 

attttttttt aagttgttga tgtagaaagt atttggttat tttgaaatgg tagtttttgg 840 

aattttagtt tugttttgta atttagtagg gttttttgga agttttataa ttgagattta 900 

tgggtttatt gggtaattaa atgatgttat tgtgttaatt ttgtatgtat tattgttgtt 960 

tgtgttgttt gtgggttgtt ggttggttgt agtttgttgg tgatgagggt attatagttg 1020 

ttttgattgt gtagattatg tatgttttgg ttttgggaat ttattatgta ttagaatata 1080 

ttagtgtttg tattttaaaa ggttaaatta ttggttttta gttagggatt tttggtaagt 1140 

ggtttgttag tgatgagtgt ttgtttatat tggtatatag tggagttttt tgtttttggt 1200 

ttatttgttt ttaggaaagt tttggggtga ggtgaaggtg attgaagtaa tgtttttttt 1260 

tttagattgt agttgtttag gggggatata gtatggtatt ttttattgaa ttttttttgt 1320 

ttgtttgtat tttagttttt ttttttttag tgattttttt attttttttt tttttttttt 1380 

ttttgatgtt tgattttagt agtaaaggag gtaaaaaagg tattgagttg ttagttaaat 1440 

ttgaaaagtg tggttttgtt ttttttatag ttattggtag ttttttgtgg aaggtttgtt 1500 

ttttggggta gttgtggttt tggagtggtt gtgtttggtg tttgttgggt gtggttttgt 1560 

tttaggtttg ggagggtagg ttggttgttt tggtgagtgg tagagttttt ttggatagtt 1620 

tttgtttatt taaatagaag atgttggtgt tggagtgggt ttggatatgg tgaggttgtg 1680 

agttggtttg agtggtgggg tttggtgatt tttttttttt tttttgtttt tttttttttt 1740 

ttgtatgtat gttttgtttg tttttatttt gtttttattt tgggtgagtt tgtttgtagt 1800 

ttggggtgta tatttgtatg tgtatttttt tttatttatt tttgtgtttg tttttatttt 1860 

tgtagttgag ttttgttatg tgtgttttgt ttgtttgttg gttgtttttg ttgtttttgt 1920 

tgtttttggg ttttgatgga ttgaatgaag gttgtttata ttgtttattg atgttttatt 1980 

aaagatttag aaggttgtgt tatgaatttg gagttgataa tggaaagttt gggtattttg 2040 

tatgggttgg ttggtggtgg tagtggtggg ggtggtggtg ggggtggtgg gggtggtggt 2100 

gggggtttgg gttatgagta ggagttgttg gttagtttta gtttttatta tgtgggttgt 2160 

ggtgttgttg gtttgttgtg gggttttttg ttgtttttaa ttgtgtatta ggagttgggt 2220 

atggtggtag tggtggtagt ggtggtgttg tgtttggtta tggttattag tatggttttg 2280 

attttggatg gtggtgatta ttggtttgag ttttttattt tgttgtatta tgttatgaqt 2340 

atgttttgtg atttgttttt gtttggtatg ggtatgagta atatttatat tatgttgata 2400 

ttgttttagt tgttgttatt tatttttatt gtgtttgata agttttatta tttttatttg 2460 

■taj-cajtaj-t tgtattatta ttattattat tattattagt gtttgtttgg taatgttagt 2520 

ggtagtttta tttttatgtg tgatgagtgt gggtttttgg ttatgaataa tttttatagt 2580 

ttttataagg agatgtttgg tatgagttag agtttgtttt tgttggttgt tatgttgttg 2640 

ggtaatgggt taggtggttt ttataatgtg tagtagagtt tgtttaatta tggtttgttg 2700 

ggttatgata aaatgtttag ttttaatttt gatgtgtatt atattgttat gttgatttgt 2760 

ggtgagtaat atttgttttg tggtttgggt attttatttg tggttatgat gttgtatttg 2820 

«^SS5H 9 5 <£H ttt9gQ tta *atttag ttttatgggt tggtgttggt atttagttgt 2880 

fagtggttat ttttgttttt attgggtttg taggtggtta tgttgggtta gttggaagaa 2940 

attaatatta aagaggtggt ttagtgtatt atagtggagt tgaagtgtta tagtattttt 3000 

i^gtgattt ttgtgtagag ggtgttgtgt tggttttagg ggattttttt tgatttgttt 3060 

tggaatttaa aattgtggag taaatttaaa tttggtaggg agattttttq taqqatqtaa 3120 

aagtggtttt aggagrttga gttttagtgt atgtttgttt tatgtttqgt aggtaaqqtt 3180 

9999^9" aggggttagg ttgttgggaa gagggttttg ggtttggtgt ttgtggttta 3240 

391*^*9^ 3300- 
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ttttttttgt ttttattttt ttttttattt 
tttttttttt tttttttttt tttttttatt 
attttttttt attttatttg tttttttttt 
tttttaggta ttgggaggtg ggatgggggt 
taagatttat agaaattagg atttgttttt 
ttagataata tattttaatt ttttgggttg 
ggttttgttt tgtagaggga aaagagtttt 
tatttggtta ttggttgaag gtatagtttt 
aatagtgttt ttggagttta tttttggttt 
tttgaggggt ttgggttagt tttaatgtta 
gagaaggtta tatttattgt tttgggagga 
tttgttttgt ttattggttg tttttgtttt 
tttgttgggg ttttgggaga ttttgagtat 
tttaggttag ttaatgtatt tttgttgttg 
tgttgattgt gtgtgtttgg tgattgtggg 
tagaggtgtg ggttatattg ttttggagtt 
taagttgtga ggtatagttt tttattgttt 
tgggtgtgtg agttagaggg aaagatgtag 
tttttgtgtg tatggatttt gtgtggtgtg 
ttatgggttt agaggggtaa aatgtttagg 
atggtaagtt agggtgggtg attttttatg 
tggtgggtgg tttaggggaa taatttgtgg 
gatgtgtgtg gttttgagtt agttttgtat 
tttatttttg attttttggg ttttttttgt 
gattaatgtt tgatagggtt ggggaatgtg 
gggggagttt ttgttttgat agtatttttt 
agttggtaat tgttttgtag tgttgattta 
aagagttttt tttaaattgg tggattttgg 
attttttttt tttagtttat tttttatttt 
tttggggtaa gataaagtag ttagtagaga 
gagattggtt gatagttttt aatattttgt 
tttggtgggt tttagtattt gttggttttt 
gttggtttta ggggtatagt taaaggtagg 
tgttgttgtt tttttatgtt ggttgtgtaa 
ttgtttatat aggtttgaaa gtgatgagga 
gtttttaaaa tgattttaag tgttgttgag 
agttttattt gtgtttatgg tgtaagtttg 
gttgttagat tttttgtaga ggattgttga 
tttagtgagt ggaaaaggtt tgttgaggat 
ttgttttatt attttgggtt gaaaagggat 
tattagtaat tataagtgtg ttgagtggtt 
gttagtagat tagttttttt agatattgat 
tgtgtttgga gtgttagggg ataggagtaa 
ttggagtatt gattggaatt tttgttggta 
gggggtttaa taagtttagt tgttttgtag 
tggaagatat tgtttttgtt ttttttttgt 
ttataggttt tttttttttt ttattttggt 
taggttgggg tttagggggt ttagaatgaa 
ttatttttag gtgaggttat aatagaggta 
tatagttaag ggtggtattt gaaaggaaaa 
tttaatgtta gattttagag aaattgattt 
ttaagtggta ggtggttgta attttgtttg 
agtttttgta gggtggtttt tagtaatttg 
gtttgagtat atatttttgg gtgttgggtt 
aagggagtga gagaatttag gttagaattt 
ttttggtggt aggtatgttt ttttttgatg 
tttattgtag aggtttttgt ttttagtgag 
ggtttttagt ttttgttgta gtttgatgta 
gttagtttga agttaatttt gtttaggtgg 
ttgtttgggt tatttgatgt ttgtagtggt 
tttgttaggg gtattgtgtg tgtgtttgtt 
tatgggttgt tgggtatttt ttaagttgag 
ttatttttta attttttatt ttgttttttg 
tttggtggtt tagttaggga ttagagttgg 
aatagatgat atgtttgttt gttagttgtt 
ttgtgttttg ttaagttttg ggtgttggga 
tgggagtggg tagaggaggt gggatttgag 
gtttaggtag ttaggtgttt ttgatgtgag 



secj prot . txt 

tttttttttt ttttgttttt tttttatttt 3360 

ttttttttgt ttttgagttt ttattgattg 3420 

aatgtgttaa tttttgtttt atttttgatt 3480 

gtgtgttttt ttttaggagt tttgtttttt 3540 

atttaaaatt ttatgtattt taagtttttt 3600 

attagttttt ttgtgtagag gtagttgaga 3660 

ttattttttt atttattata taggtaaatt 3720 

gtttttgtgg ggaattggtg gttaggatat 3780 

tggtgttggt gtagggattt tttgattggg 3840 

ttatttatag tgagggtagg gtgtaaggtt 3900 

tgtgggagaa gagattgagg tggaaagtgt 3960 

ggttttagtg tttgttggga tttgttagga 4020 

ttgtaggaag aggtgttgag aaattaaaaa 4080 

gttgtaggtt ttgtttttgt attaagtggg 4140 

gaggattggt ggtttgtggg aggggatggg 4200 

ggtttggttt tttgtgtttt ttttagtggt 4260 

taggagtata gaaatttttt gtgtgggtgg 4320 

tagttattgt gattggtatg tagttgtgtg 4380 

tgtggtgatt gtgttgtttt taggagtaag 4440 

ttttttgttg ggaaggatat attatatttt 4500 

gattgggtgg aggggggtat tttttaggat 4560 

tggtgatgat ttgtatagtg tgggttttgg 4620 

agtttgtttt tggagttgtg agtttaggtt 4680 

attgttgagt ttagtttgtg gggtgtattt 4740 

ataggtagta ggtttatttg ggtttgggga 4800 

tttgttgttt gttggtggat ttttattttt 4860 

agaaggtaaa gaaaattagg tttttttgta 4920 

atattttgag tggatttaga aatttatgta 4980 

tttttatagt tttttttgat ttgttgttgg 5040 

gtgataataa tagtggtggg aaatgaattg 5100 

tatagatttt tttgaatgtt ttaggttgtt 5160 

tgggtattgg ggattagaag gaatttggta 5220 

atgatagtta tttttttgtt tattttagag 5280 

agaatatagt ttttaaaaaa tatgtgtttt 5340 

aagtaatgtt ttgtttatta gtgagtttta 5400 

atgagaaagt gtggtatttt gggggttttt 5460 

tagggatagg tttgggatag tattgtttat 5520 

agttgttttt gtgggagata gaatgttttt 5580 

tttgttttgt ttgagtattt aaatgtgtgt 5640 

aagagtttta gtttttttat ttggttattt 5700 

attattatat aggaggtttt ttagtttggg 5760 

gtagaagttg ggattggtaa gtaggtatta 5820 

atggagaaga aaagtggagg ttttttttgt 5880 

tgttgtagag ggtttttgtt gttgggtttt 5940 

gtggtttggt tggattttta gattggtgtt 6000 

taaatttgtt tttttttttt tttttatagg 6060 

tttgtttttg ggttttgtta aatagttaag 6120 

gaggtttgat ttggttagtg ttggtaaagt 6180 

ggtttttttt gtttagtttg ttggtgtagt 6240 

gggagaaaat tttggagaaa tttagattgt 6300 

taaatgtatg gatttgtttg gaaagggtgg 6360 

gttgggtttt tgtagaggtt ttttaagatt 6420 

ataagaggtg gttaagataa atttttgtgg 6480 

ttagagattt ttaaattaag tataaataag 6540 

gtatgggtat tttattgagg aaaagtgagg 6600 

tttgaaaatt gagttgagtg tttgattata 6660 

tttggatttt ttagtggttt gtttggagtt 6720 

tggttttttt ttggtagtaa gtttttagtg 6780 

ttgagggttt ttagttaatt tattatgatg 6840 

gggatatggt ttgggtagtg tgtagtggtt 6900 

ttttgttgtg ttggggatgt ttttgggtga 6960 

gaaatggatt ttttttgtag agttttgtgt 7020 

ttgttagggt ttttgattta gtttattttt 7080 

agaggttgaa tgtaatttgt gttagtatgg 7140 

tggatgaata attgaaaagt ttgttgtagt 7200 

gaatattttt ttaatatgta ttagggtggg 7260 

ggaggagagt gaatttgagt aggagaagta 7320 

aggttgggta tttattttta ttttaggttt 7380 
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tttattgtgt ggttatgtta tttttttaaa taaatgtgta tatggaggga gattgatgtt 7440 

gataatgttt agaagattaa aagagtatta atgttggtaa taataatgta aatgtgtgga 7500 

tttagatttt attgatttgg aatttgattt ggtgtgtttt tagtaagttt gatggtgtgt 7560 

tttttttagt agagtgttta ttagtgttat ggttttgtgg ttttttagtg gtgttgtttt 7620 

gttagttttg tgtgggtttt tttgtttgat tgtagttttt tttttgtgag gttttagttt 7680 

gttttatttt tttgaggttt tttttttttt ttgtggggtt ttttgttttt tgtatttttt 7740 

tttttgattt ttgtattatt tgtttttgtg tgtatatatt gttatttgtg tttttggtga 7S00 

tttgtttggg tggttgggtt tgtgaagtta atgtgttgaa tggtgtttga gtttttttaa 7860 

ttattttgtg tttggttgtt gttattgggt tttggtgatt aagtttaagt ttgaaaatga 7920 

tgtggttaaa gttgtttgtt aatggtagag tttaattagt tgtagatttt tttttatttt 7980 

tattggtttt tgtattaaaa aggtttatgt gtagattttt tatgtaggtg tatttgttgg 8040 

ataattaaat gtggttttag taataaaagt ttgagtttta tttttttgag tagtaggttt 8100 

tgtgttggat tggttgggtt taatagggag aattttgtgt tttattttgg ttgggatagg 8160 

aagtaagaga agtaaattgg ggaatttttg ttttattttt ttttattttt tggatgtttt 8220 

gggttgaggt ttgggttatt ggtttattgt gggtaaggag gtgttttgtg gaatatttgt 8280 

attgattgga aaagaaagaa ttttaaagtt tttttttttt gtttgtgggg gatattatta 8340 

tattttgtta attatttttt tggttaaatg gtggtttgtt tttttagaag gagtattaag 8400 

gtgaatttta tggaaaaaaa ataaaagatt agggatttaa gtgggagtag gatgagaatt 8460 

aattttt 8467 

<210> 397 
<211> 8467 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 397 

aagaattagt ttttatttta tttttattta ggtttttgat tttttatttt ttttttataa 60 

aatttatttt agtgtttttt ttggaaaaat aagttattat ttggttaggg gagtggttgg 120 

tagggtgtgg tggtgttttt tataagtggg aaggaagagt tttaaagttt tttttttttt 180 

agttaatgta ggtgttttat aggatatttt tttatttgtg atgggttagt gatttgggtt 240 

ttggtttagg atgtttaggg ggtgggaagg ggtggggtag gggtttttta gtttgttttt 300 

tttatttttt gttttagtta aggtaggata taagattttt tttgttagat ttgattagtt 360 

tagtgtagag tttgttattt ggagggatga agtttaggtt tttgttatta gggttatgtt 420 

taattgttta atgaatgtat ttgtgtaaaa agtttgtgtg tgagtttttt tagtatgaga 480 

attgatgagg ataggagggg atttgtggtt gattaagttt tgttgttagt aggtagtttt 540 

agttatgtta tttttaaatt tgggtttagt tattagggtt tagtggtaat ggttaagtat 600 

aggatagtta ggaagatttg ggtattgttt agtgtattgg ttttgtggat ttagttgttt 660 

aggtggattg ttggaagtgt aagtagtggt gtgtgtgtat aggggtgggt ggtgtagagg 720 

ttgggggagg gggtgtagag ggtagagagt tttgtgagag gaggagaaaa ttttggggaa 780 

gtaaggtggg ttggggtttt gtgggggagg agttgtggtt agatgggaga atttgtgtag 840 

agttggtgaa gtggtattgt tggaaaattg tggggttgtg gtgttggtga gtgttttgtt 900 

gggaagagtg tgttgttggg tttattggaa atgtgttgga ttaagtttta gattaatgaa 960 

atttgggttt atatgtttat gttattgttg ttagtattaa tgttttttta attttttaaa 1020 

tattattagt attgattttt ttttatatat atatttgttt gagaaagtga tataattata 1080 

tagtggaaag tttggaataa aaataaatgt ttagtttttt gtattgaggg tgtttggttg 1140 

tttgggttgt tttttttgtt tgggtttatt tttttttttt gggttttgtt ttttttgttt 1200 

gtttttgttt attttgatgt gtgttgggaa ggtgtttttt tggtatttgg gatttgatga 1260 

agtatagatt gtagtgaatt ttttaattat ttatttaagt agttagtagg taaatatatt 1320 

gtttgttttg tattggtatg ggttgtgttt agttttttta gttttgattt ttaattaaat 1380 

tgttaggaga ggtgggttga gttgggagtt ttagtagtga gaaatgaggt gggaggttgg 1440 

ggggtgggtg tgagattttg tgaagggggt ttgttttttt ggtttgggag gtgtttagtg 1500 

gtttgtgtta tttaaggatg tttttggtgt agtgggagat aagtatgtat gtggtgtttt 1560 

tagtaggagt tattgtgtat tgtttgggtt gtgttttgtt gttgtgggta ttaggtggtt 1620 

taggtgatat tatggtgggt tggttaaggg tttttggttg tttgaataga attggtttta 1680 

gattggttgt tgggagtttg ttgttaggag agggttgtgt gttggattat ggtggggatt 1740 

ggagattagt tttgggtagg ttgttggggg atttgggttt attggaggtg gaaatttttg 1800 

tagtaaatgt agttgggtgt ttggtttgat ttttgggtgt tggaagaaaa tatgtttgtt 1860 

SSHSSSiS: Wttttttt tagtgggatg tttgtgtaag ttttagtttg ggttttttta 1920 

tttttttttt gtttgtgttt ggtttagagg tttttggggt ttaatgtttg agagtgtgtg 1980 

HH^i^S H2 aaatttg ttttggttgt tttttgttag gttgttgaaa attgttttgt 2040 

?»?§2££??S H3§99§2 at t " t 9tgaag gtttgattgg gtggggttgt aattatttgt 2100 

JSEH 01 ^ 9 "H ttttga atgaatttgt gtatttggag ttaatttttt tgaaatttaa 2160 

SSS&flS ^ISH^H" ""Saagtt tttttttttt tt tttttaa g tgttattttt 2220_ 

ggrtgtgatt-atettggi^ff-gti:gggt^g^aagga^«g£ ttttgttgtg~attttgttta " 2280 
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agggtgagtt ttgttggtgt tggttaaatt agattttttt attttgagtt ttttaaattt 2340 

tggtttgttt ggttgtttgg taggatttgg gggtggggtt aaaatgagag agaaagggaa 2400 

tttataattt atgagagaaa gagaagaggt aggtttggtg ggaggggggt agggatggtg 2460 

ttttttaggt attggtttga gagtttggtt gagttgtttg tggagtggtt gggtttgtta 2520 

aatttttggg gtttaatggt gagggttttt tgtggtgtat tggtggggat tttgattgat 2580 

attttgggtg gagaaagttt ttgttttttt tttttatttg tttttgtttt ttagtgtttt 2640 

gagtatataa tatttattta ttagttttag tttttgtatt agtgtttgaa gagattgatt 2700 

tattgatttt aaattgaaaa gttttttatg taataataat tatttaatat atttgtagtt 2760 

gttgataaaa tggttagata aaaaggttaa agtttttgtt tttttttaat ttagggtaat 2820 

aaaatagata tatatttgaa tgtttgagta aagtggggtt tttagtaggt tttttttatt 2880 

tgttggagga ggtattttgt tttttatgaa ggtagtttta gtgatttttt gtggaaaatt 2940 

tagtagtgtg ggtagtgttg ttttgggttt gtttttgtag atttgtatta tgggtgtggg 3000 

tggggttgag gatttttggg atgttatgtt tttttatttt agtaatgttt gggattattt 3060 

taaaagttga aatttgttaa taggtgaagt attatttttt ttattatttt tagatttata 3120 

tgggtagaag gtatgtgttt tttaaaagtt gtgttttttg tgtgattggt atgagggggt 3180 

ggtagtgttt tgagatgagt aggagaatag ttgttatttt gtttttaatt gtatttttaa 3240 

gattagttgt taagtttttt ttggtttttg gtgtttaaga agttggtggg tattaaaatt 3300 

tattagagat agtttgggat atttgggggg atttatgata aaatgttaag aattgttagt 3360 

tagtttttag tttatttttt gttgttatta ttattgtttt ttattggttg ttttattttg 3420 

ttttgaatta atagtaaatt agagaaaatt gtaggagagg atgaaaaata aattaaagga 3480 

gaaagattat ataaattttt aaatttattt aaagtatttg gagtttgttg atttggggga 3540 

ggttttttgt agggaaattt agtttttttt atttttttag attaatattg tggggtgatt 3600 

attgattggg aataggaatt tattagtaga tggtaaagga aaatgttgtt aaagtggaaa 3660 

tttttttttt ttaagtttgg gtgaatttgt tgtttgttat attttttagt tttgttgggt 3720 

gttggttgag tgtattttat aagttgggtt tagtggtgtg aggaaagttt gggggattgg 3780 

gggtggaaat ttgagtttgt agttttggaa gtgagttgtg tgaggttggt ttggaattgt 3840 

gtgtatttta agatttgtgt tatgtaaatt attgttatta tgaattgttt ttttagattg 3900 

tttgttgatt ttgaaagata ttttttttta tttgatttat gggaagttgt ttattttggt 3960 

ttgttatagg gtatagtgtg ttttttttag tgggggattt ggatattttg tttttttggg 4020 

tttgtggttt gtttttaggg gtagtgtagt tgttatgtat attgtgtggg gtttgtgtgt 4080 

ataaaagtgt gtaattgtgt gttagttgta gtaattattg tatttttttt tttagtttgt 4140 

gtatttgttg tttatatagg aggtttttgt gtttttagaa taatagaggg ttgtattttg 4200 

tagtttggtt: gttagaggag gtataaagag ttgagttggt tttagaataa tgtaatttgt 4260 

gtttttattt gttttttttt gtgggttgtt agtttttttt gtggttgtta ggtgtgtata 4320 

attagtgttt gtttaatgta aaggtggagt ttgtagttgg tggtagggat gtattaatta 4380 

atttgaattt ttaatttttt agtatttttt tttgtgagtg tttagggttt tttggagttt 4440 

tggtaaattt tggtaaattt tagtaaatgt tgggattggg gtaaggatgg ttggtgagta 4500 

aggtaaagtg ttttttattt tagttttttt ttttatgttt ttttaaagtg gtgaatgtga 4560 

tttttttaat tttatatttt gtttttgttg taggtagtga tattggagtt ggtttgagtt 4620 

ttttaagttt ggttagaaag tttttgtgtt aatgttaagg ttagagatgg gttttaggag 4680 

tgttgttgta ttttggttgt tggttttttg tgggggtaag gttgtgtttt tagttaatga 4740 

ttaaataagt ttgtttatat ggtgggtggg agagtagaga gttttttttt ttttgtagag 4800 

taaagttttt taattgtttt tgtataggga gattagttag tttggaaaat tgaaatgtgt 4860 

tgtttaaaag agatttgaag tgtatggggt tttgaataag ggtaggtttt ggtttttgtg 4920 

ggttttggaa agatagggtt tttagaggaa aatgtatatt tttattttgt tttttagtat 4980 

ttgggaagat tggaaatagg gtaaaggttg gtatattgag gggagggtga atgaaatggg 5040 

ggggggttgg ttaatgaaag tttagggata aggagagagt aagaaagaaa aagaaaaggg 5100 

agaagggaaa gtaggggaag agtggaagag aaagagaaaa tggaagaaga aataaaaatg 5160 

agaagaaaga ggatgtgtat aggaaagaga aggaaagaat taagagaagt gatttggtgt 5220 

gtagatttgg gttataagta ttggatttgg agtttttttt ttagtagttt ggtttttggt 5280 

tagttttggt tttatttgtt aggtgtaagg tggatatgtg ttggaatttg ggtttttgaa 5340 

gttattttta tattttgtgg aaggtttttt tgttagattt gagtttattt tatggttttg 5400 

gattttggag taggttggag agagtttttt gagattggta tagtattttt tgtgtaaaga 5460 

ttgtttgggg gatattgtag tgttttagtt ttgttgtgat gtgttgggtt atttttttgg 5520 

tgttgatttt ttttagttgg tttgatgtgg ttatttgtga gtttgatgag gatgagggtg 5580 

gttgtttgtg attgggtgtt agtattggtt tgtgagattg agtgtggttt gggtggtgta 5640 

ggttgtttag gtgtgatatt atggttgtag gtggggtgtt taggttgtgg gataggtgtt 5700 

gtttattgtg ggttagtatg gtagtgtggt gtgtgttgaa gttggggttg agtattttgt 5760 

tgtggtttgg tggattgtag ttgggtagat tttgttgtgt gttgtggagg ttgtttagtt 5820 

tgttgtttag tggtgtggtg gttagtgggg ataggttttg gtttatgttg ggtatttttt 5880 

tgtagggatt gtagaggttg tttatggttg ggagtttgtg tttgttgtgt atgagggtga 5940 

agttgttgtt gatgttgttg gataggtgtt ggtggtggtg gtggtggtgg tggtgtggat 6000 

ggtggtgtgg gtgagggtgg tggaatttgt tagatatggt ggagatgggt ggtagtggtt 6060 

ggagtggtgt tagtgtggtg taggtgttgt ttatgtttat gttaggtgga gatgagttgt 6120 

aggatatgtt tatggtgtgg tgtagtggga tggagagttt gggttggtag ttgttgttgt 6180 

ttaggattga ggttatgttg gtgattatgg ttgagtgtga tgttgttgtt gttgttgttg 6240 

ttgttgtgtt tagtttttgg tgtgtggttg gaggtggtgg agggttttgt agtgagttag 6300 

tggtgttgtg gtttgtgtgg tgggggttgg ggttggttag tagtttttgt utatggtttg 6360 
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ggtttttgtt 
gtttgtgtaa 
ggtttttggt 

gggggtggtg 

ggttgtggga 
gttttgggtt 
tgtgtgggag 
gttggtttgt 
gagtgggagt 
atttggggtg 
tttgggaggt 
tttttaggtt 
gttaaggaga 
tgagtgagtg 
atttggggga 
tgagtaagtt 
ataagttatt 
gtgttaatgt 
gttagagtaa 
tggtgtgggt 
gagtttataa 
tagaattttg 
aagaagtggg 
agtgagtaaa 
attttttttt 
tatttttgtt 
tggttgatat 

agaatgttta 
tttggtaatt 
tgtagtttga 
gttttaaatg 
aagttagttg 
gtggatataa 

agggggtttt 

agaaaataag 
gatttat 



gttgtttttg 
agtgtttaga 
gaggtattga 

ggggtggtgg 

gtgggggtgg 
gtgggtgggt 
aggggagggg 

agttttgtta 
tgtttagaag 
gggttatgtt 
gggtttttta 
tggttgatgg 
gggagggagg 
agagagattt 
aggggtattg 
gggagtaaaa 
tattgagagt 
attttaatgt 
ttgtagtgtt 
agtagtaatg 
attttagtta 
ggagttgttg 
gagagtgtag 
ttttttttgg 
taatatgaga 
ttttttaaat 
ttgtatattt 
taagttaagg 
ttaggtttgg 
gttggggtgg 
taataagatt 
aggttgtgta 
gtttgagaag 
tttaagatag 
ggtagagttt 



ttgtttttgt 
ttttttattg 
taggtggtgt 
gggtggttgg 
gtgtgggagt 
ttgtttgggg 
atggggaggg 
tgtttgagtt 
ggttttgttg 
tgatgggtgt 
tgggaagtta 
tttggtgttt 
agaaggtggg 
ggtgaaagat 
ttttaattgt 
gattttgtta 
ttttagttgg 
atggtaaatt 
tttgttgtta 
tatgtaaagt 
tgaggttttt 
ttttaaggtg 
aattggtgag 
gggtagagag 
tatatatatg 
tttgggagag 
aaattttgtt 
ttttaaaaat 
aggtgggagt 
aggggtagtt 
tttttaaagt 
gatgttgttt 
tgtgatttat 
ttttttttaa 
ggaatattgt 



seq_prot. 
tgttgttttt 
ttagttttgg 
aggtagtttt 
tgggtgggta 
gagtggagag 

tgggggtggg 

agggagggat 

tgttttggtg 

tttgttgggg 

taagtgtaat 

ttagtggttg 

tttttatttt 

ggggttgttg 

gttgtgttgt 

ttttgtttta 

tgtgttagta 

gagttaatag 

tttgaggttg 

gtgagttgtg 

taatataata 

gaagagtttt 

attaagtgtt 

gtggaggtga 

taagtaaatt 

tgtatatata 

aaatggtggt 

tttgaaaaga 

tgtatttatg 

ggaattgggg 

gaagatagag 

aggttgtgtt 

ttgtttaaag 

ttttttttga 

ttatttttag 

atgttttttt 



txt 

gttattgttg 
gtttatggtg 
tatttagttt 
aggtgtgtgt 
gagtgtgtgt 
gtgggggtgg 
tattgggttt 
ttgatgtttt 
tagttaattt 
tattttgagg 
tggagggggt 
ttttgttatt 
gggagaggga 
gttttttttg 
ttttaaagtt 
tagataaata 
tttagttttt 
ggatatgtat 
gttggttggt 
atattattta 
gttagattgt 
ttttgtatta 
ggagggttgt 
agttgagatg 
tatatatata 
gggtagatga 
ttagtgtttt 
tttttttaga 
gatagagagt 
aggttaatat 
ttgtttgttt 
agatagtaat 
ttagttgtta 
ttgttagaga 
ttggtgttag 



ttgttggttg 
tagtttttta 
attagggttt 
ggtggggttt 
gtgggtgtgt 
atggggtgtg 
tgttgtttgg 
ttgtttgggt 
atttttttgg 
ttgtagttgt 
ggggttgtat 
ggaattaaat 
ggttggagtg 
agtagttgtg 
tttttggagg 
tttgttatta 
taaaatgtag 
aatttatgtg 
ggtttatggg 
attatttggt 
aaaataaaat 
gtgatttgga 
tggtgtttta 
gtaattatgg 
tatatatata 
taaggaattt 
ggttagtttg 
atttttgttg 
tggagatttt 
ttggtataaa 
ttgaagattt 
taaggatagg 
aaagaagagt 
aaatttgata 
agaaaggttt 



<210> 398 
<211> 6456 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 398 

taaaattttt 
ttgttttttt 
taatttttaa 
attggaaatt 
ttgttattta 
aatgtttatt 
tgtttttttt 
agtttatgtt 
gggttgtaga 
taaaatagat 
ttattaaaat 
gttggtggta 
ttttttttgt 
ttttatttta 
ataaattgaa 
ttggtgggga 
tagtttggga 
ttatttttaa 
ttttttttat 
taaaatttat 
ataaaggtaa_ 



taaaattgaa 
taagttgttt 
aattagttaa 
tttatttata 
taggttttta 
ttttggtatt 
aaaaaagtat 
tgttattttt 
tttatgaggt 
atttgttttt 
tatatatatt 
ttgtttagag 
tttatttttt 

aaaaaagaag agaaaaagta 
aaaaaaattt ataaaagatt 
gggggtgtgg agtggagatt 
tttttagtat ttggtttggg 
attttagtta ttaatataat 
ttttttaaaa aattggtttg 
tattgtaata ttagttttat 
_tgtaatttat_ag.tttttaagi 



aagaagaaag 
agtttttatg 
tgttttggtt 
agtttggggt 
ttttggtttg 
tgtttgttat 
atattttaaa 
aggaatagtg 
tttgttttag 
tgatggattt 
ttgtaaagaa 
ttataaattg 
ttaaaattta 



gggtaattgg 
aatagtattt 
tttagtattg 
ggtgggtggg 
aagatttaat 
tttttttttt 
gtaagaatgt 
tgtggattag 
tgttaaaggt 
tgttgttttt 
aaagggaaat 
tatttttaaa 
gattgtaaaa 
aagtgttata 
aaattttggt 
ggttttttga 
gttatttatt 
tattgtagaa 
gttgtgtttt 
tttttgttag 

atatttatta- 
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agaattttta ttttttttgg 
agttttattg ttattattaa 
gaaagttttt taaataggat 
gtggggaggt ggagagagag 
tgtagttatt agagtaaggg 
tttagagata aatatttttt 
gattttattt tttttttttg 
ggttagatga attttaattt 
ttttggtagt aaatagtgag 
tttttttttt tttaagttat 
tggtagtttt tgtagaggaa 
tagttttttt tttggttttt 
aatatattta ttgatattta 
agattttttt tttggaaatt 
gggttgtggg gtggtggggg 
ggtggttagg ggttttgtga 
tgtttaattg ttaggatttt 
gggaatataa tatagagggt 
tgttttttat gggggagttt 
ggttttaata atatggtatt 
tggttattat— atttggtagt 



6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8467 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 



seq_prot . txt 

ggggtggttt ttagtttttg ttgttttttt gttttttttt tttgtttgtt tttggagttt 1320 

agttgatttt tgaggtttta attttattta tttttttttt gggttgttgt tgttgtgttt 1380 

ttttttattt ttattttttt gaggagagtt ataggttgta aatttaatta attttgtaat 1440 

ttatttttgt aaaattattt ataaagattt tttttttgtg tttgtgtttg ttttttttgt 1500 

gttgggtttt tttagttatg gttataaagt gttttttttt ttttttgagt tttgtatata 1560 

aggaatgtgg gttggggttt tgtttgtttt tttttttgtt taaggtaagg attttgggaa 1620 

tttgaagttt ggtgtttatt atgtttaggt ttgtagtttt ttttttatag agtttgtatt 1680 

atgggaaaaa ataaaataaa atttaggaaa gggaggtaat agttattggg agttaatata 1740 

gagttatgta gtgtttaaaa tataaatatt gtagtggtta gaaattttgt tatttttttt 1800 

gtttttttag gttgttttgt tgaggttttt tgagtttttt tgtatattga aaggtattgt 1860 

aggtgtagtg tgtatttttt ttttatttat tttaagaagt tttgttttgt tattagtttt 1920 

tttttttggg atgagtaggg agagtgtgtg gaggtttttg atttttttga ttataattaa 1980 

gaaagaataa tttttaaagt gtttaatatt tttgttttta agttttttaa aatatagggg 2040 

tagggaatat taaaatattt ggtttttatt aggaagatta tggttttgaa aggaaatagt 2100 

agatatgata ttttatttta tttggattta tgattaaaaa aataaaaata aaaatttaaa 2160 

gagtttgttt gtattttttt tttttaaatt ttggtttggt ttgaaggtag ggaatttaaa 2220 

agattgaggt tgatggaaga gagttagtgg ggtgagtgag tgggtagttt ttttttttgt 2280 

tttttggagt tatttagaag gataggggaa gggaaggaag aagaggtgag gaaaaagagg 2340 

agggagggaa gtggaggtta ggagtgatgg agtaaggaaa gtagtttgta agtgagaaaa 2400 

gagggaaaaa atatagttgt atgaatttag agagattata agttgtatgt aagtagtagt 2460 

agaaagagtg agagtgtgag tgtgtgtttt ttttgtggtt tggggttaga tagtttttag 2520 

attagtttga attatttttt aagtattgtt ttgttttttt tgttttggtt gttttttaat 2580 

tttttttttt tttttttttt attttttttt taaaaattaa aataatataa gggagggtgg 2640 

taaaagtttt tttaaattgg ttgatttatt taaagataat aataataata ataaatatat 2700 

aataatttat attttatggt gggagagatg tgggattaat ttttggtatt tattttaata 2760 

tttgatagtt agaataaata aatatatata tttatattaa tagatatata tagaaaattt 2820 

ggagttaaag tatttggtaa gagtggaaaa aaaaagaatt aaaaggtaaa ataatgatta 2880 

tgagtagtgg tggtggtagt ggtattagtg gtaatagtgg tggtggtggt agtagtagta 2940 

gtagtggtgg tagtaatagt aataattatt tggtgtttgg ttttttttag aaattttttg 3000 

tattattatt tttaagaatt ttagttttaa gaattaatag agtttaattt ttggaatttg 3060 

agttttggat tttattattg ttatgtggta ggggaggatt tggtgttagt tttttgagat 3120 

ttttattgtt tttggttaat taaaagtttt taaagttata agattttttt attggttggt 3180 

atattttgag gtttttataa gtagagtgtt ttggatttgg aggttttggt ttgaggtttg 3240 

aggggtttga aggtggtttt ttttttttgg gtttaagatg atggtatggt ttgttttgtt 3300 

attattatgt gggttttttt tttgtgatgt tggtgttttt gttgtagtaa agtttggttt 3360 

ttggaatttt gagaattaat ttgttatttg gtgatataag agggggagtg tgttttgttt 3420 

ttttggggtt tggggttaat tttttttttt ttatttataa atttagtaga ttgagttaaa 3480 

tgtataaaag ggagtgagag gtttgaatta ttgggaaaag tatgttatat atatagtagg 3540 

gttagagagg tgagtaagag aaaaataaaa taaaataaat attatagttt tttttaatta 3600 

gaatattagg tattatgaga aaaatatttg ttaagtagtt tttggtgggt ttatttgttt 3660 

tatttttatt taggataggg gtttttgttg ttgttttggg tttttttttt tttggtgtgg 3720 

tggtttggga tttttggttt ttgtattttg atggtttatg gatttttgtt tttgattttt 3780 

tgttttttgt aagtttgtgg tgttatgtaa attataggat tggtattggt tggatttttt 3840 

tgtatgtgtt tttttttttt ttatttaatt ttttaagtgt tttaaagatg tattatttta 3900 

atattaatat tattgaaaga agtttaaatt tttggttata tgtaataatt ttagttttta 3960 

tttttttttt tttttttttt tttggtgtaa tttttttttt tttttttgga tttttgttga 4020 

agtgtgtttt ttttgtattt tagagaaatg tttaaaggat ttgttttggt ttggtttgtt 4080 

tttttttagg atagtaagtg gtgggtttaa tttgttattg ttgatttttg ggaaattttt 4140 

tgttgtaaga aatgtgtgtg tgggggggag ggtgggggtg gtggggtggt atgtgtgtgt 4200 

tttttataaa attttgtgag ttaaatattt gtttgtgttt tgtttttttt taaggttttg 4260 

agatttttgt ttttgaggtt tgttttaagg ttgttgtaaa aaaatttttt tagtttgtgt 4320 

ttaagagatt agttggaggg aattttaaag gtttgttgtg ttagtattat agatattgtt 4380 

tgtatttaga atagattgtt ataattataa tgtattatat gtagtatagt ttttttagtt 4440 

ttgaagttga gttggggggt ggggtggggg gggggtagag aagaaggaga aatttttttt 4500 

tttttttttt ttttattatt ttttttttaa aggtaattgt agtgtagaat tattttattg 4560 

tgggttttag tttgtttgtt tagagttgtt agttttttta tttaaatgtt ggtttgggtt 4620 

tttttttttt ttatttattt ttgaataagg gagatttttg ttttttttat gttttttttt 4680 

taaaagaagg gtgtgagggg aaaataatat tttagatttt ttaagaatta gtttaaaatg 4740 

tttggattag aiagtagtttt taaaggttag gttgttttga gttttggtta tgagagtttt 4800 

ttaagtgatt tattaaatta tagatgaatt tttaaagttt ttataaaatt ttgttatttg 4860 

ttaattttta ataaagatat gtttaaaata atttatttat tttaatttaa aatttgggtt 4920 

tataatggag gagtgattag agttgtgaaa gattgttgtt tgttgggata tttggaaaat 4980 

tttttttatt gatagttttg taagaagttt tatttatt^t tgaaaatgtt gttggtaatg 5040 

tattttttta ttttataata tatatatatt tttaaaggtt gtattttagg ttttaaaaitt ,5100 

tagaaaggtg taaagaggtt gattattttt tataagggtt ggagtttagg aaatatgtgt ,$160 

tkjaagtgttt gtttttgtga aatttgaata ttttaatatt tattgataga tggattattg 5220 

tagtttttgt agaaaagttt gttti;gtgaa ^ggtt^ttga , aaaatatata taggtttaat 5280 

tttgttattg ttgttgaaat gtatgtttta tttttaaaga ■ tatatgatat tgttaagttt 5340 
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agatttattt tgttagaagg tagattattt gtttttttat ttgagttttt ttttttttta 5400 

tttttaaaga aattaaaatg aattttgttg gatgtaaaat gatattttga ttttgtggaa 5460 

gggatttttt ttgtgtgttt atatgtgtaa tttttttttt tatgagtaat tagtatttta 5520 

ggatttttta gatgtattta tatggtttgg aatgtgaatt gtatggtttt gttttattat 5580 

tgggttttta aaagattatt tttatgattt ttggttattt ttttttttgt taaaattgtt 5640 

atttagtttt atttattgtt ttttttaatt tgaaatatag tttgtttttt ttgtgttttt 5700 

tttgttattt tatttttagt tttatggggg tgttgtagag taaagggaaa tttgtttttt 5760 

aaggtagtag tagaattagt tagagtgggg gttgggaagg aatttatttt ttttatattt 5820 

tagttttttt taattttttg tagtgagttt ttagagggag aatttattag ggttggttta 5880 

tatataatta attttattat tagggatttt aaattttttt agtatattat ttttgggatt 5940 

aagtgttaga gtgttatatt tttttattgt tttttgttaa attaatgtgt ttatattgaa 6000 

agaaattgat tagaatagaa gattagttag aaagaaagag aaaattatta ttgaaaattt 6060 

tttaaagata ggtatttatt ttttggtgag ttttttttat tttgaattga atgaatggtg 6120 

aggaaaattg tgtatttgtt ttttaatttt tttttaattt ttaggaaagt aattttgtta 6180 

aatattatat tagttgaatt ttttttagaa tgttattttt ttttttattt ttgatttttt 6240 

tttagtattt agttggtatt attatgtggg ttttttttat ttttgtagtg ttgtgttata 6300 

ggattaaaga aagggttgat ttgaatgtat tttgattttt ttaaattatt tttaagaagt 6360 

tttaaagagt gattttatgt ttggggttgt atgaataatt tttgtagatt ttggggtatt 6420 

atattttaat tagttttatg atttaatggt tattgg 6456 

<210> 399 
<211> 6456 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 399 

ttagtaatta ttaaattatg ggattagttg aaatgtagtg ttttgaagtt tgtaggaatt 60 

atttatatga ttttagatat ggaattattt tttagagttt tttaaggatg atttaaaaga 120 

attagaatat gtttaagtta gttttttttt taattttgta atatggtatt gtgggagtga 180 

gggaggttta tatagtgatg ttaattggat attgaggaga ggttaagaat gaaagaagaa 240 

atgatatttt ggaagaaatt taattggtat aatatttgat aaagttattt ttttaggaat 300 

tgaaaagaga ttgagaggtg ggtgtataat tttttttatt atttatttag tttaaagtaa 360 

aagagattta ttgaaaagta agtgtttatt tttagaaaat ttttaataat gatttttttt 420 

ttttttttaa ttggtttttt gttttggtta atttttttta gtgtaaatat attgatttgg 480 

taaaaagtag taggaaaatg tggtattttg gtatttggtt ttagaaataa tatgttggga 540 

agatttgagg tttttggtga tgaggttaat tatatatgaa ttagttttgg tgggtttttt 600 

ttttaggggt ttattgtaga gaattaaaga gggttgaggt atgagaaggg tgaatttttt 660 

tttagttttt attttggttg gttttattat tgttttagag agtagatttt tttttgtttt 720 

g H2^ gtttt tat g9gg tta agagtggagt ggtaaaggga atataggagg gataagttgt 780 

gttttaggtt gaggggggtg gtggatgagg ttgaatggta gttttgataa agaaaaaagt 840 

gattaaaaat tataaaaata attttttgag ggtttaatag taaggtagag ttatataatt 900 

tatattttaa attatataga tatgtttgag aaattttaaa gtgttaattg tttataaaag 960 

aaaaaattat atatataaat atataaagaa aatttttttt ataaaattgg ggtgttattt 1020 

tgtatttagt gggatttatt ttaatttttt tgaaaatgag aaggaagggg atttaaatga 1080 

aaaagtagat agtttgtttt ttggtagaat aaatttgaat ttgataatat tatgtgtttt 1140 

tgggggtaaa atgtatattt taataatagt gataggatta ggtttatgta tattttttaa 1200 

aaattgttta taagataggt ttttttgtag aggttgtagt aatttatttg ttaataagta 1260 

ttaaaatatt tagattttat agggatagat attttaatgt atatttttta agttttagtt 1320 

tttgtggaaa ataattaatt tttttgtatt tttttgggtt ttaaaattta aaatatagtt 1380 

tttaaaaatg tgtgtgtgtt gtggggtagg ggggtgtatt gttaataata tttttggtga 1440 

tagatggaat tttttatggg attgttaatg aaagagattt tttaaatatt ttagtaaata 1500 

gtaatttttt atagttttga ttattttttt attataaatt taaattttgg gttgagatag 1560 

gtagattatt ttagatatat ttttattaga aattaataag tgatgagatt ttgtggaagt 1620 

tttaagaatt tatttgtaat ttaataagtt gtttgaagga tttttatagt taaggtttag 1680 

aatagtttga tttttgaaag ttgtttttgg tttaaatatt ttgggttaat ttttgagqaa 1740 

"JSSSHJ tat "ttttt ttatattttt tttttaagag agagatataa aagaaataag 1800 

agtttttttt atttagggat gagtaggagg ggaaaaaatt tgaattaata tttaaataag 1860 

U a ^E tagta gttttgaata aataaattag gatttataat gaaatgattt tgtattgtaa 1920 

i52IIiiH2 aaa g aaa g ta atagagaaaa agagaaggaa agaatttttt tttttttttt 1980 

2S252 aa 9 a agattg tgttgtgtgt 2040 

a gttgtggta gtttgtttta aatataggta gtatttgtga tattggtatq 2100 

gtaggttttt agaatttttt ttggttgatt ttttaaatat agattgaaga gattttttta 2160 

SESS& SSS8ES gggffig KSgyg *htLg!g 855358 §§L 

dtciaanaggr atttggttta tagaattttrg tagaaaatat atatatatta ttttgttatt 2280 
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tttatttttt tttttatata tatgtttttt gtaataagaa attttttaag agttaataat 2340 

aatagattaa atttattatt tgttgttttg gaaagaaata aattaaatta aaataaattt 2400 

tttgaatatt tttttgaagt gtaggagaga tatattttag taaaagttta agggggaaaa 2460 

agaaaattgt attaaaggaa aaaaaaaaaa aaaaagtggg ggttgggatt gttatatatg 2520 

gttaaaaatt taagtttttt ttaatagtat tagtattgaa ataatatatt tttaaaatgt 2580 

ttgagggatt agatagggaa agaaaaggta tgtataaaaa aatttaattg atgttgattt 2640 

tgtgatttat gtaatattat aaatttgtaa aaggtaaaaa attagaagta aaaatttata 2700 

aattattaaa atatagaaat taaaaatttt aagttattat attagaaaga aaaaaattta 2760 

gaataatagt aaaaattttt gttttaaata aaaataaagt aaatgaattt attgaaaatt 2820 

gtttggtaaa tatttttttt gtggtgttta atattttagt tggaaagagt tgtgatgttt 2880 

attttatttt attttttttt tatttgtttt tttaatttta ttatatatat aatatatttt 2940 

ttttagtggt ttaaattttt tgtttttttt tgtgtattta gtttgatttg ttgagtttat 3000 

gggtaagaaa gaaggaatta gttttagatt ttgggaaagt aaagtgtatt ttttttttta 3060 

tgttattgaa tagtaaatta gtttttagaa ttttagaggt tgagttttgt tatagtgaag 3120 

gtgttgatgt tatagaggag gagtttatgt gatggtggtg gagtaggtta tattattgtt 3180 

ttgggtttgg ggagggagag ttatttttag gttttttgag ttttgaattg gaatttttaa 3240 

atttgagatg ttttgtttat gaggattttg aaatatgttg gttagtgaaa aaattttgtg 3300 

gttttgaggg tttttggttg gttaggggta gtaaaaattt tggagagttg atattaagtt 3360 

tttttttgtt atgtagtagt ggtaaagttt gaagtttaaa ttttgagaat tgagttttgt 3420 

tgatttttag aattggggtt tttagaagtg gtgatgtaag aagtttttag gaaaggttgg 3480 

atattaggtg attattgttg ttgttgttgt tgttgttgtt gttattgttg ttgttgttgt 3540 

tgttgttgtt ggtgttgttg ttgttgttgt tgtttatgat tattatttta ttttttaatt 3600 

tttttttttt ttgtttttgt taaatgtttt ggttttaagt tttttatgtg tatttattga 3660 

tataaatgta tatatttatt tattttagtt gttaggtgtt aaaataaatg ttgaagatta 3720 

gttttatgtt ttttttatta taggatatag attgttatgt atttattatt attattgttg 3780 

tttttgagtg aattggttgg tttggggagg tttttgttat ttttttttgt gttgttttgg 3840 

tttttggaaa ggaggtggag gagaggaagg aggggaatta gggggtggtt ggagtagaga 3900 

ggatgagata gtgtttgggg ggtgatttgg gttagtttgg gggttgtttg gttttagatt 3960 

gtggagagga tgtgtgtttg tgtttttgtt ttttttgttg ttgtttgtgt atggtttgtg 4020 

atttttttgg atttgtgtgg ttgtgttttt tttttttttt tttgtttgta aattgttttt 4080 

tttgttttgt tgtttttggt ttttgttttt tttttttttt tttttttttg tttttttttt 4140 

tttttttttt tttgtttttt tgggtaattt tgggaggtaa aaagggaggt tgtttgtttg 4200 

tttgttttgt tggttttttt ttattggttt tggtttttta gattttttgt ttttgagttg 4260 

aattgagatt tggaaggaaa aaatgtaagt gaattttttg ggtttttgtt tttgtttttt 4320 

tggttataaa tttagatgag atgaagtatt gtgtttatta ttttttttta gagttgtgat 4380 

ttttttaatg agagttgagt gttttggtgt tttttgtttt tgtgttttgg ggagtttggg 4440 

ggtggggatg ttgaatattt tgaaaattat ttttttttgg ttgtagttga gggagttggg 4500 

aatttttgtg tgtttttttt gtttattttg aggagagaga ttgatggtgg gatagggttt 4560 

tttggggtgg gtgggaaagg ggtgtgtatt gtatttgtga tgttttttag tgtgtggggg 4620 

gatttaggga attttgatag gatagtttgg gagaatgaga aaggtggtgg gatttttggt 4680 

tgttgtggtg tttgtatttt gggtgttgtg tgattttgtg ttggttttta atggttgttg 4740 

tttttttttt ttaaatttta ttttattttt ttttatggtg taagttttgt aaaaagggaa 4800 

ttgtgggttt gagtgtagtg gatgttaggt tttagatttt tgaggttttt attttgggtg 4860 

aggagaaaga tgaatagagt tttagtttgt gttttttatg tgtaagattt aggaggagag 4920 

aagggtattt tgtggttgtg gttgagggga tttggtgtgg gaggagtggg tgtgggtgtg 4980 

aaagggagat ttttgtgagt gattttgtaa aaatagattg tgaggttggt tggatttgta 5040 

atttgtggtt ttttttgagg gagtaagaat gggggaaggt gtggtggtgg tggtttgggg 5100 

agggagtggg tagagttgga gttttagaaa ttggttgagt tttgggggtg ggtggggaga 5160 

aagggtgggg gggtagtagg agttaggggt tattttgttg ttggatgtag tgattgtggt 5220 

aaatgttttg agaattgtgg gttgtgttgt ttttatgatg ttgtgttatt ggaattttgg 5280 

tgaaaaatgg aattagtgtt gtaataatga gttttaaagt ttttttatgg aaaataaaaa 5340 

tataattaaa ttgatttttt aaaaaaatga aaaaagattt tttgtgttgt gttttttttt 5400 

atgatggttg tgttggtggt tggggtttgg gggtgggggg ttttgatagt tgagtagata 5460 

aatgatttta aattaggtgt tgaaagtttt aagttgttat agggtttttg attattttag 5520 

agagttaatt tttattttgt gttttttttt tgttggtttt tgttatttta taatttgtta 5580 

ggatttaatt ttttatggat ttttttttta gtttatagtt tttaggagga aagttttgta 5640 

atgttttatt tttttttttt tttttttaaa gtaaagtgag tgttagtaga tgtgtttttt 5700 

atagtttgga ttttaaaaaa agtggggtag gagagaggga attagggaaa gggttgttta 5760 

gaaatatagt ttgtggtttt gagtgatgtt attggttttt tttatagaga ttgttagttt 5820 

tttttttttt ttgtaaggtg tgtgtggttt taataaataa tttaaaaaaa aaaagagggg 5880 

gtggtaagat ttattaggag ataggtattt attttgttta ttatttatta ttaagagttt 5940 

ttggtattag aatagaattt tgtaaatttg tagtttaagt tgaagtttat ttggttttag 6000 

tttatatgtt atttttggag gtagtaaata tgagtttaaa gaagagagat gaaattatat 6060 

ttttgtttta aaatatatat ttttttaaaa aagatgaaaa gatatttgtt tttgggggag 6120 

gaggaaatgg tgaataagta ttaggagatg ggtatttttt tattttaatg attgtagttg 6180 

agtttttagg ttaaaataga aatttgtaga tggtaatttt ttttttattt ttttgttttg 6240 

tttgttattt taagtttata aatagaagtt tttagtattt tatttggaag attttttaat 6300 

attagaggtt gaagtattgg ttagttttaa gggttgttga tagtgatagt aaggttagat 6360 
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2^9«tata aaaattgggt gatttgaagg agatagttag aaaaagtggg aattttttaa 
ttgttttttt ttttttttta gttttgaagg gtttta 

<210> 400 
<211> 4001 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic dna CHomo sapiens) 
<400> 400 

tgtatttttt 
attttgtgag 
tttttttgat 
attattttga 
ttttgttaat 
attagaattt 
agaataagaa 
tttttttatg 
aaatttatta 
atttttaaga 
gagtttttta 
tatatatttt 
gtaagttaaa 
aataatgggg 
tttttttttg 
ttattatagt 
gaaagttaga 
gttttgattt 
tttgatgata 
taaaattttt 
ttttttttaa 



6420 
6456 



ttttttttta 
attgtttttt 
tatgaaaatt 
atatttttaa 
tgataaaaat 
attagttttt 
gggaatgggg 
gttttttttg 
atttaagtat 
gtttaaaata 
aaatttaaat 
ttaaggtaat 
atttttatta 



gtttttatta 
ttttagtttt 
tagattttta 
gttattagtt 
aattagaaaa 
tttttataat 



ggaggtaggg 
attgtttgtg 
ttttttttat 
gagtttaagt 
aaataagtgt 
tattaaagga 
ttgaggtata 
ttgatttggg 
tagtttagtt 
gggtgttttt 
taaattaatt 

ggggtgattt 

ttttagttat 
ttttttgttt 
attttaggat 
agagtttgtt 
tgagtggttg 
tggtatgttg 
ttgttttttg 
tgatttttta 
ttaatttgtt 
aatagatagg 
tagagttgta 
agaaatatgg 
tattggttat 
atgtatggga 
ttttgtgagt 
ttttagagga 
atttttatta 
tagaattttt 
tgttaggaaa 
atattgtagg 
gtgaaggtga 
gtgattatag 



tttaattttt 
taagtagata 
taatgttttt 
tgttttatag 
ttagtttttt 
taatgagatt 
ataatggatt 
gaaaaattgg 
aaggaggaaa 
ttttttgtgt 
aaatgaaata 
ttgtagatta 
tggataattt 
ggtaatatta 
agatgttaaa 
tgatgttaat 
tgtaataggg 
taagttgttg 
taatatattt 
ttggtttttt 
tgttattttg 
tgtttttttt 
tagagttttt 
ttattgtgtt 
atggttttgt 
agttgtttga 
gaattatttt 

ggggtgtttt 

ttgtaagtat 
tagaatagaa 
gattgtttat 
ggatatatat 
atttttgatt 
ttttttattt 
tagtttttat 
tatgtttggt 
ttagttataa 
aaattgtttt 
tttttgattt 
ttaaagttga 
gttaagttgt 
ttttgtaatt 



gtaatggatt 
gtatgtatgt 
atttaattta 
taattaaaaa 
attaagatat 
taaagggaat 
atatagtatt 
tgaagtgtag 
ttttttaaag 
tttattaagt 
ttgttttggt 
tatttaaaaa 
ataaataaaa 
tatatttttg 
ggatgggggg 
ataagtaaaa 
tttttttttt 
taatggatat 
gttttttgtt 
taaatagtta 
gtaattgatg 
tttaaggtta 
tgtgtaatta 
agattttaat 
tgtgggatgt 
atatttttgt 
ttattttgta 
aaattttgtg 
tagtgttgtt 
tttggtttta 
tatttagtat 
tttttgagtt 
gaatggtata 
agtttttgta 
tgtgtgtaaa 
taggttattt 
tgtttgtaga 
atgggtggtg 
gtagttggtt 
ggttttttta 
aattttagtt 
ttattttttt 
tatttttaat 
atattttaaa 
tgaaggtata 
gattttagat 
taaattaggt 
aatgttttgt 
tgttttgaaa 



atttttgaaa aaaaattttt 
gattttaatg tttaagaaag 
ttatttttta gaaatgtgat 
tagttttttt tttatggatt 
tgttttaatg atatggtata 
ttttttttat taatatgtaa 
taggtttaaa atatatattt 
tgtatatttt taatattttt 
gatttagttt tagtttagat 
tgtaaatatt tgaaattagt 
atattttttt tattaagatt 
atttattttt gtttgaggaa 
ttattatatg attttttttt 
tttttttagt aataaaggtg 
gtaagaaagg gaagattttg 
gtttaattgt tttgaaagtt 
ggtaattatt ttaagatagt 
atggggatat taattatttt 
tattttgtga tgtttgattt 
gtttagttat tatatatttt 
gttagggggg tgggagagta 
ttagtatata tataagtttt 
taagtttttt taggtttatg 
gaattataga gaattttatg 
tataagtgga ggtgattgga 
aagaagtttt aatgggatag 
aaattatagt gatggttgat 
ttaattaatg tgtgtaagaa 
ttggttgttg tgttatttta 
tagggagagg aggagtgggg 
ggaagggatg aaagaggtga 
tgtgttttta ggattatgag 
aaggttttta attgggtttt 
tttttttgta ttttttatgt 
tgattttgga attatagaga 
tataatattt aattggttgg 
aataggaatt agaagaagat 
ggaggtattt tgatttagtt 
ttttgtagat aaagaagtgt 
tttttttgtt ttttatgatt 
tgaatattta gataatattt 
tatgtgagtg tttagtatgt 
ttatttgaat aaataaatta 
atttattatt tgttagtagt 
ttaagttata tgagtggtaa 
ttagttaaaa gttttttatg 
tttgtttttt ttttattttt 
taaaggtaag tttaaatatt 
tttaatttga ttattttatg 
atattgatgg agatatatat 
agattggttt tatatttaaa 
tgaatttgag tgttagaaag 
agtatataaa aaagaaaata 
aattagtttt tttttttttt 
aataatgtaa gaagagaaag 
Sei te 372 



ttaattagaa 
atggtattat 
gagtgtgaag 
atagtattga 
aattatgaat 
ttgtaaaaga 
tatttttgtt 
aagatattaa 
ataatgtttg 
gttatattga 
tgatttttaa 
aaaatatttt 
tttattttaa 
gataaatttg 
ttagtttagt 
agatgtttta 
ggtattaaag 
ataataaagt 
attatttaat 
gttaagtggt 
gtagggagta 
tttatttttg 
ttttgaattt 
gaagatattg 
aaattaagtt 
ggtttaggtt 
tgattatgat 
gtggttttaa 
ggtttaggaa 
taaaggggtt 
ggaagagtag 
aagtgggtag 
ttggttaatt 
tattttttat 
atatttaaga 
ttgttggtta 
agttgttaat 
gttgggagtt 
tttgtgtttt 
taagtaggtg 
gttattgagt 
gtggaaattt 
gtatttgtat 
gttttgaaga 
ggttgatata 
atgttatttt 
tatttttttt 
taaaaaatta 
taggaaggtt 
taatttttag 
tttatttttg 
taatgatgag 
attttgaatt 
taatatatta 
_agaaaatagi:__. 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3Q60 
3120 
3180 
3240 
33D0- 
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gtatgttgga aatattaaag ttttttatgt agttaaaatt ttatatgtag ttattatttt 3360 

taagatatgt tgagtagttt ttattttaat atattttttg tatttggatg ggagaattaa 3420 

ttataataaa tatagtttgg tattaatagt atattaataa ttttgttagt atagattaaa 3480 

agagttttta tgggaattat tggttggtaa aggatttaga aaataaatat atttgtagag 3540 

atttagtgga attattaata ggtttttata gtttttggtg ttttttaaaa ttaggtaatt 3600 

agtgaaatgg tttatttaat gtgttatgta gtaaaggtag ttgttttttt tttgttagtg 3660 

ttgaaaatga aattatattt ttttttttga aagtttataa taaattatga mttttttta 3720 

aatgataaat gttttgagta gtgatttttt attttgtatt ttaaatgtaa tatgtgtagt 3780 

tgtgttttgt ttttttgttt tttaaatgtt ttagtgtttg aatatttatt tgagtgatat 3840 

ttaattatta tagttggtta gggttgattg tagtgattga aataggattt tgatgttttt 3900 

agttttgatt tttattattt ttaaatttat ttggtttata ttgtgtaatt agtagatgtt 3960 

tatagattag ttttttgttt taattgaagt atattgttat a ^ 4001 

<210> 401 
<211> 4001 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 401 

tgtaataatg tgttttaatt aggataggaa gttgatttgt aaatgtttat tgattatata 60 

atgtaaattg gatggatttg aaaatggtaa aaattaaaat taaaaatatt aaggttttgt 120 

tttagttatt gtaattaatt ttgattaatt gtgataatta aatattgttt aagtgaatgt 180 

ttaaatattg aaatatttag aagataagaa agtaaaatat agttatatat attgtattta 240 

ggatgtagaa tagaaggtta ttatttaaag tatttgttat ttaaaagaaa gttataattt 300 

attataagtt tttagaaaaa agaatataat tttattttta atattagtaa aaaggaggta 360 

gttgtttttg ttatatgatg tattaggtgg gttattttat taattatttg attttagaaa 420 

atgttaaagg ttatggaagt ttattaatgg ttttgttgaa tttttgtaga tgtgtttgtt 480 

ttttaaattt tttgttaatt aataattttt atgaggattt ttttagttta tattgataaa 540 

attattaata tgttgttgat gttagattgt gtttgttata gttgattttt ttatttaagt 600 

ataagaaatg tattgaaata aaaattgttt agtatatttt aaaaatagta gttatatata 660 

aaattttagt tatatgaaag attttaatat ttttaatata tattgttttt tttttttttt 720 

tttgtattgt tttttaaaat gagttgtaaa attatgatta ttaatgtgtt aggaaagaaa 780 

aaaaattagt tgtaaaatat tataatttag tttatttttg tagtttaaga ttattttttt 840 

ttttatgtat tatttggttt attagtttta atttataata ttttgttatt gttttttgat 900 

atttaaattt agtttaaaat taaattaaga attttttaat ataaaggtaa gtttaggtat 960 

aggattgatt ttatattttt aagggtagtt taaagatttt gttaaaaatt agtatgtatt 1020 

tttattaata ttttgaaata tttataattg gtggtaagaa tagttttttt atatgggata 1080 

gttaggttaa gattagaaat aattaagtgt attttttaag atagtttttt aaatgtttaa 1140 

atttattttt aagggaaata agtgggggtt gatttatgga aagaggggat agaaggtagg 1200 

gaaagggtga aggttagagt taagtgaaga attttgtatg tagagtgata ttatagagga 1260 

tttttggttg gtgaaaagat tagttgggaa tatagttaat atatattaat tttgttattt 1320 

atataattta aaattgattg tatatgtatt tttgtatttt ttttttaaaa tattattaat 1380 

aaataataaa ttattgttta tataagtaat ttgtgatttt aatatgggtg ttaatttatt 1440 

tgtttaggtg gtttataaat gttttgtttt gtttgtttat taaattttta tatgtattag 1500 

gtgtttatat gggataattt gatgtttata aagtggatta aatttagtgg taaatgttat 1560 

ttgggtattt gtttatatat ggagatattt ttaaaggatt gtatttgttt gggttataag 1620 

aaatagaaaa atgtaaaaat taaggtaatt ttaaaagata gaaaatgtag aatgtttttt 1680 

tatttgtaga atgtattgtt tttggatggt ttaatgtatt aggtttttag taattaagtt 1740 

aagatgtttt taatttgaga gataggattg ttagttattt agttagtagt tatttttttt 1800 

taatttttat tgtattagat gagtgtaatg aggtgggttt ttagttgata attaattaat 1860 

tgagtattgt atggaattgg aagagatttt ggttttggaa tttttgggta ttttttatgg 1920 

ttttagggtt aagtggtgtt aagagaaggt gggatagagg aatgggaaat gatgtgagga 1980 

gtgtggaggg gtgtgaggtt ttaagatggt ggtagttgag gggttgattg agagatttag 2040 

ttgaaggttt ttatgaagtg aaagaggttg ggagttgttt tttatttgtt ttttgtagtt 2100 

ttgggagtat agtagaagtg tgagtgtatt gaattgattt attatttttt tttatttttt 2160 

ttgttttttt tatgttttgt gtagtggttt gggggatatt tagttttttt gttttgtttt 2220 

tttttttttt ggttgaggtt tttttgttgt ggattgggtt atttttgggt ttggggtggt 2280 

gtggtgatta gtggttgtgt ggttggtatt gtttgggttg gttggggttg ttttttatgt 2340 

atgttgatta gtagttttag gtttgatgtt ttgtattttg agttatggtt ggttagttgt 2400 

tgttatggtt ttgttagttg ttggtgttgt tttttttggt gaatttggat tttattttat 2460 

tgaagttttt ttggttgttt agagttattt aatgtttatt tggtttgatt ttttagttgt 2520 

ttttgtttgt agataaaaaa ttaatttgta gatttaagtt ttagtgtttt ttgtaaaartt 2580 

ttttgtaatt tgtatrttgtt gtgttttatt tgtgggggag gagatttaga atatgagttt 2640 

gaggaaattt gagaaaggaa agtataaaaa atgtaggtaa ttaaagatgg gagagtttgt 2700 
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atatgtattg 

atttttttaa 

ggtagttagg 

attataggat 

gatgttttta 

aagtaattgt 

ggtagttgga 

ttttttttta 

tattggaaag 

ttatgtaata 

taaaaataag 

aagaaaaata 

aaatatttgt 

agaattgaat 

gagaatgtat 

ttttgagttt 

aataaaggga 

ttgttaaagt 

gaaaaaagtt 

ttgaagaatg 

atattaaagt 

tttttagaga 



attttgtttg 

tttttttatt 

ttaggaatat 

attttatttg 

ttttttagat 

tattagaata 

tatttggtga 

tttttaaaaa 

attgtgtttt 

gggtattgtg 

tatttttttt 

tgtgttttaa 

atttttagtt 

ttgaattaaa 

agtatatatg 

agatttatta 

ggttatagaa 

attttttagt 

aaattgatga 

atagagattt 

taaaattaag 

ttggtaggga 



tttttttttt 
tttagttttt 
ggatttatta 
taattttatt 
gaggaagtta 
gttgtgaaat 
aagagatatt 
gtgtttattt 
aaaaaattaa 
ttaataaaag 
aattatttta 
tatttaagtt 
gtgttttgaa 
tatgtttaaa 
ttaagaaaaa 
tttttatttt 
gaaggattaa 
tgtttttatt 
tttgaggata 
aaatttttat 
gaaagaatag 
ttagaaaaag 



seq_prot . 
tttttatttt 
tttagaagga 
taaagatttt 
gtattattag 
aaaattagag 
atttggtttt 
aattgtaata 
gtaaaaggaa 
gttttattat 
ttttagttta 
agaaatatat 
ttggggagtt 
tttttgagag 
ttaatgagtt 
ttatggagag 
ttttttattt 
tgaattttga 
agttgataaa 
tttaaagtgg 
ggttaagaaa 
ttttatagaa 
gaagaaatat 



txt 

ttattttttg 

aattatttga 

agttaagtag 

agttttgtta 

tttttgatat 

ttgaaatatt 

aattgggttg 

gtaaatttgt 

tttggaataa 

taaaatattt 

attgggagtt 

tttagtatag 

ttaagtatta 

tttagtgttt 

aggtaaaggt 

tttttttata 

tgtttatggt 

gttagtgttg 

tttttatatt 

ggtgatgtta 

tttttaatta 

a 



ttgttttttt 

tagggtgtat 

tgagttaaat 

taaagtaatt 

tgttgttttg 

tggtttttgg 

ataaaatttt 

ttgtttttat 

gaggaagaag 

tttttttaaa 

aggttttagt 

tattaatttt 

tatttgaatt 

tagaagtatt 

gaggtatgtg 

attatatgtt 

ttgtgttata 

tgatttataa 

tattatattt 

ttttttttgg 

aaggaatttt 



<210> 402 
<211> 4341 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 402 



2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4001 



ttatggaagt 

gtggggtgtt 

gaatgggagt 

tttttttatt 

aatagaattt 

taaagaaagt 

tttagataaa 

ttatattgat 

gtgataagtt 

ttaaaatttt 

tgtgtgttta 

ttatttgggt 

ggttgtttta 

ggtgggtagg 

ttttttaagg 

attataatgg 

tattttaggt 

gatgtgatgg 

tattattatt 

aggttagatg 

ggaagatggt 

aggggttttg 

ttagtgagtt 

t:gatgggggg 

tattagggag 

tttagtaggt 

gtgttgtagg 

atagggtttg 

aggtgaagtg 

ttgggggtag 

tttgtggtgg 

agtaattaat 

tatgtttgat 

tttttaattt 

ggaggttttt-- 



ttttagtggg 

aattgttttt 

ttttaaaggg 

tttgtagtaa 

ttaatttggg 

agttagttta 

gatagtaaat 

tagttatttt 

tttgagataa 

agtttatgtg 

tgtgaagaga 

gtaggtgggt 

taggatttgg 

ggtgaatttt 

gtggggaaga 

tggaatgtta 

tatttggatg 

tttggtttga 

ttatttattt 

agttagattt 

atgggtgtgt 

tagttattgg 

ttgggaggga 

aaggatatgt 

tttttggagt 

ttgggtttgg 

tgtggggtta 

tgagttttat 

agagtggagg 

tgtttagtaa 

agattagtgt 

aataataata 

tttggttttt 

gtggtttttt 

gagggatttt 



ggtttgtaag 
aaagggaaat 
aggattattt 
aattttagtt 
ttaatagtag 
tatgttgttt 
tatttaattt 
ttaagttgga 
gtttgtattt 
ttatttttaa 
ttattaaata 
tgagtttgaa 
gaaggtaatg 
ataaagtata 
ttataaagta 
ttagttaagg 
tatatttgta 
taattattat 
tattttattt 
tagggaggtt 
ttggttaggg 
tatataatgt 
attgaagatt 
ttaggtttaa 
attttgattt 
gtttgagaat 
ggagttaggt 
ttttttgtgg 
ttgtatttta 
ttgtgttagg 
tttggagtat 
attataatta 
attaggttgt 
tgtaggagag 
-agtaagtggg 



ggattgtgag 

gttatttttt 

atgggtttgt 

aaggaagata 

gtttatgttt 

gagatgggaa 

gtgataattt 

ttagggttgg 

gtaatttgtt 

atagtttata 

ggttttgtgt 

aagagagtta 

gaaaattata 

tttttaaggg 

tattgattag 

ttgtttttat 

agttataggg 

ttatgttatg 

tttttttata 

tagaagtggg 

gtgggagtgt 

ttgggagttt 

tttaattatt 

ggatatgtgt 

ttagatgggt 

ttgtgttttt 

tttgtttttg 

tgtggggtgg 

gtattgtgta 

agtgtggaga 

gggtatgatg 

tagtaagggt 

ttaggttttt 

gttgggaagg 

gaagggtgttr 



ttgagttaag 

atttatagtt 

ttgatttagt 

aagagatttt 

aaaatggttg 

aaataaggta 

tgaggaaatt 

aggttggggt 

ataatgttgg 

ttaaatgtta 

gagtaatatg 

gtgaagggag 

gttaaagggg 

tggggagaat 

ttagggtggg 

tttttttgtg 

gatgtgatgg 

tttattattt 

tatttgtttt 

taaggggaaa 

tggatggagt 

ttgttggtgt 

aatgtatttg 

aggtttggat 

ttgatgaaat 

tgtgagtttt 

tgtttggagt 

ggttgggtgt 

gggattggtg 

agggtattgg 

ggggtatttt 

gttgatgggt 

gatgatgtat 

ggtgggggat 

xjggaaagttt 



gagaatggag 

agttagttga 

tttttttaat 

tggagattaa 

ttaattttaa 

ggatataggt 

ggtaatttag 

tttgaggtag 

gaggggttgt 

attgttttta 

gttgtgtatt 

ataggggtgg 

gttgtttttt 

tataaataat 

gtaggaataa 

gatttttagt 

tttggtttgg 

taagttttat 

tattttggag 

tgggaaagga 

ttgggataag 

tgggattatt 

tttttaaaat 

gattttgggt 

gagtatttga 

tgtgaggtaa 

tgtttttagt 

ggggtgaaag 

agtgttgttt 

gagagtggtg 

tttggttgtt 

gggtttggag 

tagaaatatt 

ggggtttggg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100- 
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gttgtgtggg ttatgagttt atttgaatgt tgattattgt tttttgttga tttttatttt 2160 

ttgggaatgt gtgaaagtaa atttaagtta gattgtggag gttgttgggg agggaaggtt 2220 

taaggagttt ttgttgattt tgttgaataa agggggtttt gagttgggtt gagatggggt 2280 

atgtgtggga agatttttgt ttgttgtttt tttttattgt tttagtggat gttatgtttg 2340 

gggttttttt ggtgtgtggg gttgatgtat ttttggggtt tattgtagtt ggttgggatt 2400 

gtggagtggg tgtggtggat gaagggaggt aggatagttt tgggggtggt agaaggagtt 2460 

tgggtatagt tgagatttgt gtttttattt tattaatatt tatagtaggt gttgttgagt 2520 

tgggtaattg ggatggttta agttatttgg ttaaatttta aattatgttt gttattggga 2580 

agtagagttt agtgatgtta attgtgtttt tatttttatt attggtgtta gttttaaagg 2640 

gtttttaaaa tggttgtgtt attttttagt tttggattgt agttgttggt taggaatttt 2700 

agtgttatgg tggatatgtt ttttttgtgt ttttgttgtt tatttgttta tttagtttag 2760 

gggttttggt aggtagtagt gtatttggtt taaagggtaa gatgtttttt tttttattta 2820 

taataaattt aaatattagt agggtttggg gggaaaaatg tttttagaag aaaaggtgaa 2880 

tgttagtttt gtaagagtta gttttaaaat tagattgaat tggtatatgt atatttatgt 2940 

aataaatttg tatgttttgt atatgtattt tagaatttaa aagtttataa aaaaagaaaa 3000 

aattagattg gattatgttg ggaaagtgta gtttttttta ttttaggtat tttttagaat 3060 

gtaggtagta ggtggttttt attaggagtt ttgggagagg aagggggttg aattttattt 3120 

tttatttttg tttttttatg gggtttgagt tgaggaagtt ttattataag gagagaattt 3180 

tttgataatt ttggatgtta tttttatttt tattgtagga attttgtggt tatattgtga 3240 

ggagattggt tttgggttgg aggttatagg aagattttta ttttttgaaa tttggagtga 3300 

agaatgttgt tatttagtta ttattttaag gtaaggtaga aatgaagtgg gttgttgggt 3360 

tttttttttt tttttttttt tttttttgag ataaggtttt attttgttgt ttaggttgga 3420 

gtgtagtggt gttattttag tttattgtaa tttttgtttt ttaggtttaa gtgattttgg 3480 

tgttttagtt ttttgagtag ttgggattat aggtatatat tattatattt ggttaatttg 3540 

tatgtgttta gtagagatag tattttatta tattggttag gttggtttta aatttttggt 3600 

tttaaatgat ttatttgttt tggtttttta aagtattggg attataggta tgagttattg 3660 

tatttagtta ggtttatttt agttgttatg ttaattaggt tttttgtatt tgtgtgttaa 3720 

tttttatttt tttaaaaggt tttagggtga tttagtaggt aatgagtgat ttttaaattt 3780 

aggatttatt gtgagagatt tatatatata attgagtaga tatttatagt ataatgatta 3840 

aagggagtga tagggtaagg atttatagtg gaggttttgg aggttagttt attgatagtt 3900 

attttaggga gtttagaagt tttgttttag tgttgggtgg tggagggaaa tttgtttttt 3960 

tagggatttt gtttttggtt gtttagttgt taaagttagg aataagtttt tagaaatttt 4020 

attgttaaga ttttgaaaat gttttagata ttgttagttt tttgttgttt ttgtgatttt 4080 

ttataggtgt gtgtgttatt gggtttttat ttattggggt ttttggtggt attgggttat 4140 

agtaagtgtt tttttatttt tttagtttat tatatatatg tttattattt tgaagaaaaa 4200 

tttttttatt atgagtgaaa gtgagaaata tgtatgttta ttgtttttaa agaaagaaat 4260 

ttaatatggg tttaatgtta tttagtgagt gtttttattt tgagatatta gggttataag 4320 

ttattattat atattatggg t 4341 

<210> 403 
<211> 4341 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 

<400> 403 1 

gtttatgata tataataatg atttgtgatt ttaatgtttt aaaatggagg tatttattag 60 

gtagtattaa gtttatatta agtttttttt tttagaaata ataaatatat gtgtttttta 120 

tttttgttta tagtaaaggg gttttttttt aaagtggtaa gtatgtgtgt ggtagattaa 180 

ggggatgagg gaatatttgt tgtggtttaa tgttattaga gattttagtg aataaggatt 240 

tagtggtatg tatatttgtg gaggattgta aaagtgataa gggattagta atgtttgaag 300 

tgtttttgga attttggtag tgagattttt gaaagtttat ttttgatttt ggtagttggg 360 

tagttgagga tgaggttttt ggaggaatag attttttttt attatttagt attagagtgg 420 

gatttttgga ttttttggag tggttgttag tgggttggtt tttagagttt ttattgtggg 480 

tttttatttt attatttttt ttaattattg tattgtgaat gtttgtttaa ttgtgtgtgt 540 

gaatttttta taataaattt tgaatttgag aatta^ttat tgtttgttgg gttattttga 600 

aattttttgg gaaagtgaaa gttagtgtat aggtgtagga aatttggtta atataataat 660 

taaaatgaat ttgattgggt gtagtggttt gtgtttgtaa ttttagtatt ttgggaagtt 720 

aaggtgggtg gattatttga ggttaggagt ttgaaattag tttggttaat atggtgaaat 780 

gttgttttta ttaaatatat ataaattagt taggtgtggt gatgtgtgtt tgtaatttta 840 

gttatttagg agattgaggt attagaattg tttgaatttg ggaggtgggg gttgtagtga 900 

gttgagatgg tgttattgta ttttagtttg ggtgatagag tgagatttjig ttttaaaaaa 960 

aagaaagaaa gaaaagaaag aatttaatgg tttatljttat ttttgtttta t^ttggaatg 1020 

gtggttggat ggtggtgttt tttattttag attttaggga gtggga^ttt tattgtagtt 10A0 

tttgatttag aattgatttt tttgtagtgt ggttatagaa tttttgtagt gagggtaaag 1140 
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gtggtattta gggttgttag ggggtttttt ttttgtggtg aagttttttt agtttagatt 1200 

ttatagagga gtagggatgg gaggtaggat ttagtttttt ttttttttta aaatttttaa 1260 

taaggattat ttattgttta tgttttagga aatgtttaag atggaagagg ttatattttt 1320 

ttaatataat ttagtttagt tttttttttt tttataagtt tttaagtttt ggggtatatg 1380 

tgtagaatgt gtaggtttgt tatataggta tatatgtgtt aatttagttt agttttaaaa 1440 

ttaatttttg taggattgat atttattttt ttttttgaaa gtgttttttt ttttaaattt 1500 

tgttggtatt taaatttgtt atgaataaaa gagagaatat tttgtttttt agattaaatg 1560 

tattgttatt tattaaagtt ttrtgagttgg gtgagtaggt gggtggtaag ggtgtgaggg 1620 

aggtgtgttt attgtgatat tgggattttt ggttggtaat tgtggtttaa ggttgaaaga 1680 

tgatatagtt attttaggag ttttttggga ttgatattgg tggtggaggt agaggtgtgg 1740 

ttagtattat tggattttat tttttagtaa taaatgtgat ttaaaattta attaaataat 1800 

ttgggttatt ttaattgttt agtttggtag tatttgttgt gagtgttggt gggatgggag 1860 

tgtaggtttt agttgtgttt gggttttttt tgttattttt gggattgttt tgtttttttt 1920 

tgtttattgt atttatttta tgattttggt taattatgat ggattttgag ggtgtgttag 1980 

ttttatgtgt tggggaggtt ttaggtatgg tatttattgg ggtggtgggg gggaatagtg 2040 

ggtaggggtt tttttgtgta tgttttattt tggtttagtt tggaattttt tttatttagt 2100 

aggattggtg agaatttttt gaattttttt tttttagtga tttttgtagt ttgatttggg 2160 

tttgtttttg tgtgttttta ggaaatagaa gttgatggaa agtagtggtt ggtgtttggg 2220 

tgggtttgtg gtttgtgtag ttgtggattt gaaatttttt tggtgttttt ttttgtttat 2280 

tagagttttt tggagatttt ttttgagttt tgttttttat ttttttttaa ttttttttgt 2340 

aggaaggttg tgggttaggg ggatgttttt gatgtgttat taggagtttg ggtagtttgg 2400 

tgggaattag aattaggtgt gttttgagtt tgtttattag tgtttttgtt atgattatga 2460 

ttattattat tgttagttat tagtagttga gaaggtgttt ttgttgtgtt tgtgttttgg 2520 

agtgttagtt tttgttatga atgttgtttt tttaatgttt ttttttgtgt ttttagtgtg 2580 

gttattgggt gttgttttta gaagtgatat ttattggttt ttgtgtagtg ttggagtgtg 2640 

gtttttgttt ttgttttgtt ttttttattt tgtgtttagt tttgttttgt gttgtgaaga 2700 

aatgaaattt atagattttg tgttgagggt ggttttgggt gtagaaatga aatttagttt 2760 

ttggttttgt atttgtagta tttgttttgt gggaatttgt gggagatgta ggtttttggg 2820 

tttgaattta ggtttgttga attagatgtt tgttttatta ggtttgtttg aggattaagg 2880 

tgttttggag gttttttaat ggtttggagt tatttaagtt tgtatgtatt tttgaattta 2940 

ggtatgtttt tttttttgtt ggttttgaaa atagatgtat tggtaattgg gggtttttag 3000 

ttttttttta gggtttattg ggatgatttt agtattagta gggattttta ggtattgtgt 3060 

gttaatggtt gtagagtttt ttttgttttg aattttgttt agtattttta tttttaatta 3120 

ggtatattta tattattttt tttttttttt gttttttttt gtttattttt aggttttttt 3180 

ggagtttggt ttatttggtt tttttagggt ggaaatgggt gtgtggaagg aaaataaaat 3240 

aaaataaata aaatagtaat aataaagttt aaaataataa atataatata ggtaatagtt 3300 

gttaggttag gttattgtat tttaggttag gttattgtat tttttgtgat ttgtaggtat 3360 

atatttagat ggtttaaagt aattgaagat ttataaaaga agtaaaaata gttttaattg 3420 

atgatatttt attattgtga tttgtttttg ttttatttta attgattaat gtattttgta 3480 

atttttttta tttttaagaa ggttatttgt aatttttttt atttttgaga atgtattttg 3540 

tgagatttat ttttgtttat tagagaataa ttttttttga ttgtaatttt ttattatttt 3600 

tttaaatttt ataaaatggt tttattttta tttttttttg ttgatttttt ttttggattt 3660 

agtttgtttg tatttaggtg aaatatatag ttatgttgtt tatataaagt ttgtttggtg 3720 

gtttttttat atggatgtat atgaaaatag ttaatattta gtataagtta tttagaagta 3780 

S^fS 99 * taa *atttta agtagttttt tttagtattg taataagttg tagatgtaag 3840 

tttattttaa aagtttatta tttgttttgg gattttagtt tttaatttta gtttagtttg 3900 

gaggatgatt agttaatata attgaattgt tagttttttt agaattgtta tgggttaagt 3960 

aatttattgt ttttgtttga aatttgtatt ttattttgtt ttttttattt taaatgatat 4020 

ataagttagt tgtttttttt gttggggttg gtagttattt tgggtatgag tttgttgttg 4080 

gtttgaatta aaagttttgt tttggttttt aaaggttttt ttgttttttt tggttagagt 4140 

HHH2 taa a 99tgaaagg ggttggggag ggttgagtta agtagattta tgggtggttt 4200 

tttttttggg agtttttatt tttggttaat tggttgtgag tgggagatgg tatttttttt 4260 

tgggagtggt tggtatttta tttttatttt ttttagttta gtttatagtt ttttataagt 4320 

ttttgttaaa gatttttatg a y 434 1 

<210> 404 
<211> 4433 
<212> DNA ' 

<213> Art-ificial Sequence , 
<220> 5 . ! 

<2?3> chemically treated genomic DNA (Homo sapiens) 
<400> 404 

' i 

- ****** 
m . 4 * * 

SSSSS 3 !* 9 ^Stttatag tttttggggt ttaattttat taggtaggag gagttgagtt 60 

2iHSS taa HS?? 9ttta a 9gttagtga tggtagt gag gggtagagtt tagggtaggg 120 

^aaat-tagag-tttttaggar-gtgg ttag t a g ag tt t t g tt "t ttgttgtag tgS ggqtgg ISO - 
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gggtttggtt gttatggttt ggtttttatt taagatggtt gttgaggagg tttgtagaga 240 

ggtattagga tagtaatggg gtagtgttat ttttgtggtt ttttgagtgg aaggaaagga 300 

gttttttagg ttaggtattg tttttttggg taggagtgaa gtagggggga tgtttgtttt 360 

ttttttgatt tattttattt tttggagttt tggtagtgaa gtggaggtgg agataggtgg 420 

tgttagggtg tttttttgtt aggttgtagg ttttgtgggg taataggata ttggtttttg 480 

gtaggtgggt gtggggtttt tgggtttgat ggggagaggt tggggttggt ggtggtggat 540 

ggatgttggt ttgggagata gagtttgttg tttttggaaa ttggatttgt tttgtttggg 600 

tgttgtgggg agtgtgtggg atgtggtgga tggggttgtt gttttggatg tttatgtgtg 660 

tggttttttt ataggatttt tatttgttta ggagtttgta tttttggtgt tgtttgtttt 720 

ttgtgttttg tagtttgggt tttagatgga aggttttggg gttaagggtg gttggagtag 780 

ggaggttttg ggggtttttg tttttggagg aggttggttt tgggttagta gggaggtttt 840 

gggtatttgg agttttggtt tagttttaga ttttatgtgt ttttggttag agagtttgtt 900 

ttaaggttgt ttatttttgt tttaagggtt gggtgttata ggggtttatg atgttggtga 960 

ggttggggtt gatagttttt gagataatag ttttgttgtt gatttggggt ttatttggtt 1020 

tttggagtaa gtaaagttgg ttgtagttgg ttatagttgt gtgttttttt ggagttgtga 1080 

gttttgtttg tgttttgttt ttttgtttgt agtttttggt ttggggtttt tatttgaagt 1140 

tttgattttt tttttttttt tagatttttt gtttttggag gttatttagg atttttttgt 1200 

gggtgaaaat tttagagtgg tgttagtttt aagtttagtt ttagtttaag ttttaatttt 1260 

agttttagtt ttagttttag ttttagtttt ggttttattt tttagtttag tattaatttt 1320 

agttattatt ttaattttat tttttatttt agttttagtt ttagttttaa ttttagtttt 1380 

aattttaatt ttagttttag ttttaagttt agttgtagtt gtaattttag ttttagtttt 1440 

agttttagtt ttaattttta tatttttatt tttagtttta attttagttt taattttagt 1500 

tttaagttta gttttaattt tagttttaat tttagttgta tttttagttt taattttagt 1560 

tttaatttta gttttaagtt tagttttaat tttagtttta attttagttg tattttttgt 1620 

tttagttttt attttagttt taattttaat tttagttgta ttttttgttt tagttgatgg 1680 

gttaaagttt taggagagtg tggttttttt taggtgttat taggaggggt aggttttagt 1740 

tagttgggga aattttattg ttatggtttt tagaagttat ttttttttta ggtaagatag 1800 

gttttaaggg agtgttttga gggtggtttt tgggagtttt gtgggtttta gtatttttat 1860 

atattttgag gatagatatg gttttttttt tttagtgggg atagttatag ggataggtat 1920 

agggggtttg gttgaggttg gaggttagtt atagttaaga agttttgaga ttaatggatt 1980 

gtttagttta gatttttttt taggtttaag aggagaggga attagggaga aaagtttaga 2040 

tttgttgttt taagttgtga tgtttagggg ttttgtatag ttttttgtgt agaggttgtt 2100 

tggttttgtg gttttttttt ttgtgaggtg tttatagttg gtattttagt tatggaaatg 2160 

tagtaggtgt gaaattgttt tgagtttgga gtaggaggtt aggttggttg ttttggggga 2220 

tttttaaggg gaggtgttgg gggagggggg tagttttgtt ggttgtagtg gggtgtttat 2280 

ggaaaagtag gaggaggttt ggaagtttat ggtgtttttg tagaggtttg ggggttgggg 2340 

ggtggtggag gggttttgga agtttagttt ttgggttttg gagtttgtta tggtggtagt 2400 

tttgtggtgg tggttggatt tgggtagttg tttggatgtg ttggtttttg tttagtagta 2460 

tggagagttt ggtttggtgt aggagattta tgtgttgatg agtgataatt tgttgtgagt 2520 

gttgggagat ttgtgttttt attgttggtt gagtgtggtt gattgtgagt gtatttttag 2580 

tttgtggatt ggttggggtt gggtggtgtt gggtgttttt gtattgttta gtttttatta 2640 

ggggggttgt ttagggtttt ttaggggttt ttgtggtgag gagttttttg tggtggtttt 2700 

tgtgtttttg tttttatttg tgtatttgta tgtgtttaat ttttgggaga atatttggtg 2760 

gtttttgatt taggtgtttg aggaggtttt gttttggggt tgtggttttt gtattatgta 2820 

taattatttg tttttggtgg ggggtatttg tggttttggt gttaaggttg tttgttttaa 2880 

tgaggttttt tgttataatt ttttgattaa tatttggagt taggtttggt ttatgtagta 2940 

ggtttgagtt tagtttaagt tggtggtttt ggatgggttg ttttatgtta ttggtggtga 3000 

atgtttgtat agtatggagt gttatgattt gtgaatagat gtttggattt tatgtgtgtt 3060 

attttttgta ggtatttttt ttgtggttta tgaggttgtg gtttgttgtg gggatattta 3120 

tgttattggg ggttattttt tttattgttt gtttaggtat agttttgtga aggatgtttg 3180 

ggatgagtgt ttatatagtg ttagttattg gtgttttagt gatattgtgg tattgggggg 3240 

ttttttgtat tgttttgatt tgttgtgggg tgtgggtgtt gttgtgatgt gttataatat 3300 

agtgattggt ttttggagta gggttgtttt tttgtttttg tttgtttttg ttttattgta 3360 

ttgtattatt ttgggtaata ttatttattg ttttaatttt taggttattg ttatttttat 3420 

ggtttttggg gggattgttt agttttaggt taaggagttg tagttttttt ttttggggag 3480 

tattggggtt tttagtttat ttattttgat tttgtttttt gaggattggt tgtagatttt 3540 

attttgagtg gtaggtagag aattaaagtt gttttgttgt tttttaggga gatttttttg 3600 

ggatgggttt gagaggttgg ggtttaggga aggggttggg attggaattt tttgtttttg ,3660 

tttttggata attttttttt tttgttttaa aggttgttga ttattttgaa gtttagattt 3720 

ttttagtttt tttttgtttt ttattttata tttagattgt tttttgattt aattttgtat 3780 

ttatttatag atttttttag tttgttgata tttttttgtt tgtgggattt tttatttttt 3840 

agagttaggg attgatgtgt ttttatagat aaggatttgg tttgttggag ttttgttgag 3900 

ttgagagagg agggggtaga aaatatttat attttttatg ttttgttagt aggataggga 3960 

gtaaaaatgt ttttaggtaa tgtttttgtt tttgggattt tttgtttgtt ttaaggtt;tt 4020 

tttaggtatt aattttgtag ttatttgggt ttgtttggta ttgtggattt ttaagggttt 4080 

agiaatttttg tttttgaaat tggtttgttg gtgtagtttt gttgtttgta gtttttgttt 4140 

atatt1:ttag tttatattag gttaglgttta ttttgggttt attatttttt gtagtttt-gt 4200 

ggggtttttt tagttttttt agaagtttat tttatttttt gttaattttt igatttttaaa 4260 
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tggttttgga 
tttttatgat 
tgttgggatt 

gtttggggaa 
tttgtttttt 
ttggggtagg 
atgtggttgg 
ttagggttgg 
ttgggattgg 
gggttggagt 
aggttggagt 
gtgttgagtt 
agatttgggt 



attgtgtgag 
tttattgttt 
tatagttaaa 
ggtgagaagt 
ggtgagtttg 
agtagggttg 
tagtgttaaa 
agaaagtttt 
gagtatagga 
gagttaagtt 
atagataggg 
gaaatagttt 
taattgataa 
gatrtttagtt 
agagtagtga 
tttagggggt 
ttgtagtttt 
ttgggggttg 
gggtaggggt 
ggtggtgttt 
gttgttggaa 
gttgtatttg 
ggttatagtt 
ggtgtttgtt 
gagtagtttg 
ttggttttag 
ggtattggag 
gtagttttga 
ggggttgaat 
ttttttgtta 
tatgaggatg 
gttggttgtg 
gtttattagt 
tagtatgttt 
ttttagggtt 
ttgtagaaat 
gttggtaggg 
tttgattttt 
tattggttgt 
gggttgtgtg 
ggtttttttt 
gttttttggt 
tgtttttatt 
tatggggttt 
ggggtggttt 
ttggtagtgt 
ggatgtggtt 
ggttggggtt 
ggatgtggtt 
ggttagggtt 
tgtggttgta 

tggggttggg 

gggggatgaa 



gtgtttatag 
atattttata 



seq__prot.txt 

tgaggtttga gtgttatttt agttttgttt ttttttagtt gtgtagattt ggatqaqata 
tttgattttt tttttttttt gtttataaaa tgtggatagt ggatgtttgt tatttaagag 
agttgtggga gataagatta tagttatgag tattttgtat ggtgtttagg atg 

<210> 405 
<211> 4433 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 405 

tattttggat 
gtgatagatg 
tttaagttta 
attgggggtt 
tgtagagggt 
agttgtagat 
tgagaattta 
taggataggt 
tttgttgata 
gagttttagt 
agggagtttt 
attgagttag 
ggttttaaaa 
aggaagtttt 
gtttttttgg 
gtagttggtt 
aggggaaggg 
tggtagtgat 
gggtgggggt 
agtgtattat 
gtgttatgat 
tttttatggg 
ttttatggta 
gtggggttta 
tgatggtata 
tgggttgaat 
agatggtttt 
tgtagagatt 
tgtttttttg 
ttgtaggagg 
ggttgggtag 
tgtgtttgtg 
gtaggttgtt 
gggtaaaggt 
ttgtggtggg 
ttttgggttt 
ttttgttgtg 
aggtggttgg 
gtagttgggg 
tttgtgtggg 
tttttttttt 



4320 
4380 
4433 



aaggggtaga 

ggggtgggtt 

gagtgggttt 
tattagtgga 
tagatttaga 
agaggtgagg 
agtgtgaatg 
tttgtttgtg 

gggtgttagt 

gggtgtggag 
tttttaaagt 
ttttttttga 
agtagttttg 
agagttagga 
ttggtttgga 
aggtagtaaa 
agggaggtag 
atgttttgta 
tgttggtggt 
agtaggtggt 
ttgtgggtta 
tgtgggttgt 
tttagggtta 
atgttggtta 
ttatggatgt 
agtggggttt 
atatgtaggt 
tgagggtttt 
tttagtattg 
tttagttggt 
gtgtaggttt 
aggtaattgt 
tgggagttgg 
attatgagtt 
ttgttttttt 
tgttttgagt 
gagtgttttg 
gggtttttgg 
ttttttaggt 
tgtggttgat 
gaagaagagg 
ttgggaattg 
ttaagaatta 
ttggggagag 
ggggttgggg 
ggagttggag 
ggggttaggg 
ggggatgggg 
gttggatttg 
gttgaggtgg 
gttagggttg 
tttflflggtSfl 



ttgtgatttt gttttttata 
gataaggaga aagggaagtt 
attagggtga tgtttaggtt 
tttggagggg ttgggagagt 
ggtttggtgt gggttggggg 
ttagttttag aggtaagggt 
tagttatggg gttggtattt 
gtgttgtttg gggatgtttt 
ttttttattt tttttttttt 
gagatgtatt agtttttggt 
aagttgagag ggtttgtaag 
tgaggggtag aaagaggttg 
agaaggggaa agttgtttag 
gttttggttt tttaggttta 
gttttttgtt tgttatttag 
tgaatggatt gaggattttg 
attgggtagt ttttttagag 
tggtgttgtt tagggtggtg 
ttttgtttta ggagttggtt 
gtaggttgaa gtggtatagg 
tggtattgta tgggtatttg 
agaagaggtg atttttggtg 
tagggaaggt gtttgtgggg 
agtattttat gttgtatagg 
ttagtttgag ttgggtttgg 
gagggttgta gtagaagatt 
tttttgttag aaataggtag 
ttttgggtat ttgggttagg 
gtgtaggtag aggtagggat 
tggggagttt tgagtggttt 
tttggttttg gttggtttgt 
ggtagaggta tgggtttttt 
tttgtgttag gttgggtttt 
ttaggtttag ttgttgttgt 
gtttttgggg tttttttatt 
tttgggtttt tttttgtttt 
tttttggtgt ttttttttga 
ttggggtgat tttgtatttg 
tagaggagga ggttgtgggg 
gtattgtggt ttggggtagt 
«gggggagg gtttgggttg 
ttttagtttt aattaggttt 
ggttgtgttt gtttttagag 
tttttgggat atttttttgg 
tggtaataag gtttttttag 
ttatattttt ttgaggtttt 
ttggggttga ggtgggggtt 
ttgggtttag ggttggggtt 
ttggagttgg agttgggttt 
gtgtgagggt tgggattggg 
gggttggggt tggggttgaa 
gggatgaggt tgaggtggtg 
gggttaggat tgaggttgga 
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atttttttgg 
aaatgttttg 
ttatttagag 
tttataaggt 
tatgggtagg 
tttaggtttt 

ggggaggttt 

tgttttttgt 
tggtttagta 
tttagggaat 
taggtatgga 
agggagtttg 
aaataagagt 
ttttaggagg 
agtgaggttt 
gtgtttttta 
attgtgaagg 
tagtgtagtg 
attgtgttgt 
aagtttttta 
ttttaagtat 
atgtagatgt 
agtggtgtgt 
tatttgttat 
gtttgttgta 
ttgttggagt 
ttgtgtatgg 
ggttgttagg 
ataggggttg 
ttttggtaga 
aggttgagga 
agtatttgta 
ttgtgttgtt 
agggttgttg 
attttttaat 
tttgtgagtg 

gggttttttg 

ttgtgttttt 
ttaggtggtt 
gggtttagat 
ggtgatttgt 
tttgtatttg 
tgtgtggaag 
ggtttgtttt 
ttggttgaga 
gatttgttag 
ggggtagggg 
gggattgggg 
agggttgggg 
attggggttg 
gttggagttg 
gttgaggttg 
gttgaagtta 
_tt_tatggaag_ 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
240— 



seq_prot.txt 

gattttgggt aatttttagg ggtaaaaagt ttgaaggaga ggggagagtt agggttttag 3300 

gtgggaattt tgggttggga gttgtgggtg gggaggtgga gtgtgggtga ggtttatggt 3360 

tttgggaggg tgtgtggttg tggttggttg tagttggttt tgtttgtttt gggggttggg 3420 

tgggttttag gttagtggta ggattgttat tttgggagtt gttagttttg gttttgttgg 3480 

tattgtaggt ttttgtggta tttggttttt gggatgagag tgggtgattt tgaggtggat 3540 

tttttagtta ggggtgtgtg gagtttgggg ttgggttaaa gttttgggtg tttgggattt 3600 

ttttgttgat ttagggttag ttttttttgg gggtagggat ttttgaggtt tttttgtttt 3660 

agttattttt ggttttagag ttttttgttt gagatttagg ttgtagggtg tggggagtgg 3720 

gtagtgttgg gagtgtgggt ttttgggtga gtgggggttt tgtggggagg ttgtgtgtgt 3780 

gggtgtttag ggtggtggtt ttgtttattg tgttttatat gttttttgtg gtgtttaggt 3840 

ggagtgggtt tagtttttgg gggtggtggg ttttgttttt tgagttagtg tttatttgtt 3900 

gttgttggtt ttggtttttt tttattgagt ttgagggttt tgtgtttgtt tgttaggggt 3960 

tagtgttttg ttgttttgtg gggtttgtgg tttggtgaga ggatgttttg atattatttg 4020 

tttttgtttt tgttttgttg ttagggtttt gaggagtgaa gtggattagg aggggggtgg 4080 

atattttttt tgttttgttt ttgtttagga gggtagtgtt tggtttggga ggtttttttt 4140 

tttttgtttg gggggttgtg ggggtggtat tgttttgttg ttgttttaat gtttttttgt 4200 

gggttttttt gatagttgtt ttgagtggaa gttgggttat ggtggttagg tttttgtttg 4260 

gtgttgtagt agggataggg ttttgttggt tgtgttttgg gggttttggt tttttttgtt 4320 

ttgggttttg ttttttgttg ttattattga ttttagattt tggttgtagg gttagtttag 4380 

ttttttttgt ttgatgaggt tgagttttga gggttgtgga tattggtttt gtt 4433 

<210> 406 
<211> 4494 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 406 

attataaatt agtattatta taaaggtaag atgattgtga ttaggttaaa tttttaatag 60 

agggaaataa aatttaagag gaagttagat ttaaaaataa ataaataaat aaatattagg 120 

gtagtgaaat attatataga ttagggtttg ttaaatttat atttttatta atatattagt 180 

taaaattatt aatttagata ataattatgt tttttttgat agttaaggtt ttattattat 240 

tgtttttatt tatttttttt ttatagttat ttgtagtaga tttatttttt aagttttata 300 

aaagatagtt aatttgtaat tatatttttt tagtggatat agagagaatt agataaatta 360 

tttttaaatt ttagtagatt gaattattta tttttttatt aatattttgt aaaatggtat 420 

aatgtgaatt ttatttttat tttttaaagg agttaataat agttattgaa aaggttttaa 480 

aagatggtag aaagagtata ttaatgtaat taaatagatt aaaaattaaa tttgaatttg 540 

tttatatgtt tttttttttt atatagaaag aaatattgtt taataaagtt atatatttaa 600 

gttttgataa aagttttaaa aatatggtta ttatatgttt ttaatttttt aaatgtatgg 660 

tttttaattt gtagtaagat tatttgggaa aattggaata attttataaa taaatgtaag 720 

taagttatga gagaaattag aattttttta gttaagtgta tttttttaaa atatggaaaa 780 

tgaaattttt taatttttaa aataatttgt tttaattagt tttaaggtga tttttgagtt 840 

atttgtgggt aaaaattaga agagaaaaaa gtttaatgat tttgtaaatt agaaattaaa 900 

aaattaatgg tgtgatagtt tgtatagttt tagtggaatt ataggaaaat atttgtttgg 960 

aaattgtttt tttttttttg gaaaattatt aggaggttgg gggaaaagag gggaaaaatg 1020 

ggtaaggaag gttgtaaatg ttaagatgtt aaataggaat ttaattgaag tttgaagtga 1080 

aatatagttt tttttggttt aattgttatg taaataattt ttaagtattt ttagatgtaa 1140 

agaaaaatag ttattgtgta taggtgttag ttgaatgttt gagagttgga tgaatgggga 1200 

agatttgtta tatatttagt ggtgggtgga ttttattagg ttgtagtttt tatatagttt 1260 

ttttttgttg tattgtagag gtgttatttt ttttgtttta atttttattt aatttgattt 1320 

ttttatttgt attttaaaaa aaagattttt tggatttttt ttagtatttt gtgtttttag 1380 

gggaattaaa ttagagtttt tagagttgtt agggaagagg ttgtgtttta attagataaa 1440 

tgtttttttg gatgtatagg atagagtttg tgttttttta ttatttgtta ttgttttttt 1500 

tagatgtata ttttattttt tgttttattt tttttttgtt aaggatttta tagtttagta 1560 

tagaattaag ttttggaggt gatttgaggg tgggaagtag aggggttttg aggttttgtt 1620 

agaagttttg gtagtgtttt tgtttttttt taggtggatg tgtagttttg ggaagggagt 1680 

tgggtgggag ttttggtggt gtattgagat taatgtggtt tggtttatgg tgagggtggg 1740 

gtgttgtggt ttttgttttt gtggattttt tgttttttag atagagttta gggtgtgagt 1800 

gaatttttgg gatgtttaga tttggggaag gtgtggggtg gttatgggtt tttgattttt 1860 

tttatttttt gtaagttata taaaagggag atgtggatat gtataaagtt tgtttgtgga 1920 

aggtggtgtg tgattttgat gttggtttgg gttagggttg tatttaggtt gttgttgtta 1980 

gtagttgtga aaagaggttg gagtaaggaa gggggatggg ggtgagagag gaaggtgaaa 2040 

tgaggtggag aatgtaggga aaagtgaggg gtttttagtt ttttgggttg ttttgatttt 2100 

tttttttttt ttttagggtt taagagaaag gaaagggtaSa tgatttaaga gtgaaggatt 2160 

ggttttaggg atgttgtgtt tttggttttt tttatgtgtt tttttttttt ttttttttgt 2220 

Seite 379 



seq_prot.txt 

ttgaatgttt tttgtttttt ttgatttttg tttttaaggt agggattaag ttggggtttg 2280 

9 g "* aggg ! ^""" at t " tt "9tt ttgggttggt gtggaattag ggagittagt 2340 

gttttgtttg ttttttttta gtgggttgga gtgtgatttt ttggggtagg gttgggttga 2400 

a *2H!8S fla J ?*?ttggatg tt1: 9taagtg ggggtggaga ggagaggggt tttattgatg 2460 

gatttttggt ttttttgtat taaagatttg ggttaaggtt ttggggggta ttgtgttttt 2520 

attgtgtagt ttttttttgt tttttgaggt gtggtattta tttttagtgt ttgagttgtt 2580 

taggtggtta gtaggagtag tgttaaattg ggtagtattg tgattttgtg tggggtattg 2640 

agtgtgttgt tgtgtgagtg ggatttgttg tgtttttgtt: ttgtttgtgt tgttattgtt 2700 

gttgtttttt ggggtttttg tgtatgtttt tttgtgtgtt tttgtttatt gttgttgagg 2760 

aaat 5§ atgg agtttgagtg tggtggtggg gtttagagtt aggtgagtta gttgatttgg 2820 

"* a H5 t9t tt 99tatttg agagagattt ttagtggtgt tgttggggaa ttgtgtttgt 2880 

ttgtgttggg aggggttttt gtgttttgtg tttataggtg tatgtgtttt tggtgttgtt 2940 

tgtattttat gtgttttttt tgttttttgg ttgatggttt atttgggttt tgtgaatagt 3000 

gggagggaga gtttggggtt aggagaggga tggtgtagga ttagggaaag gtgagtttta 3060 

99"fttgag gttttagaaa agttgagagt gtttttgttt gtttttgtga gtttgaatta 3120 

tttgattttg taggtttttt gggggtgttg tataaaggat tgttgttagg gtgtgttttt 3180 

atttgtattt tgtttttttt tttggtttgg agaggtgggg taggtgtttt tggaagagaa 3240 

tgagaatgag tgaagtttaa aggaaatagg attttttttg tttgttgaga tagtaaaatt 3300 

ttatttttta atttttaatg ttaaaagtag gtatgagtaa tttggaattt ttatttttag 3360 

aatgaattaa aggagtaagg tgtaggattt ggtaaagaag tggttaattt atttattttt 3420 

ttttgaaagt agggttttgg gtttgggttt tgtttttttt ttatattttt gtttgttttt 3480 

SSSHH" 1 J9ttaaaggg ggtggttagt gtggattttg tggggtgggg ggtttttttt 3540 

t S3 tg iJ aag tagggatgta atatttattt gagttgggga agaaggaaaa attaggagag 3600 

attatttgta tttgattttt gtaaatgagg atttttgatt ttaaatgttg tttttgtttt 3660 

ttattgtgtt tgttttgaat tatagaaatg aattttttgt tatgttattt ttttttggat 3720 

taaatataaa ttgtttattg tttagttagt gtttagatag tttagaatgt tttagaattg 3780 

tttagatata ttttttgttt ttgatttgaa gtagtattta gttattaagt tatagtttat 3840 

tttatatagg gtttggagtg aaggattgaa gttagggagt gttttggttt ttttgagggt 3900 

tgtattgtag tttgtttttt tttttttgtt tattgtgttg ataagggtgt ttaggtttgg 3960 

gaaggatggt ttagttagtg gggtatagat tttttttttg ggaagttatt tttgtattaq 4020 

tttttttaaa tgttgttttg tgtgtttttt tttgtattag gtatttagtt ttgatttttg 4080 

gattattttt gtttttgaat gttttttgga tgtttttaat aaataaggat aaatatttat 4140 

^■"atgtag tatttgttta tatgttttat gttatttatt ttttataaaa gttaaagagg 4200 

ttgttattat tatttttatt tatatattag gaaaatgaag ttaatgaatt tttaaattta 4260 

atagtaagta agtgatatga agttgggata gtagggaaaa gtttaaagtg ataggataaa 4320 

tttagttgtt tattttttgt attatattta tttattgtaa ttttgtattt gttttgggtt 4380 

tgtatgagta aataaggtaa aaaagaaggt atttgaatgt aaagagaaat gtggtttaaa 4440 

gttataaatt ttgtagagtt tattgtaaaa tgtaaatgtg agattttttg ttat 4494 

<210> 407 
<211> 4494 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 407 

gtaataaagg gttttatgtt tgtattttgt agtggatttt gtaagatttg tagttttgga 60 

ttatgttttt ttttgtattt agatattttt ttttttgttt tatttgttti tgtagatttg 120 

§5 a J aaa H5 99 aa 3H tgg tgggtaaatg tggtgtagaa agtgaataat tgggtttgtt 180 

ttgttatttt aggttttttt ttgttgtttt agttttatgt tatttatttg ttattagatt 240 

£999 a 9*" a ttagttttat ttttttgatg tataaatagg aataatagta atagtttttt 300 

aggaagtaaa tgatatgaag tgtataaata aatattgtat gataataaat 360 

atttgttttt atttgttgag gatatttaaa ggatatttag gggtaaaagt aatttaagag 420 

ttaagattga atgtttagtg tgggaaaaga tatataagat aatatttagg ggagttggta 480 

tagaaatgat tttttaggaa ggaagtttgt attttgttgg ttgagttatt ttttttgggt 540 

« a iH^izr "gttagtgt aatgagtaag ggagagaagg taggttgtag tgtagttttt 600 

agaagggtta gagtattttt tggttttagt tttttgtttt aagttttgtg tggagtgggt 660 

* aat B aat 9 ttattttagg ttaagagtag gggatatatt tgggtagttt 720 

tagagtattt taaattattt ggatattaat tggatagtgg atggtttata tttaatttaa 780 

SStSSS ??aIIS a ? aa SSSSSS * a?aat?tag aSEHS Ultlllitl 38 

?28?ff?i »*2H w i at tta ^9ggggi: tgaatatgaa tgattttttt 900 

SJSSSS HHP*"* a J t ^ a 9 a tgg atgttatatt tttgtttaat aataaaaaaa 960 

gattttttgt tttgtaaaat ttatattgat tatttttttt aataaaatraa aattaaaaa? in?n 

aaataaaaat ataagaaaga aataaaattt aagtttagaa ttttgttttt SSlaSS 1080 

aaatgggt-tg-gt-t-gtt-tt-tt-tgt^^^^^ g|g- 
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atagaaattt taggttgttt gtgtttgttt ttgatgttgg gggttaaaaa atgaggtttt 1200 

gttgttttaa taagtaaaga aaattttatt ttttttaagt tttatttgtt tttatttttt 1260 

tttagaaatg tttgttttat ttttttaaat tgagagaaaa aatgaaatgt ggataaaaat 1320 

gtattttagt agtagttttt tatatgatat ttttgggagg tttgtggggt tggatgattt 1380 

aagtttatgg ggatgagtag gagtgttttt gattttttta gagttttagt gttttaggat 1440 

ttattttttt ttgattttgt attgtttttt ttttggtttt agattttttt ttttattgtt 1500 

tatgaagttt aggtgggttg ttggttgggg agtggagggg gtgtgtgggg tgtaggtggt 1560 

gttaagggtg tgtgtatttg tgggtgtggg gtgtgagggt tttttttggt gtgagtgggt 1620 

gtagtttttt ggtggtgttg ttaggggttt tttttgggtg ttgagtgggg tgggttggat 1680 

tagttgattt gtttggtttt gagttttgtt gttgtgtttg ggttttgtta gtttttttgg 1740 

tagtggtagg tgagagtatg tggaggagtg tgtgtggggg ttttgggaga tggtggtggt 1800 

ggtggtgtgg gtagagtaag gatgtggtgg attttatttg tatagtagtg tatttggtgt 1860 

tttgtgtagg gttgtgatgt tgtttggttt ggtattgttt ttgttggttg tttggatggt 1920 

ttgggtgttg gaggtgggtg ttgtgttttg gaaggtgggg ggaggttgta tggtggggat 1980 

gtgatatttt ttaagatttt aatttaagtt tttaatgtag agaagttggg ggtttgttaa 2040 

tgggattttt tttttttttg tttttgtttg tggatgttta gtgtattttt gtttttggtt 2100 

tagttttgtt ttagggagtt gtgttttggt ttgttgagag ggagtgggtg aggtgttggt 2160 

ttttttggtt ttgtgttagt ttggggtgag aagggtaggg ggtgattttg agtttagatt 2220 

ttgatttagt ttttgttttg gaagtggggg ttgggggagg tgagagatat ttagataggg 2280 

gggaaggggg aaggagtatg tggggaaaat tgaaaatgta gtgtttttaa agttagtttt 2340 

ttgtttttga attgttgttt tttttttttt tttgggtttt ggggaggagg aggaggagtt 2400 

gggatagttt aggaagttag gagttttttg tttttttttg tgttttttgt tttgttttat 2460 

tttttttttt tatttttatt tttttttttt gttttaattt ttttttgtgg ttgttggtgg 2520 

tggtagtttg ggtgtgattt tagtttgagt tggtgttggg gttgtgtatt gttttttata 2580 

ggtaaatttt gtgtatgttt gtgttttttt tttgtgtaat ttgtgagaaa tgggaggggt 2640 

tggagattta tagttatttt gtgttttttt taagtttgga tgttttggga gtttgtttgt 2700 

attttaagtt ttgtttgaga ggtagaaggt ttgtgggaat aaaagttgtg atattttgtt 2760 

tttgttgtgg gttgagttgt gttagtttta gtgtgttatt gaggtttttg tttggttttt 2820 

tttttggggt tgtgtgtttg tttgggaggg ggtagagatg ttgttggggt ttttggtgga 2880 

gttttggggt ttttttgttt tttatttttg gattattttt gaggtttaat tttgtgttgg 2940 

gttgtaaagt ttttggtggg gaggaaataa ggtggagagt gggatgtgtg tttggaaagg 3000 

gtagtggtgg atggtgggga ggtatgggtt ttgttttgtg tatttaagga ggtgtttgtt 3060 

tgattaaggt gtggtttttt ttttggtagt tttggggatt ttggtttagt ttttttgggg 3120 

gtataggatg ttggggaggg tttgaagggt ttttttttta gggtgtagat aaaaggattg 3180 

aattgagtga agattaagat ggagaagatg gtgtttttgt agtgtagtaa agaaaagttg 3240 

tgtggaggtt gtagtttagt gaaatttatt tattattagg tgtataataa gtttttttta 3300 

tttatttaat ttttgaatat ttagttaatg tttgtgtata atgattgttt ttttttatat 3360 

ttgaagatgt ttagaagttg tttgtatgat agttgggtta gaggaggttg tattttattt 3420 

tggattttgg ttgggttttt gtttggtatt ttaatatttg taattttttt tatttgtttt 3480 

tttttttttt ttttttagtt ttttagtaat tttttaaaag aaaaaaagta atttttgggt 3540 

agatgttttt ttgtaatttt attgaggttg tatagattat tatattatta gttttttagt 3600 

ttttaatttg taaggttatt gaattttttt tttttttgat ttttatttat aagtggttta 3660 

ggaattattt tagggttagt taagataggt tgttttaaga gttaaagaat tttgtttttt 3720 

atgttttaaa aaaatgtatt taattaagag aattttgatt ttttttatga tttatttata 3780 

tttatttgta gaattatttt agtttttttg aatggtttta ttatagatta aaaattatat 3840 

atttaaggaa ttaaaagtat atgataattg tatttttgaa gtttttgtta aaatttaaat 3900 

atgtaatttt gttgagtaat attttttttt gtataaaaag aggaagtatg taggtagatt 3960 

tgaatttagt ttttggtttg tttaattgta ttaatatgtt ttttttgtta ttttttggga 4020 

tttttttagt agttattgtt aattttttta aagggtggag gtgagattta tattatgtta 4080 

ttttgtgaag tgttggtaga agggtaggta atttagttta ttgaggtttg agggtggttt 4140 

atttggtttt ttttgtgttt attgaaaaga tgtaattatg agttgattgt tttttataga 4200 

gtttagaaaa taggtttgtt gtaggtggtt gtagaaaaga aatgaataag ggtaatgata 4260 

atggggtttt ggttattggg ggaaatatgg ttattatttg ggttgatgat tttggttggt 4320 

atgttgatag aggtgtgagt ttaataaatt ttaatttata tgatgtttta ttattttaat 4380 

atttatttat ttatttattt ttaaatttag ttttttttta agttttattt ttttttgttg 4440 

agggtttaat ttagttataa ttattttatt tttgtggtag tgttaattta tagt 4494 

<210> 408 

<211> 4489 
<212> DNA 

<213> Artificial. Sequence 

■ • ■ ... ^ 

<220> 

<223> chemically treated genomic DNA CHomo sapiens) 

: '* ■ • . • - • . • 

<400> 408 '\ t ;'*. ' - , 

taaaaagatt taaaaagata at^tataaaa gWtttaagaa ttaaaaagat aattttagaa '60 
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agaggaaatt 
ggtagaataa 
aaaataattt 
atatattggt 
tattttataa 
agtattgttt 
aattgtggtt 
taatttaata 
agaaaaagat 
taaaatgaat 
aaagtagtaa 
aaagggatga 
tttttgattg 
tattgatgtt 
tgtagttaag 
ataattggtt 
taggagttta 
gggttaagta 
gaaaattagt 
tggtttatgt 
aagagattga 
aaattagtta 
gagaattgtt 
tgtttgggag 
taatagaatt 
aatatataaa 
tttatattga 
tattaatgta 
agaaatttat 
aagaatagta 
aaggggaaat 
ttatataatt 
taagatatgg 
ttagaatatt 
atgtgtgtga 
agttttttgt 
tatatttgga 
ttgtatttgt 
ttgtttggtt 
tatagttttt 
tttgtagtaa 
aagaggggaa 
tagattttat 
ttattttttt 



taaatagttg 
attaaaataa 
ttggtgagat 
gtatttattt 
aaatttatgg 
ataattatta 
tatttttatt 
taagaattag 
atataatgat 
ttaaattgtt 
atgattgtta 
ttaggaatag 
tggtagtgtt 
tgtatgttta 
gtatattaat 
atattataaa 
gttttgattt 
atgatgtata 
aattttaatt 
ttgtaatttt 
gattagtttg 
ggtgtggtgg 
tgaatttagg 
atagagtgag 
aagtattaat 
gatgatattt 
ttgtatgaaa 
aatttataaa 
tatttttagt 
attaaagaaa 
tttgattatg 
ttatattttt 
agggaagttt 
agggagggtt 
tgtatttggt 
tttgtagtta 
ttgttatgtt 
taaggatttt 
ttggtttttt 
tattttttta 



gtaagttttt 
taataagata 
agtaaaagta 
tggaaaaggt 
taatgtatat 
aaaatggaat 
aagatttttt 
agttaaatgt 
atgtaatata 
taggaatgtt 
taaaaattag 
aatggagagg 
tatatagagt 
aatattaaat 
tgttttgttg 
gatagataat 
tattggatat 
aatggggttt 
gatagtgtat 
agtattttgg 
attaatatgg 
tatatgtttg 
agatagaggt 
attttgtttt 
ataataaata 
ttttaaaata 
ataatgaaaa 
ttttttttta 



seq_prot . 
gaaaaagtgt 
aagttatatt 
aagggattta 
ataattagtg 
taagaatata 
ataatttaaa 
atattatatt 



txt 
ttaaattttt 
tattaagttg 
ttatatattg 
aagttgaaga 
tgtataagaa 
tgtttattaa 
tattatatta 



attaatagag ttatatttta 
taatttattt atatataatt 



gatataggtg 
tatagtgtta 
ggagtttutt 
ttatatttta 
gtatggaaaa 
gtattttttt 
tggttatatt 
gattattgta 
attgtttaaa 
atttaaagaa 
gaggttgagg 
tgaaattttg 
tagttttagt 
tgtagtgagt 
aaaataaaaa 



atgtttattt 
agtgtagtga 
ttgggttgtt 
tttttgtttt 
gtttttttta 
ttgatgatga 
ggataggata 
gttgagtttt 
aggagaagga 
tttaatttgt 
aaattaaaag 
gttgttgttg 
atattttatt 
ttttttttta 
tatttaaaag 
tggttttgat 
ggggtgagga 
gtgaggtggg 
gttaaaagtg 



tatatttttt 
gttttattgt 
ttatttattt 
gattttagtg 



gtttattttt 
ttaatgtttg 
ttgttgttgt 
tattttgttt 
ttgttttttt 
tttttttttt 
agtgttttga 
taattaatta 
tttgaatttt 
agagagaggt 
ttaattttat 
tttttttatt 
gttttttttt 
tttttttttt 
ttttttagta 
ttgttgttat 
atatttagga 
tttttttttt 
tggggggtag 
ttttattttg 

gggggtggtg 

gtgtttattt 
gttttttttt 
taattttttt 
agatgttatt 
atttgttttt 
gtagatttga 
ttggaagatt 



aaaaatagtt 
tttgataaaa 
tttttttttt 
ttttgagaaa 
tatttaagat 
gagaggtaag 
taaaatgtat 
tgagggttaa 
ttttatttat 
ttggtaagtt 
tggtttatat 
tttttgattt 
attattattt 
taggaattgt 
ggattgtata 
tttttttgtt 
attttttttt 
ttttttgttt 
tgttttttgg 
gttatttgtt 
tttttttttt 
agtggtagag 
tttattttaa 
ttttagtttt 
ttttgaaaat 
ttattttttt 
atatagattt 
tgttgttgtt 
gtagttattt 
ttattatttt 
gaaaggtaag 
gggtaatagt 
ttgggggggg 
tgttagggga 
tttatttaat 
ttttttttaa 
taaaaaaaat 
gagtgggaaa 
gtgttttttt 
tflattggttlL. 



gttaagaaat 
attattgaaa 
aggtgttata 
tagtttttta 
agatataaaa 
tttttattat 
ataaagaaaa 
tataatatag 
tttatagaaa 
tttatattgg 

tgggaggatt 
tttttattga 
tatttgggag 
tgagattgtg 
ataaataaaa 
tgaagaattt tagatttttg 
tttttataaa atatattgag 
agaattagta ttgttttatt 
aatattttga gttaatttta 
ggtatttggg aaattaaagg 
tagttttttt aaaattttta 
tttattaatt gtagaattta 
aagtaaaggt tttgtgttgt 
ttttttgttt gttttttttt 
gtatgaaggg tgtaatattt 
atagaggatt tgtttttgtt 
ttgttgaggt tttatagttt 
tattttgatg tagaggtaga 
tatggggtgg gagtggttat 
aagtttgttt tttttttaat 
attttgtgtt attgatgttt 
tttgtataaa agtttgtatt 
tttattgaat tgtttggttg 
taattatttt tggtatgttt 
taaatatgtg attttttttt 
aggaaggagg agggagttgg 
ttgtttttgt attttttgtt 
gaaaatagtt tatttttttt 
agagagggat atgtgtagtg 
taagtagaga gattttagta 

ggggggtatt ttttattttt 

ttttgaattt aggggggtgg 
gagttttgag agggggattg 
ttttatttat gatttttatt 
tttttttggt tgtgagagga 
tttgttgttg ttgttgttgt 
gttattgttg ttgtttggtt 
ttttagtttt tttttttttt 
ttgtgttatt ttttatagtt 
tgtgtttgtg ggggttttat 
agtattaaat tatatatgta 
tgaaggaggg gttggagtag 
ggaagggtat ttttattttt 
atttgattgt gttttgtttt 
tgaggtgtta gggaaagaga 
ttattggaga gttttttttt 
atgttgggat ttggattgta 
ttttttggtt ttgtatgaga 

ttgttLttaag-^gttttattt- 



tagtgttttg 
ggaaaaagga 
ttgttggggt 
taaatatatg 
tatttataat 
tattagatta 
tagattatta 
gttgtatgag 
aagaaatagg 
attaagaaag 
agaattgagg 
gatgttttag 
ttatattatg 
tatttaaaat 
aaagtggtag 
ttttaaagta 
aagatattaa 
gtaaaatgtg 
ttgggtgtgg 
atttgaggtt 
aaatataaaa 
ggtgaggtag 
ttattgtatt 
agaatttatt 
gtttttaaaa 
attgtgatgt 
ataaaagttt 
attttatgat 
tttaaaaatt 
tttatattat 
atattaagga 
agtaataatg 
taaagttatt 
aatatgagta 
tttagataga 
tttttttttt 
tttaggattt 
aagatggagt 
ttttaatttt 
ttggtttttg 
tattaggtta 
tgttttttgt 
tgtgtatgta 
gtttttgttt 
gggtgttaaa 
ataatgtttt 
tttttttttt 
agtgtttttt 
gtagtagtag 
atttttaaag 
ggggaaggtg 
agtttgagag 
tttttatttt 
ggagagaaag 
tgttgttgtt 
ttggttggag 
ttttgttttt 
ttagggaagg 
tgtttttttt 
gtggagtttt 
ggaggggtgt 
atttgttgtt 
agtgggggag 
tagaaagagg 
tttaggaaaa 
ttggaattgt 
tttggatatt 
tttttttttt- - 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140— 
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tgttttgttt gatttttatg gaaaatttta aatttttagt tggtagaggt rtttggttag 4200 

gttttgattt gtattattgt tttttttttg gttaaatgtt tgttttatat attagtatat 4260 

aaagtttaaa gttttaaaat gtttaaagtt taaatgttat tgttattgtt tgtttttata 4320 

tatgtttttg atatttattt tttatatgat gagaatatta gttattttat aattaatttg 4380 

ggtatatttg taatttttgt aatttgtttg aataagtaat ttttattgaa aattttatgt 4440 

tgttttggtt gtatggaagg ttaaattttt ttattaatag tttgttgga 4489 

<210> 409 
<211> 4489 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 409 

tttagtaaat tattgatgga gaggtttggt tttttatgta gttagaataa tataaagttt 60 

ttaatgggga ttatttgttt agataagtta taaagattat aaatatattt agattaatta 120 

taaaatagtt aatattttta ttatatggaa aataagtatt aggaatgtat gtgggaatag 180 

ataatagtag tggtatttaa attttaagta ttttagagtt ttggatttta tatattgata 240 

tatgaaatag atatttaatt aaaaaaaaag taataatgta agttaaaatt taattaaagg 300 

tttttattaa ttaaaaattt gaaatttttt atggaagtta gataaaatga aaaaatggaa 360 

aatgaaattt ttaaagtaaa aattaattaa gttttttaat attgaagtta atatttaaat 420 

tttatatagg attaaaaagg ggaaaatatt taggtttata aatggataag tagttttaat 480 

ataatttaga ttttgatatt tttttattta gaaataaatg tagtggagtt ttttttggag 540 

aaaaagagtt ttttaatgaa ttttttttaa gtgatattta gaaagtgtgt ttttttttgt 600 

tttttttttt ggtgttttat tggaggaagg agggggttgt ttttgtattt ttttttattg 660 

gagtaagata taattagatg ttagatggag gagagaaatt atttttgata atagtaaata 720 

aaaataaaag tgtttttttt tttttgatag gatgggtatt ttattttata tatttttttt 780 

tattttaatt ttttttttat tttttttagt attatttttt ttttattttg gagttttatt 840 

gtgtatgtgg tttggtgtta ttgttgtttt ggagtgaaag ttagagttag aggaaggtaa 900 

tgaagttttt atagatatat ttattttttt tgttttttat ttttgagtgt tttttttgga 960 

gttgtggagg atgatataaa gggtaataaa gggggggagt ggaggaggag gaggtgaagg 1020 

aggaggagga gagttgggga agtggttgtt tttgggtgta gtgagatgtt tttagttagg 1080 

gttaagtagt agtagtagta gtagtagtag tagtagtagt agtagtaata gtagtagtag 1140 

tagtaatagt agtagtaaag ggtttgtgtt gttaagaggt ttttggtttt tttttttttt 1200 

ttttttatgg ttaaagagga ggagggtgga gggagggagg tgagttggag gggtgagggg 1260 

gtaggagttg tggatggggg tttttgaaga agaagaaatt tttttttttt ttttaggttt 1320 

aatttttttt ttagggttta gggttggagg gtgggggaag gaatttagtt gttttttttt 1380 

tatttttttg aatttaaggt tgaagtgggg tagaattagt gttttgtttt tttaaaaata 1440 

gaaataaaag gtgttttttt tttgttattg tttttttttt tattattagt tgttgttgtt 1500 

gttggggttt ttttgtttgg gggggagggg gggtttgggt agaagagatg ggaggtattt 1560 

attgtgtatg tttttttttg atgggtggtt ggttggttga aggtaggagg agggggaggg 1620 

gaggaaatga gttatttttt tagaaggtgt tgaggtgtta gtgatttgga agatattgta 1680 

gtgggaggta taggagtggg ggtggggagg aggaggaagt tgttatgttt ttaatatttt 1740 

tagttttttt ttttttttta agggaaagtg gaggaatgga agtgggtgtg gatggagatg 1800 

aaaggaggtt atgtgtttgg gtgggagagg ggtggggtgg gagaggtagt gtgtgtgtgg 1860 

ggtatgttgg gagtggttgt gtatggtttg tagtggtagg tgaagtttag tagaggatgt 1920 

ggttaggtga tttggtgaag tgatttttgt aggtgttggt tttttttttt gatttggtaa 1980 

atgtaggttt ttatgtgaga ggtaatggtg ggggtaaatt tgttgtaaat gaaggttagg 2040 

ggtgttggtg gtgtaaggtg aattgaaagt gggaggatgg aaggttgtgg agattgggaa 2100 

ttgggaaggg ggtaggtttg tataggttgg gaaggttagg attaggtgag ttttgttttg 2160 

tggttatttt tgttttgtga gtttgttgag gaatttttga taagtgtagg gattttgagt 2220 

ttatttttgt attggggtag taggtgagga gtgtgatggt ttgagtgtaa gagagaaggg 2280 

aa;ttgtgaag ttttagtaat tgatttttat gattgtagga tggaggattt ttatttaggg 2340 

atagagataa gttttttgta tgtattttga ttagatgtat tatatgtgtt gtttatattg 2400 

gatattgtgt tttttatgtt ttatttttta attttttttg gtattttgga gtggttttgg 2460 

ggaaggagta ggtagggaag ttttgagtgg agtttttttt tatgttttgt gttgttgtta 2520 

taatataaag tttttgtttt tttttggaga gggatgtggg attgtgtagt ttttaatgtt 2580 

gagttttatg attgatgaag gagaagggat gtgattaaag ttttttttta tagtgtagat 2640 

gagagtttta aaaggattgt tttgttaagt ttttttggtt attattttta gtttttgagt 2700 

ttttggtttt ttaaatgtta gttgtttttg ttgaaaataa tgaattttta ttataaaatt 2760 

agaattaatt taaaatattt aagaaaggat ttataaattt atattagtaa agtttttata 2820 

gtgaaatagt gttggttttt ttttattgtt tttatataat taatataaag tattatagtt 2880 

ttaatatgtt ttgtaaagat attttgaagg aatattattt ttgtatgttt tttaaaaatt 2940 

aagaatttaa aattttttgt atttattatg ttggtattta attttgttgg gtgggttttt 3000 

tttgtttgtt ttttgttttg agatggagtt ttgttttgtt ttttagatga gtgtagtggt 3060 
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gtgattttgg 

ttttgagtag 

tagtagggat 

atttttttgt 

tggtgtaggt 

ttttgtaagt 

tgtattgtgt 

tttttttgta 

taatgaaaaa 

tttgtgtttt 

gaaaaattat 

gtggtatttg 

tttagtggtt 

ttttttaatt 

attgtatgta 

gggatataat 

agtataataa 

taataaatat 

ttttgtatat 

ttttaatttt 

agtatatggt 

aatttgatga 

agagtttgag 

tttttttaat 



tttattgtaa 

ttgaaattat 

ggggttttat 

tttggttttt 

tttttaggtg 

ttgggtagtg 

atggtaatta 

gtgtaattaa 

aaaaaatatt 

tttttatata 

aaagtgtgaa 

agaagttttt 

aatattatat 

atttatgtta 

agtggattat 

tttattagta 

gtatagtata 

ttggattgtg 

gtatttttaa 

attagttatg 

agattttttt 

atgtggtttt 

tattttttta 

ttttaaattt 



tttttgtttt 

aggtatgtgt 

tatgttggtt 

taaagtgttg 

tatattatta 

gattttgttt 

tgtttgatgg 

ttgtttgttt 

agtaaggtag 

tttagtattt 

ttttatgtaa 

ttttttattt 

tgatttttat 

gtatttttaa 

atattatata 

tatttagttt 

aaaagtttta 

ttttgttttt 

tgtatattat 

ttttttttaa 

atttttattg 

attttattgt 

aaaatttatt 

tttgtgtatt 



seq_prot . 
ttgggtttaa 
tattatgttt 
aggttggttt 
ggattatagg 
gttaaagtta 
atgtattatt 
aattaaaatt 
ttatagtgta 
ttaatgtgtt 
aagtatataa 
gtgttgttat 
tatttttgat 
ggtaattatt 
gtaatttagg 
ttattatgta 
tagtttttat 
atggaaatga 
agtagttatg 
tatgggtttt 
agtagatata 
ttttattaag 
tgttttaatt 
ggttatttga 
gtttttttaa 



txt 
gtagtttttt 
ggttaatttt 
tgattttttg 
tgtgagttat 
ttaatttttt 
gtttagtttt 
agatttttat 
attaattgtt 
ttagttataa 
atattgatat 
agttaagaat 
tatttttttt 
tattgttttt 
tttgttttgt 
tttttttttt 
attggattgt 
gttatagttt 
aataatgtta 
tgtagagtat 
ttaatgtata 
agttattttt 
tattttgttt 
attttttttt 
atttttttg 



tgttttattt 

tttgtgtttt 

attttaaatg 

tgtgtttggt 

atgttttgtt 

tagtattttt 

gttttaaaat 

tgttattttg 

ttttaaatat 

atagtataat 

tagaatgttt 

tttaattttt 

ttttttagta 

ttgtttttta 

ttatgtagtt 

agtaatttat 

agtttaatat 

ttatgaatgt 

gtatgtttat 

ttttgatagt 

tttttttttt 

aggatattaa 

tttagaattg 



<210> 410 
<211> 4395 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 410 

ttttttttgt 
tttgtttttt 
tgtattaaaa 
ttttagttaa 
tagtaggagg 
agtaaatata 
tataattgaa 
aaaaatataa 
aattgttaga 
ttttgggatt 
ggtaaatata 
agtgggatta 
tttttgaatg 
gaatagtttt 
tttagataaa 
ttgtgggtat 
taattttgtt 
tttaatatta 
gtattatatt 
atttgtaatg 
tttttataag 
ttttttattt 
tttttattta 
ttattatagg 
atattaatat 
tattttaaat 
ttaatttaat 
gtattaggat 
ttatgtattg 
tttttatttt 
tttgatgatt 
attatatatt 
tttgtttttt 



atgttttgtt 
tgttaaaaaa 
atagaggata 
attaattaat 
aaaataggaa 
tttgtttttt 
atagaaaaga 
tataatttta 
agttgattta 
tttttaagga 
ttatttattt 
taggagttat 
aagagagggg 
atgattgttt 
ttttatagat 
taagtattat 
ttttagtatt 
aattaatata 
gggttaggtt 
ttattataat 
ataaaatatt 
ttttttattt 
aaatgaattt 
tttatatatt 
ttaaaatatt 
taaggtattg 
ttgttatgat 
gtgtatttgt 
tgtttaattt 
tgtttagtaa 
tgtgatatta 
tataaagtga 
tttttagtgt 



taagttttat 
aaaaaaaagt 
tttttataat 
tatatatatt 
aatataaaat 
ttttggttgt 
aaaattagag 
agataaaata 
gaatttgtaa 
attaatgtaa 
taaaggtaat 
attgtattta 
ttagaaggta 
ttgatatatt 
tgtggaggag 
tattggaatg 
ttttaagttg 
tagaaattga 
tttagtgtta 
ttaaatttgt 
agtaaaatag 
attttgttat 
9999gagaga 
tatatttaaa 
gttaattgtg 
gttgttgttg 
tttgtttata 
ttattttgtt 
agtagtaaat 
tgatatattt 
tttttttttt 
tattagtggt 
tggttttagg 



attgtattga tttttgtgtt tttgaggtgg 
tttaaaaaga gttattattt aattttttat 
atttattagt aaattaattt ttaaatgttg 
ataataggtt tagatatttt ggaatattaa 
aatttgaggt tttttattag tattttttta 
tttagataaa tggaggatga ttaaatttgt 
ataaaatata ttgaaatgat taaattttat 
atattaaagt agattaattt ttttattgtt 
aaattaattt gaaatgtata aattaggaaa 
gttaagaatg gatgagattt aaatttttaa 
atgttaattt atttatgtta tatatagtat 
aattattttt agggaatggt tttgttagta 
aaaaggagga aaaggagagg taaataatga 
gtagtgtggt tgtattttag aaaataaata 
gagaataagt aagaaataga ttaatgtgga 
ttgtttataa aaatgttttg aatttttaag 
tgaaattaat tttttttaat tttaatttga 
ttattaattt gagaaatgtg atttttgaaa 
aagagattgt gatttgtaat ttaatagtaa 
tttatttttg tatttttgat ttaattaata 
taagtaagtt agtttttata atttgatttt 
ttaatgatat aaaaggttta tgtttttata 
ataatttata aatatttttt gaattaaagt 
tattatttta agaaatgtat ttaggttttg 
tgttatttta tgttttagtt agtttatttg 
ttattgtgtg tttttatagg aaaaaaaaaa 
tttagaatat ataggtatgt atatttttta 
ttttttgttt tgatgaaatt atattataga 
tttttttaat tgtgtggttt gtgattaata 
ttattatata tttttttata ttgtttttta 
attttttttt atttttatat tgtttgtgtt 
ttttagggtg gaaaggggtg gaagttgatt 
_t gtg_tttLt_lLlL_g _ 
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3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4489 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
-1-980 



seq_prot.txt 

taggggtttg ttgagttttg ggagttttta gaagaggaag atttttttgg ttttattagg 2040 

tattatttgt gtttttttgt tttttatttg tgtttttgtt tgggttaatt tttgttgtat 2100 

gtgtttattt ttgaattgta tgttattttt tatttttggg gggtttttgt gtattgaaag 2160 

attgtttttt ggtaggtttt gggatttggt gatggttgat tgtgtgttgt ttttatgttt 2220 

ggttttatga tgttgtaata tagaaagttt atgttggttt tgatttgtgt gggattgtag 2280 

ggtttgttgg agtgtggtgt agaggttttt tttgtgtgtt tggttttggg aaaggggtgg 2340 

gagggttggt . tttgggagtg tatgggtgtg gtggggaggg tatttattgt gaagtatgtt 2400 

gtgtttatgg attatgtttg tgtgttatat tagaggtttt gggttttatt aattttattt 2460 

agagatggga agatttttag tggtgggggg aggatagggt tgagaggtgt taaagatgta 2520 

aagtaagaag gaaataaagg ggggttgaga gggagattga gaggaagggg gagttttgag 2580 

tttatgttgt agttagattt ggatgagttt gttttttgtt ttgggtgggt ttttgttttt 2640 

gttggttttt agtgttgtgt agttagtagt atttttattg tgatgtttgt attatatttg 2700 

ggtgttggtt gttattattt gtgttgttgt tgttaggatt ttttttttgg gtattgttgt 2760 

tgttgtgggg ttgggaggat gtggtgtgtg ggaggtggtg gttgtagggt gagttttggg 2820 

atgttttgag ttggggttgg ggttggggag agggtgtagt gaggtggggg ttagtttaga 2880 

ttgatggtag tgatggagtg ggtggtggtg gtggtgttgg tggtggtggg gtggtttagt 2940 

ttttagtttt agatgtgttg tgtagtaggt tggagtagtt tttttgggag gatgtttagt 3000 

ggtagtgttt tttgttttag tttttgggga ttttttgttg aggtattgaa ggtaggaaga 3060 

aggggtttgt tattggtttg ttgggttgtg tgttattttt gttattttgt ggaaagagga 3120 

gtgggtgggt gggtgtttgg gaggtgggtt ggagggtggt gtaggggagt ggggtggttg 3180 

999999999* tggggggtgg ggaagggagg gaggagaaag gagttggaag agggtagagt 3240 

tattaaatgg gttttttagt tatggtttgg ggttttatga tttttttgga agtttggagt 3300 

ttgggtggga tagtgaggtt gtgtgtggtt ggtgttttgg ggttggtgtg tggtagaatg 3360 

gggttgtggt ggtggtagta aggatatttt agttgtgtgg atttggggga ggggtgggga 3420 

gggggtgagg atttggttgg gatttgtggt ttggtttgtt agggtgtaga gagaggatgt 3480 

agttgtaaat tttgagttgg atttttgtgt tggatggaag gtgtggaagt gggaggggtt 3540 

tttgtgtgaa aattttttgt ggggtttggt gttttatttt ttaaaggtta gattttgtgg 3600 

gttttttgtt ttttattttt ttttttttat ttgtgtaaag gaattgggtg tttttttttt 3660 

tttttttttt ggggtgtagg ttttgttgtg gattttgtgt ttagtttggg agatatggta 3720 

ggggtgtgtt ttagggaaag gtggttgtaa aagttttgtg gttgagtatt gggtttgatg 3780 

tttagttttt ttattaaatt atttttgtaa agatgtgggt tttttgtaat tgagtttttt 3840 

attttgaggt atttaaaatt attttaaggt atatatggat ttttgttttt ttgtgttatt 3900 

tttttttata ggtttgtgtg gtgtgttaaa gtttgggaga tatgagttgt ggggaaatag 3960 

tattggaaga gtttgggttt gtaaaatgtg aattaatgaa tatgaaataa gggtaattgt 4020 

gaggtagtgt tgggaagggt tggagtgtgt gggggtgtag ggagtttttt tttgggtgtt 4080 

gtaatggtat tttttttttt ttttggtttt gttttgtttt tttttggttt ttgtggatgt 4140 

gatttatgta gattatagtt gagtttgttg gattttggga ttatggaagt tattttgttt 4200 

ggttatttat atttgtgttg tggttggggt ttgggttggg gattgtggtg gtgaagaagt 4260 

tggggtagga agagttaatg gtaatggtgg ggttgttggg gggtgggggg ttagtagtag 4320 

atgttgagtt gtgaagatta gggtggtttt gaaaggttga ggaaaggaga aagggttatg 4380 

agaagagttt ggtga 4395 

<210> 411 
<211> 4395 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 411 

ttgttagatt ttttttatgg tttttttttt tttttttggt tttttggggt tattttagtt 60 

tttatagttt agtgtttgtt gttgattttt tgttttttgg tgattttgtt attgttatta 120 

gtttttttta ttttggtttt tttgttgtta tagtttttaa tttaagtttt agttgtggtg 180 

tgggtgtggg tggttggatg ggatggtttt tgtagttttg gggtttggtg ggtttggttg 240 

tggtttgtgt gggttgtgtt tgtagaggtt gagggagggt gggatggagt tgggaaaggg 300 

aggaggtgtt gttgtggtgt ttagggaggg attttttgta tttttgtgtg ttttagtttt 360 

ttttggtgtt gttttgtggt tgtttttatt ttgtatttat tgatttgtat tttataagtt 420 

tgggtttttt tggtgttgtt tttttgtaat ttgtgttttt tagattttaa tgtgttgtgt 480 

gagtttgtag gaggaagtgg tgtgggggaa tgggggtttg tgtgtgtttt ggggtggttt 540 

taaatatttt gaggtggggg gtttaattgt aagaagtttg tgtttttgta ggagtaattt 600 

gg^gggggga ttggatatta ggtttagtgt ttaattgtga aatttttatg gttgtttttt 660 

tttggggtgt gtttttgttg tgttttttaa gttgagtgtg gagtttgtgg tgaaatttgt 720 

gttttaggga gggagggaga gggggtgttt agttttttta tgtggatgga aaagaagggg 780 

tgggaggtag agagtttgta aggtttaatt tttaaaaagt gaaatattaa attttataag 840 

ggatttttat atgaaggttt tttttgtttt tatgtttttt gtttggtatg aggatttggt 900 

ttgggatttg tggttgtatt ttttttttgt gttttggtgg gttgagttgt ggattttagt 960 
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tgggttttta 
ttgttgttgt 
tgttatttta 
ggagtttatt 
ttttggtttt 
tgtttattta 
tgatgatgga 
gtgaggagtg 
tgtttgagat 
tgttgttgtt 
tttggtttgg 
tttgattttg 
gtggtggttg 
gtgttgaggg 
tggttgtagt 
attttttttt 
agtttttttg 
atgatttatg 
ttggagttag 
gtgttttggt 
tagtattgtg 
ttgttggaga 
tttagggatg 
agagtgtgga 
tttggtgggt 
ggagggaagg 
ttatgggtgt 
ttatgagtta 
aagtaaagat 
aaatatagta 
tatatatttt 
gataaattaa 
ttttaattta 
tttaaatatt 
ataagtttgt 
tattttaagt 
agaaaaaata 
tttgttttgt 
aatggtattg 
gatttaatat 
tgatttggta 
tgagggatgg 
gtttaatgtt 
taggatttgt 
gttatggagt 
ttttttattt 
ttttgtagtt 
aatgatgtat 
aagggagttt 
tgatttttaa 
ttatgttatg 
tttgttttag 
ataggtgtgt 
attttttttt 
ttggtttggt 
tttattttta 
taatagaaaa 
aatatgtaga 



tttttttttt 
ggttttattt 
tttaggtttt 
tggtaatttt 
ttttttggtt 
tttgtttttt 
tttttttttt 
ttgttgttgg 

tggggattga 

gttggtttgg 
ggtgttttgg 
tggtggtgat 
gtgtttgggt 
ttagtgagag 
gtgggtttgg 
tgttttgtgt 
tttttaaatg 
ggtgtagtgt 
tttttttgtt 
ggattttgta 
gaattgggtg 
atggtttttt 
gatatgtgtg 
tgatatttag 
ttttgtttgt 
gtgagggtta 
gtggtaatgt 
ttgagtaaaa 
gagagtgttg 
tgtggtttat 
gatgttgaag 
attgattttt 
aaatgtaaat 
aatattaaga 
ggtgagtttt 
gaaaatgtga 
gaagaaaaat 
aaaaatgttg 
tagatttatt 
aatgtttttg 
ttaaattaag 
ggttatttgg 
tataatttat 
ttaagtattt 
tgtttttatt 
agagatgttg 
ttattgtatt 
ttgttttagg 
taaggttttt 
tagttaataa 
tttttatggg 
ttgtagtaag 
ttatttagaa 
tattgttaat 
taagataata 
gtataataga 
gtaaattatt 

gggag 



gttttttttt 
tgttgtgtat 
gggtttttag 
gttttttttt 
gttttgtttt 
tttttgtaag 
ttgtttttaa 
atattttttt 
gttattttgt 
attggttttt 
ggtttgtttt 
gatgtttggg 
gtgatgtgag 
tgagagtttg 
agtttttttt 
ttttaatatt 
gaattagtgg 
gttttatagt 
ttttttttgg 
gttttgtgtg 
tgggggtggt 
agtgtgtagg 
gtagagattg 
tggggttaga 
agtataggag 
gtttttattt 
gggtagtgta 
ggtagtataa 
attgtgggtt 
aatgtgattt 
gatgtgtata 
ttttttttgt 
gaattggtta 
tttagatatg 
aatttaggaa 
aaatatagat 
taaattatga 
attagattaa 
gttaagttat 
gaaattatgt 
ttaaggttag 
aggtttagaa 
attaatttgt 
attttttaaa 
gtttgttttt 
gtagagttat 
gtgtgtgata 
agtttgggtt 
tgatttgtat 
tgaaaaaatt 
atttgattat 
tttaattatt 
aaatgttagt 
attttagaat 
tttaaaaatt 
aaattggatg 
ttagaaatat 



seq_prot . 
tagatttgtg 
tagttttggg 
gagggttgtg 
ggtttttttt 
tttgtattgt 
atagtagagg 
tgttttagtg 

ggggaggttg 

tgttgttggt 
attttgttgt 
gtgattgttg 
aggagggttt 
tgttatggtg 
tttggggtgg 
tttttttggt 
ttttgatttt 
agtttggagt 
gagtattttt 
ggttgaatgt 
ggttggggtt 
gtgtggttag 
aattttttgg 
gtttaagtga 
ggagtttttt 
gtagtagaag 
ttttttgttt 
gaggtgggaa 
gaaaatgtgt 
atgtagttga 
tattaaaata 
tttgtgtgtt 
ggaaatatat 
gagtataaag 
ttttttagag 
atatttgtga 
tttttatgtt 
agattagttt 
gaatatggaa 
agattatagt 
tttttaaatt 



txt 
tggttgggat 
gtgttggttg 
gagttttaag 
tttttttttt 
tttttagttt 
tggtgtgtag 
gaagattttt 
ttttgatttg 
gttgttgttg 
gttttttttt 
ttttttgtgt 
tgatggtggt 
gggatgttgt 
aggatggatt 
tttttttttg 
gttttttttt 
ttttggtgta 
tttgttgtgt 
gtaggaaaag 
gatgtaaatt 
ttgttgttgg 
gggtggagga 
gggtgtaggt 
ttttttaggg 
gtatatttga 
tggagattgt 
ggaatgagag 
ggtggaaatg 
gaggggtttg 
ggaagagtgg 
ttgggtgtga 
agtaataata 
taatatataa 
tgatgtttaa 
attgtttttt 
gttggatgat 
atttattatt 



aagagattaa 
tatttttgta 
tttttgttta 
atatagttat 
tttttttttt 
tttttaaggg 
tgagtaaatt 
ttatttattt 
attttaaatt 
aatttgtttt 
tttagtgtat 
ttttatttgt 
ggaagatttt 
atttgaattt 
aatttattaa 
atggtttttt 
agaagttagt 



gtaaaataag 
ttttttgata 
aataattaat 
ttttatagtt 
ggtaatattt 
tttttttttt 
attgtagtat 
ttttttattt 
tagtttagat 
gatatgttgt 
ttaatttgta 
ggtttttgta 
gatattattt 
tttattttta 
ttggggtaat 
gaattatttt 
gttataatat 
tggatattat 
ttaaaatttt 
gtagtgtgga 



gtttttgttg 
tgtgtagttt 
ttatgattaa 
tttttttgtt 
gttttttaga 
tttggtgagt 
aagggttgga 
ttgtgtggtg 
ttgtttgttt 
tggttttggt 
gttgtgtttt 
ggtgtggatg 
tggttgtgtg 
tatttggatt 
gttttttttt 
tgttattgga 
atgtatagat 
ttgtgtgttt 
tttttgtgtt 
ttttgtattg 
attttaaaat 
tagtgtgtag 
ggaaagtggg 
gtttttggag 
agttagtgtt 
tgatgttgtt 
gaaaatagta 
tattattatt 
ttgttgagtt 
agtaaataaa 
gtagaattat 
gtagttagtg 
ttgataatat 
gtataaatgt 
tttttgaatt 
aaaatgaata 
ttgttagtgt 
tttagattat 
ttggaaattt 
ttttatatat 
tagaaagtgt 
taataatggt 
ttatagtttg 
gttaggaata 
tttggttttt 
atagtgtgat 
ttttagagtg 
ttagtttttt 
agttttgaat 
tattttgaaa 
attttttttt 
taagaaagga 
gtattttttt 
gtataattga 
aagaatgttt 
tttttttttt 
atttggataa 



<210> 412 
<211> 10490 
<212> DNA 

<213> Artificial Sequence 

<220> ",!"..:"' 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 412 



1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4395 
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agaaggtgtt tgtttttttt ttgttttttg ttatgattgt aagttttttg aattgggagt 60 

tgattaaatt tttttttttt ataaattatt taggtttaag tattttttta tagtagtgtg 120 

aaaataaatt aatatttttt ttttgaggtg tttttttttt aggtaatttg ttgtttttat 180 

gttttttttt tgttttttgt tttttttttt ttttttatga ggtttaagtg ataaatgggg 240 

ttagttttag ttttagtttt agttttagtt ttattttatt gtaggtttgt gtggttgttg 300 

gagaggttgt gttttttttt tttttttgag tttgtttgat atgttttttg gattttggag 360 

gaaattgatt ttttattttt atattggtgt aatatttttt aagattttaa agttgtatta 420 

tttgagttag tttttttttt tatttttatt tgttagggtt gttattggga tagttttaga 480 

gggtggtgtt aatggatgaa tggatggatg gatagtagtt tagggatgat gtttttgttt 540 

gttttgaatt gggttttttt tttaatgaga agtttttttg agtgagtata tatagttatt 600 

ttttggtatt tatggaggat tagttttagg gtttttggga atgttaaaat ttatggatgt 660 

ttaagttttt gatataatgt ggtttagtat ttatgtataa gttatgtata tttttttgta 720 

tatgttagat tgttattaga ttatttatga tgtgtaatat aatgtagatg ttatataaat 780 

ggttgtgata ttgtattttt tagggaatga tgataagaat aaagtttgta tatgtttaat 840 

agaaatataa ttgtttaatt tattttttga atatttttta tttgttgttg ttgaatttat 900 

agatgtagag tttttggata tgagagttaa gtgtgttttg agagtagggt gggtgaggtt 960 

gttaatgagt ataggggagt aggtgttgat taggaggatt ttgtattggg gtatttggat 1020 

gttttgtttt aggatttgag attttagttg gatggtatag gtagatttag tttaggttaa 1080 

agttgttttt ttgaagtttt tttttatttt aagttttttt tggattttgg aatttggtat 1140 

tttttaggtt ttgtgtggaa ggatagatga attaggtttt agataatatg tttagaagag 1200 

tgagttttta ttgtgtgttt ggtatttttt ttataggatt ttttagttag aatatttaag 1260 

ggttatggag agaaatattt agttaaaata ttagaaaaga aaaagtgata ttattagaga 1320 

tattaaaaag attattaggt aatagtatta gtttttgtat tttgagattt aatagtagta 1380 

gttatttttt tttattgtta tgtgtatttt aggattattt tgggtgggga gggttgtggt 1440 

tagggagtag ttatggatgt tttgatgttg gttttgggtt ttgggggtga tagtgatgag 1500 

gaattgggtg tatatatgag tggggtagtt gggtttggtt agagaagtag tatatatgtg 1560 

tatagatgtg tttatttata tatatatgtg tatgtatgtg tataaatata ttgtaggtag 1620 

gtatgttgat gttttaggta gtggaggatt ttgattttgg gtgttgttga tttgggtaag 1680 

gttttattgt gatttgtgtt atgattttag aatgttattg gtgtttagta tttgtttgtt 1740 

ttttggtttg ttttagtggt ttatagtagt tatatatagg tagtggtatt tgtgagtagt 1800 

tttgtggatt taaaggtttt ttttttgaga ggtatgattt aggttagttg atttattaga 1860 

attaggtgag tgtgatttgt tttttttttt ttaggtggat ttggggatag tggttatggt 1920 

gtgggtggtg ttggtttttg tggggtagtt attgaggagg gttatttttg agtatttatt 1980 

aggtgtttgt tttatattgt ttgtgtagat gattggtttt tttgttttta tggtggtttt 2040 

gtagagtggg tgttgttttt aaatgttttt atttgataga tgagatgttt ggggttagag 2100 

aggtagtaat tggtttggga atttggatat gattttgagt tttgttttta gttttgttgt 2160 

gtgttgtgtt ggaatttagg tttgaatttt gtgatttttt tgttttagat tttaaatttg 2220 

tttaggtttt ttattttgat ggggtagagt ttggttttgg tagagttatt ggtatagagt 2280 

tattggtata gatttattga tggtttttag aatatttttg tgttttaagt tgggttttga 2340 

tggttgttgt gggttttatt gaatatatat ggttttttgt ttaggggagt ttgttgtttt 2400 

tgggtagttg tggaaaatga aggagttttg gagggttggt tgaggggaga ttattttttt 2460 

ttgtgtttaa aggggtttgg gtattagggt tttttttagg tattttttgt tttgtgtggt 2520 

ttttttgagg ttttgttttt tttttgtttt gagtattttt aggagggatg gtttatttag 2580 

ttgtttttta ggattaagga tttattgttt ttttttagtg atttaggaaa atgaagtttt 2640 

tttttgttgg gatggtttag aatggtggat tttatagttt ttttgtgaga gatgtggttt 2700 

ttatgtgtat aatagatttt ttttattttt taaatttaat atttttttgt ttaataggtg 2760 

ttatttttaa agtggtttta ttgtttagat tgaagagtta tggtagttaa agtgatgagt 2820 

ggagtagaat tgagtagttg ggagagattt tgttttttgt aggaaattgg gtattgttga 2880 

ggttttgagt attttaggag gttgattgta tagagatttt tggttgttga ttttagtttg 2940 

tttttatatt tttggaatag tttattatgg gttttttatt tttggtaggt ggaaattatt 3000 

taatttgttg gggttggtgt gtttttattt tatggtattg ggggataata ggattttttg 3060 

tttaggtttt attgtattta agtttttggg aagatgttta tttttgtttg ggatttgaga 3120 

ttttagagat tggagtagtt gtgggttatt gggtttggtt tttttttttt tgggggtggt 3180 

ggtggaatgg gggttatgta gttagttagt atttgggagt ttggtgagag tggtttaggt 3240 

gttttttgaa gttgttgtgt atagtgtgat ttttagataa ttttgtttta taggatggat 3300 

gtggtagagg ttgtgggtag ttggtgggta taagagtgag aggatattat tatgaaatat 3360 

gaaaaggtat aagttggttt gttttttgga gggaggtttt ttttagtgtg ttttggttaa 3420 

agggttttgg gttttttagg ajgtatagggt agggatgggt ggttaatgtt tttaggtttt 3480 

tgtatttttt attttggatt ttttattaag gttttttttg ggttataggg atattgagtt 3540 

gggttgttag aggataaggg gtttaagttt ttttgaagtt ataataataa tgttgattat 3600 

ttggggattg tatagtgagt tttttgtatt ttttttattt ttaaaigtatt tgttttagtt 3660 

tagggatggg tttgttttta gaaaggtttt tttgatgtag gatatgtttt attaggttgg 3720 

gttaattttt tttttaggga tatgaattttt ttttgatttt tttgtag|gtt tagtttgagg 3780 

ttgttaggtt agaggtgtgg ggtttattta gggagttggt gggaatggag attgggttag 3840 

gttaggtttt tgggtgtttS gtagttttgt tggtaagtga gtataagagg agtggggtag 3900 

tttgagggtt tggttttgtt tatttggaga tasittttggt gagatgtaag ggttatggtt 3960 

ataigggtgag gggatgtttg gtttagtttt agggttgttfg tttagtdggt trttgagcigt 4020 

' 1,1 ' 1 Sei te ^37 1 " ' l " 4 ' 



seq_prot.txt 

ttatttgttt ttgttttttt ttattttttt agagttatag tttttattgt tttgtgaggg 4080 

gaaaaggtat ggtgataatg ggggttgtag ttttaggaga atgggggaga agatgggtag 4140 

ggttttgttt tgggtatttt atggtgaggt tagggaggta gtagggtttg tggttaaaga 4200 

tttgggtttg gtgttgggaa gggatttggg gttgggtaag aggagtttag ttaggagttt 4260 

attttttagg gattatagga tggagagata gaggattttt ggggaggtag ggtgggaggg 4320 

agttgatgag ttgtgttatt tttgaaatgt agggtgtgtg gtttgggtgt agggagaggt 4380 

aggtggatgt tgggaggtta gaatttgtaa gggttttggg gttgttaagt ggggtgggtt 4440 

tttggtgtag ttagagtata ttgggtaggt tttagggtag gtttttttga ttttggtggg 4500 

gggatgtggt tattttttga gggatttttg ttagggtttg gttgtttatt ttgggtggtt 4560 

tttattttat tttagggtta atttttttta gttttagtag aaagtattat tttgagttta 4620 

ggatgggtag ttttattggg tagtttgatt gttttttatg ttaggggttt tagtaatttt 4680 

ggttaggttg tttttatatt tttttttttt ttaggttttg ttttttttgg gagttagttt 4740 

tataggaagg tttttgtttt tttttttttg tgtttttttt tgggttgagt tttgagttgg 4800 

aaagggatag agttagtttt ttttgggggt tggtatttag gttggggttg ttttaggttt 4860 

tgtgtagttt tttagttttg tttgggttgt tttatagtga gatggagttg ttttttttga 4920 

ttgtgtggga ggtgaaggta agagtttgat gtgtggaggg gttggtttag ggatgtaggg 4980 

attgggtggg tggttagtga ggtagaggaa gtagttggtt tgagtggtgg tgggtgaggg 5040 

taatatgttg ttattgggag gggtagtagt ttttgttgga tttgatttta ggttgttgtt 5100 

tattttggta gtttgataaa attttaaaag gagaattata gttttggttt gggggtggtt 5160 

gtgtgtttgt gttaggattt tatttagagg ttgggattta agattggtgt gtttgtggtt 5220 

tgaggatggt atattttggg gttttaaagt tagtttattg gtgtttattt gtttaaaggt 5280 

ttttagtttt tgaggtttgt ttttttttgg ttttttttag ttggttttta ttagggtttt 5340 

agagtttaag atttagtatt tgtgggtggt tttgggaagt ttggtagttt tgttaatttt 5400 

aatatgtttt atttgatagt aaatttggtg ggagattagt tgaaagagta agtgggtgga 5460 

tatgttggga gattgggaga aatataaaag tagtagaaag gtaatgtgtg gagggaggaa 5520 

gtattttttg tagagatagg ggataggtat ttatggttgt ggtttggtat tattagtttt 5580 

ttagagggtg ggtggtatat tgtttttgtt tagaggattg taggtttggt tgttagattt 5640 

tttgtttatt tgtgtaagtg ttattttgta gggagggaat ttgaatttag ggttgggatt 5700 

atttggagtt taaggttagg gatgttttgg tgatttgaag gaaggaaaag gtttagatta 5760 

gagttttgat tttgagtgtt tatttatttt tttagttttg ggaagggaga ttttgtttta 5820 

gtttgatttt atttttattg aggaattatg gggttaaaat tgataatttt tagaattttt 5880 

gggttttggt ttttattggg gttattttgt ggtttgtgat attagattgt tttttgttta 5940 

tagtttatag attgagtgta taagggaatg tttatgaata tttggggttt gatgtggtta 6000 

gtttttttga atattgagga aatgaagttg aaaaattttg gaagatatta ggtatgttta 6060 

gttagagtat aataaatagg ataggttgtg ttggggttta ggtttttagt tggagggaat 6120 

gttaagatta ttttggggag ttgggggtga aggttagatg aatattttgg gtatagatgg 6180 

tgatatagtt attatagata aatttagttt tggtgatttt ttttggtttt agtaataagt 6240 

taaaatgtag ttttttgtag aaggaaattt tttttttgtt tttttttttt gaagtgttga 6300 

ttgtgggttg attgttattg ggggtaggga gttttttatt tgttttgaga ttgttttttt 6360 

tttttggttt tgttttatag attatgaagg agaagggtaa gaggttattt gagtatattt 6420 

agtgtattga ttgggatgta agtgggatat taaggaagta tatatttttt agggattgat 6480 

atggaattaa gtaagtttat gggagttata gggttttagt agagatgggg tgaatgagag 6540 

ggatgggggt ttttttggag tagaagttag ggttatttag gagggatgat atagttgtta 6600 

agagtttttt tggtttaggg agtagttggt attatgaatt gagtattttt ttggttttaa 6660 

gttttgggtt agattggaat atgtggggtt agaatttagg aggattttga ggagatggaa 6720 

ggtagtaaat aaaattatgt ataatggtga agggtgtttt ttttgattta tggggattta 6780 

tggtaggatt tatgggaggg tggtaggata gagggtttat gagttttttt taggtaatag 6840 

tgatagtatt aaatgttggg agaattaggg gttttggaaa tttttattta ggtttgttgg 6900 

gaatatgata tggtatagtt atgttggtag tttgttgggt agtggtttat aaagtttgat 6960 

ggatttgaat tatatatttt taaagtgtta tagatattga atttattgat ttgtaaattg 7020 

atatttatat gaaattagta tgttaggttt attgtttgat tttttgttat ttatatatgg 7080 

agtttttggg gatggttttt aatatgggga tggggagagt aaggttggtt ttttttttaa 7140 

atggaagatt tagtgagaaa agggaatgag ttggtgatgt ttgtatgaat gtgggtggat 7200 

tttagatgta ttttgttgag ggatagaagt tagatttaat aagttattat agtaggattt 7260 

ttatttttag gttattttgg aaaaggttaa attataggga ttgagaagta gtttgggtgg 7320 

ttaggggttg atggattggg gagaggttgg gtgtataggg gttattttgg agatttggag 7380 

gatgaaggag ttgttttagg aggggttgga gtggtggttg ggagattttg tatattggtt 7440 

tggaattgtg gaggaattgt atatttatag attgaattgg tgtgtgtgta aattgaaaaa 7500 

aaaaaaaaaa aattatttag agtgaaaagg attaggtaag ttattgtata attgggttat 7560 

ttgtatgtta tagatgtgga ttttattgaa atattttttt aagagtttta ggttttgaag 7620 

agtttattgt ttatttggtg aaatatttga atttgaaatg ggatttgttg ttaggttttg 7680 

tagataaagt gaaattaata atatttgtat aaaataaatt aaagtttttt ttttttgttt 7740 

tttaggtagt gggaattatt ttatattttt ttggtatatg aggagtataa tttggtgagt 7800 

arttttggta gtgaggtttt tgggttatat ttttatattg ataggagtgg gtgtttggtg 7860 

ggggtgttgt tgtttttttt aaagttagta tttgtgattt aittaggatat aggaggtagg 7920 

atgttagttt attgttggta taaattttta aggaaggggg tggttttaag gggttaagtt 7980 

gagatataga ggagttaggg tttggatttt tggtgttatt tgggtttgat tattattttt 8040 

tagaataaga aatgatgttt tttttttggg g ttg ttttaa agtttaggag tttggta&t£ _ 81D0__ 
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ttgtatatag 
atttggtttt 
ttttttttta 
taggtgattt 
gtgggttatt 
gaggatgtat 
aagtgaatag 
ggtgattata 
aggatgtttt 
atgtttttta 
ttggagttat 
ttttgttttt 
gtttaggtgg 
gattgggaag 
ttgttttttt 
gaatgttttt 
ggtgggaggt 
ttatattttg 
gtttaaatigg 
tataatttaa 
ttttgttttt 
tggttgagtt 
gttttttttt 
gtagggtaaa 
tttggatatt 
aggtttttag 
atttatgtgg 
ggaggtatag 
agtttggggg 
tataagtaaa 
tagttttatg 
tttgggtaag 
gagttttaag 
ggttattgat 
tggaaggatt 
tgtttgtggg 
gtttttaagg 
gttagatttt 
ggtttagagg 
tttatatgat 



gatggtgtta 
tgttatgaaa 
gaggaagttt 
tagttttttt 
gtagggattt 
tttgggtatt 
ttgttttggg 
gtttttagtt 
tgttgagggt 
tttttattag 
gttttttggt 
tagttgtttg 
tagtattttt 
tttagtgttt 
tgtattttgg 
gttttatttt 
tttggggttt 
ggggaaggtt 
tgggattgtt 
gattatgggg 
tgtggggttg 
ttagttaggg 
ttgttttgga 
gggtagtgtg 
gtttagtgtt 
ggtagtttag 
gtagtgtttt 
ggagattttg 
tttggtttat 
attttttgtt 
ggtagattgg 
tttttttttt 
attttgtttg 
tttggagatt 
aggttttttt 
atgtgtattt 
tttagtagag 
gattggtttg 
gaggtttggt 
ttagatgaaa 



ttagtagata 
tgaatgtgta 
ttttagttat 
aggtgatgtt 
gagttatatt 
ggtgtagttg 

gattttttgt 
aaggtatttt 
tttataggag 
tagagggtat 
tttgattttg 
ggtttttgtt 
ttattttttg 
atttagtttt 
tttaggaggt 
tttagggtaa 
gggggttttt 
ttgattttat 
taggggtttt 
tattaggtga 
taggagtagg 
tttgatttgg 
ggagatagag 
tttattggga 
ttgtattagg 
tggtttgagt 
ttgttaggtt 
gtttagggat 
ttttagttta 
taaggggggt 
gttaggattt 
taggagttat 
atttagtgtt 
gaagttttag 
atatttgttg 
ggtagaaggt 
taagtttatg 
agggtagttt 
taggtgggtt 
gttgagagtg 



seq_prot. 
ttttggataa 
ttttgaggaa 
ttttgttttt 
tttttatggt 
gttgttttgt 
ttggttagtg 
agttagattt 
ttttgtgttg 
tttatggttg 
tttatttttt 
tgtagttgat 
gttattggtg 
ttttttggtt 
ggttattttg 
ttttagggga 
aggttgtatg 
ttagttgttt 
gatgggttgg 
aagattaata 
gtttatggtt 

ggggttggag 

gatgttgggt 
gtatagggat 
gtgtgggaag 
ggaagggttt 
atttttgttg 
gttttatttg 
tttttttgtt 
taggaggttt 
tattttaggg 
gatttgagag 
atttttattt 



ttggagggtt 
tgtgtgtagt 
tttttataga 
gaataggtgt 
tgtgtttagt 
ttttatattg 
gggtaggata 
tggtgagtat 



txt 

ggtgttgaag 
gttttttttt 
tttaatgata 
gattttggtt 
ttttttttta 
agaggtattt 
ggggatggtt 
ttagtttgtt 
atttttaaag 
ttgtggttat 
tttttttttt 
tttatgaatg 
tgattttatt 
ttgtggtttg 
atttttagtt 
ttggggttat 
agtttttgta 
gggtttttta 
ggagtatgtg 
tttttagttt 
ttttttgtgg 
tttttatggg 

gggggtttag 

gtgatagtgt 
ttagaggttt 
tttttattag 
gatattgatt 
ttgtagtgtt 
aggtgggttt 
ttatggttgg 
ggtttaggga 
aaatgagtgt 
taggtgattt 
ttgagttatt 
ttttttttgg 
tgatgttgat 
ggggtttggg 
tttttatgat 
ttgtgatatt 
ttttttgttt 



tgtttgatgg 
tagaggaagt 
tgagtttttt 
tttgtaggag 
tttttttgag 
tttgtagggt 
attttggtta 
gggagatttt 
tttaaattag 
tttttgtgtt 
ttgagagttt 
ggttgattaa 
attaggagat 
aaagttaggt 
aggttttagg 
tagatgggag 
gttgatggtt 
ggattttata 
gtagttatgt 
tttttagagg 
ggttggtgat 
ttgggagttg 
tttttgtaga 
tgtggggagt 
tggaagaggg 
gataagaaag 
gatggggtaa 
ttgttttttt 
ttaaaggata 
ggtttaggtt 
agttttaagt 
tttttatgag 
ttatggggaa 
agttttagtt 
gtttattttg 
aataagaatt 
gagttttggg 
tttttgtttt 
gagtttattt 



<210> 413 
<211> 10490 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 413 

ggatagggaa 
ggtgttatag 
attatgagga 
tttaggtttt 
attggtatta 
ttgagagaga 
ggtggtttga 
gggttgtttg 
gtatttattt 
tttttgagtt 
ttagttatgg 
ggatttgttt 
ggtattgtag 



ttgatggaag 
gggtttttga 
atattgttat 



gtgtttatta 
tgttttgttt 
tagtgtgagg 
gttgggtata 
atgtttgttt 
tttgtgggga 
gttgtgtata 
agttttttag 
gagtggggat 
tttttaagtt 
ttttgggatg 
gagttttttg 
ggtaaggaag 
tggatgaggt 
taatagaggt 
agattttttt 
ttttttatat 



tatttttgat 
agtttatttg 
aagttgtttt 
tgtagattta 
gttttttatt 
tagtaggtgt 
ttggggtttt 
gatgttgggt 
gtggtttttg 
gggttttggt 
attttttttg 
tgggttggga 
gtttttgagt 
agtttaatgg 
gtttaggtta 
tttagtgtag 
ttttggtgga 



ttttatttgg gttatgtggg 
gttagatttt tttttgggtt 
tgggttagtt ggggtttgat 
ttttgttgaa ttttaaaggt 
agatatatgt tttataggtg 
gaaagaattt ggttttttta 
agtttttaga gttagtgatt 
tagataaggt tttgaagttt 
gagagagggg tttgtttagg 
ttagtttgtt tatgaggttg 
ggtagagggt tttgtttgtg 
gtgagttaga tttttgggtt 
tagggttttt ttatgttttt 
ggaatattgt ttatatagat 
ttgggttgtt ttaaaaattt 
aatattgggt ggtgtttaga 
tatattgttt tttgttttgt 

Sdite 389 



ggatgggttt 
agaatagagg 
tttagggttt 
gatttttgtt 
tagggtgagt 
ggttggggtt 
ttttttatga 
tttatggggg 
gtttgaggtt 
ggtttgagtt 
tgttttttgg 
ggggaagtag 
ttattttgtt 
tttttttgtt 
ttttttttta 
gttttttata 
tttgtgggay 



8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10490 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



> i 



ttgggttttt 
tttatggaga 
ttatgagggg 
gagttgaggg 
tatatgtttt 
tgagaagttt 
tgtaagagtt 
gtgattttaa 
ggttggaggt 
taggttatga 
agtggggttg 
tatttgtggg 
ggaggggaga 
gtggttatgg 
ttttgggggt 
aataagttgg 
ggttattttt 
gagtgttttt 
tagaggagga 
agttagagtt 
tgttattgga 
aaaaagaggt 
ttttagtatt 
tttttgggtt 
gttaggttta 
tttgagatta 
atattttggt 
ttatttttgt 
ttgtattttt 
agggggtttt 

tagtaaattt 
tgagattttt 
tgtatagtga 
tgtatatatg 
tagagttttt 
ttagggtggt 
tgttttttag 
gtggtagttt 
gtttgtgtgg 
gattagtttt 
gtgatgagga 
attagtgggt 
gtgagttatt 
tggatgagag 

gggggggttt 

tgggttaggg 
ttaggatttt 
ggaggtgttt 
gttatttttt 
ttttattttt 
gaagaatata 
agatgatttt 
ttttagaata 
gggaaggaag 
gaagttaggg 
ttagggtgtt 
gttggagatt 
tggtattttt 
gggttttgga 
atgatttggt 
gaaattgttg 
tttttttttt 
tttttttttt 
ttagatttag 
attaggtttg 
gtgttaggtt 
tatatgttat 
tgtttttttg 



atttttgtgt 
atttgatgtt 
ttttagtttt 
gattataaat 
tgttggtttt 
ttagtttatt 
gggtagttgg 
tatgtggttt 
ttttttggag 
tagggtggtt 
ggttagggaa 
tattgatggt 
gttagttgta 
ggaggatgag 
tagttgtgga 
tgatataagg 
aggtttggtt 
tattggttag 
ataaggtggt 
attatggaag 
gagggtagag 
ttttttagga 
ttgtttaaaa 
"ggggtagt 
ggtgatatta 
tttttttttt 
gggttataaa 
taatatggaa 
tatatgttag 
ggtttgtttt 
tattttaggt 
gaagaaatgt 
tttgtttgat 
ttagtttagt 
tggttattgt 
ttttatgtat 
tttttgtggt 
attgggtttg 
gtattattgg 
gtttttttta 
gttaagtagt 
ttaatatttg 
gtttaatggg 
tttttaggat 
atgggttttt 
agagtatttt 
tttgggtttt 
ggtttatggt 
ttgggtgatt 
gttgggattt 
tgttttttta 
ttgttttttt 
gatggaagat 
gatagaagga 

agggttatta 

tatttgattt 

"tgggttttga 

tggggttttt 
tgtttatggg 
gttataggtt 
gttttggttt 
taggattgaa 
ttttaggtta 
attttttttt 
tagttttttg 
atagttatgg 
ttttttgttg 
gttgattttt 



ttttgttttt 
ttaggttagg 
tttgttttta 
ttatttgatg 
ggagtttttg 
atgaaattag 
agaggttttt 
ttattttggg 
gttttttggg 
ggaattgggt 
taggggatgg 
agtaggagtt 
tagaattaga 
atgttttttg 
tttttgtggg 
agggtgtttt 
gtaggaggtt 
tagttgtatt 
gatgtggttt 
gatattattt 
gtgattggag 
tgtatattta 
tgtttgttga 
tttaggagga 
ggagtttagg 
tggaggttta 
tattaatttt 
atatggtttg 
gaggatgtgg 
gtgtagatgt 
ttagatgttt 
tttagtaaaa 
tttttttatt 
ttgtgggtgt 
tttagttttt 
ttagtttttt 
ttggtttttt 
gtttttgttt 
tttgtttttt 
tttttgtgtt 
gaatttggta 
tgatattttg 
ttgttaatgt 
ttttaatttt 
tattttgtta 
ttattattgt 
ggttttatat 
gttggttgtt 
ttggtttttg 
tgtggttttt 
atgttttgtt 
tttttatgat 
tttttgtttt 
aggttttttt 
gagttgagtt 
ttatttttag 
tatggtttgt 
taattttatt 
tatttttttg 
atggggtgat 
tatgattttt 
agagtggatg 
ttagggtatt 
tgtaaggtga 
ggtgaggata 
gtgtttgttt 
tttttgtatt 
tgttgaattt 



seq_prot.txt 
tttagggtag gaaaggaaat 
ttttggttgg gatttagtta 
tagttttatg ggaggtaggg 
ttttatggtt ttgggttgtg 
gatggttttg ttatttgggt 
agtttttttt taagatgtgg 
aaattttaag gttttttatt 
gaggtggggt gggaatattt 
ttagggtgta aaaagggtaa 
gggtgttggg ttttttggtt 
ggagatgttg ttatttgggt 
tgggtagttg gagggtagga 
gttggagggt gtggttttag 
ttgatgggga tgagaggtgt 
atttttagta gagatatttt 
ggttgaaagt tgtgattatt 
tttggggtag ttgtttattt 
agtgtttaga atgtattttt 
aggtttttgt agtagtttat 
gggagggttg aggttatttg 
aggttttttt tgaaggagag 
ttttatgata agagttaagt 
tagtattatt ttgtgtgtga 
gggtgttatt ttttgttttg 
ttttgatttt tttgtgtttt 
tgttagtggt gagttgatat 
aaaagaagta atgatatttt 
ggaattttat tgttgggaat 
agtagttttt gttgtttagg 
tgttaatttt attttgttta 
tattagataa gtagtgagtt 
tttatatttg tgatatgtaa 
ttgaatgatt tttttttttt 
atagtttttt tatggtttta 
tttggggtga tttttttatt 
tttgatttgt tagtttttgg 
ttagaatggt ttaggaatgg 
tttagtaaaa tgtatttagg 
ttttttattg ggttttttgt 
gaaggttgtt tttgaaggtt 
tgttggtttt atgtggatgt 
gggatgtgtg gtttaagttt 
ggttgtgtta tgttatgttt 
tttagtattt ggtgttgtta 
tttttttgtg ggttttatta 
gtatgatttt gtttgttgtt 
gttttagttt ggtttagggt 
ttttgggttg ggagagtttt 
ttttggggaa gtttttattt 
gtaggtttat ttggttttgt 
tatgttttgg ttgatgtgtt 
ttgtagggta gggttaagag 
tagtggtagt tagtttatag 
ttgtagaaag ttgtattttg 
tgtttgtggt gattgtgtta 
ttttttaggg tggttttgat 
tttgtttgtt gtgttttggt 
tttttaatgt ttaggaggat 
tatgtttgat ttatgagttg 
tttagtgagg attagagttt 
tagtagaggt gagattaagt 
gatatttaga gttgaaattt 
tttagttttg agttttgggt 
tgtttgtatg aataggtagg 
gtgtgttgtt tattttttga 
tttgtttttg tagagagtgt 
ttttttagtt ttttagtata 
gttgttaaat gaqqtatgtt 
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taatttttag 
gttattagtt 
tttttgggaa 
atgtggttat 
tgtgaaattt 
agttattagt 
tttttatttg 
tttggagttt 
gtttgatttt 
attttttggt 
ttggttggtt 
ggatttttag 
gatatagagg 
ttgatttggg 
aaagtttttt 
tggttagggt 
attttgtagg 
tttaggaaga 
tttttgtaag 
ggaggattta 
gttttttttg 
ttattaggta 
tgttgttaag 
agaagtggtg 
agtttgattt 
tttatttttt 
tattagatat 
atttattggg 
aaatagagaa 
taaagtttaa 
ttttagggtt 
atagtttagt 
ttttttagtt 
aattaatgtg 
ttttaagttt 
ttatttagat 
gaattttatt 
atttatttat 
ttgaagggag 
ttgtgtgtga 
tagtttgtaa 
attgagtttt 
ttagtggatt 
ttgttgtttg 

tgggttttta 

ttttattttt 
ttggaattag 
tggtagttgt 
tttttattta 
attgattttt 
ggatgtgttt 
gaggaagtag 
ttagtatttt 
gtttgttatt 
ttatttgtgt 
gtttttttta 
tgagtgtatt 
tgattatatt 
tgggtagaaa 
ggggattttg 
tgggataggg 
tgatttgaat 
agttttagtt 
aaatttggtg 
gaggttgatg 
tttttttttt 
tttatttatt 
ggagttagtg — 



1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 



seq_prot .txt 

gagttgttag gttttttaga gttgtttgtg gatgttgggt tttgggtttt ggagttttgg 5160 

tgggagttag ttggaaggag .ttagggaagg gtagatttta agggttgaga gtttttgagt 5220 

aaatgagtat tagtgggttg gttttgggat tttgggatgt attattttta ggttatagat 5280 

atattagttt taggttttag tttttaggtg gggttttgat ataagtgtgt agttattttt 5340 

aagttaggat tgtggttttt tttttggaat tttattaaat tgttaaagtg aatagtaatt 5400 

tggggttagg tttagtaggg attgttgttt tttttagtga tagtgtgttg tttttatttg 5460 

ttattgttta ggttagttgt tttttttgtt ttattgatta tttgtttagt ttttatgttt 5520 

ttggattagt ttttttatgt attaggtttt tatttttgtt ttttgtgtag ttagaggagg 5580 

tagttttgtt ttattgtaag gtaatttagg tagagttgag gaattgtatg gggtttggag 5640 

tggttttagt ttgggtgttg atttttagaa aggattggtt ttgttttttt ttagtttagg 5700 

gtttagttta ggagaaggta tagggaaggg aggataaggg tttttttgtg gggttgattt 5760 

ttaggagggg taggatttgg gagaagaagg agtgtaggga tagtttggtt ggggttattg 5820 

gggtttttgg tgtggggggt ggttaggttg tttagtgggg ttgtttgttt tggatttgag 5880 

gtggtgtttt ttgttggagt tgagaaaggt tagttttgag atgggatggg ggttgtttag 5940 

ggtgggtgat tgggttttga taggagtttt ttagggagtg attatatttt tttgttaggg 6000 

ttaagggagt ttgttttgag atttgtttgg tgtattttgg ttgtattagg ggtttatttt 6060 

atttgatagt tttaaggttt ttgtaggttt tgatttttta gtatttattt gttttttttt 6120 

gtatttgagt tatatatttt gtgttttaga agtggtatgg tttgttagtt tttttttgtt 6180 

ttattttttt agggattttt tgttttttta ttttgtgatt tttgagggat gggtttttgg 6240 

ttgggttttt tttatttggt tttagatttt tttttagtat tagatttagg tttttagttg 6300 

tgagttttgt tgtttttttg gttttattgt gagatgttta gaatggggtt ttgtttattt 6360 

ttttttttgt ttttttaggg ttatagtttt tattgttatt atgttttttt tttttatggg 6420 

atagtgaggg ttgtagtttt aggggaatgg gggagaatag gggtaggtgg gtttttagag 6480 

atttgttgga taatagtttt gaggttgggt taggtgtttt tttattttgt ggttataatt 6540 

tttgtatttt attggggttg tttttaagta gatagggtta gatttttagg ttgttttgtt 6600 

ttttttgtgt ttatttgttg atagaattgt tgagtgttta ggggtttgat ttagtttagt 6660 

ttttattttt attggttttt tagatgggtt ttatattttt ggtttaataa ttttgggttg 6720 

gatttgtagg ggagttaggg aggagttttg tttttggaaa ggaggttgat ttgatttggt 6780 

gagatatgtt ttgtgttaga aaggtttttt taaaagtaaa tttatttttg agttgagata 6840 

ggtgttttag gggtgagggg agtgtagagg atttattgta taatttttaa atgattgatg 6900 

ttgttgttgt agttttgaaa aggtttaggt tttttgtttt ttggtagttt agtrttggtgt 6960 

ttttgtagtt tagagggagt tttggtgagg ggtttaaggt aaagggtgta agggtttggg 7020 

ggtattggtt atttgttttt gttttgtgtt tttagggagt ttaggatttt ttgattaggg 7080 

tatattggaa gaggtttttt tttaagaagt agattgattt gtattttttt gtattttata 7140 

atgatgtttt tttgtttttg tgtttattaa ttgtttgtga tttttattat gtttattttg 7200 

tgagatagaa ttgtttaaag gttatattgt atgtggtggt tttggagaat atttgaattg 7260 

tttttgttgg gtttttagat gttggttggt tgtgtaattt ttattttatt gttgttttta 7320 

gggaaaaagg ggttagattt agtggtttat agttgtttta gtttttggag ttttaagttt 7380 

taagtagggg tgggtatttt tttaaggatt tgagtatagt gggatttaga tagagaattt 7440 

tgttgttttt taatgttatg aaatggggat atattggttt tagtaggttg aatggttttt 7500 

atttgttaag ggtgaagggt ttatgatggg ttattttagg gatgtggagg tagattgggg 7560 

ttagtgatta gaggtttttg tgtaattggt tttttgggat gtttagggtt ttagtgatgt 7620 

ttagtttttt atagggaata agattttttt tgattgtttg gttttatttt gtttattatt 7680 

ttggttattg tggtttttta gtttgaatag tgaagttatt ttaggaataa tgtttgttga 7740 

gtaggagggt gttgggtttg ggggatgaga aagatttatt gtatgtatgg aaattatgtt 7800 

ttttgtggag ggattgtgga gtttattatt ttgagttgtt ttaataggag gaggttttat 7860 

ttttttgggt tattgaggaa gaatagtggg tttttggttt tggagaatag ttggatggat 7920 

tgtttttttt gggaatattt gaggtaaaag gagggtgagg ttttaagagg attatgtaga 7980 

gtaagaaata tttggggaga attttagtgt ttggattttt ttgaatataa gggaagatag 8040 

ttttttttta gttagttttt tagggttttt ttatttttta tagttgttta agggtagtag 8100 

gtttttttgg ataagggatt atgtgtgttt agtggggttt atagtgatta ttaggattta 8160 

gtttagggta tagaggtgtt ttgaggattg ttagtggatt tgtattagtg gttttatatt 8220 

agtggttttg ttaggattag gttttgtttt attgggatgg gaaatttggg tagatttggg 8280 

atttagggta gggaggttat agggtttagg tttgaatttt agtatagtat atggtagggt 8340 

tgagagtaaa atttagggtt atgtttggat ttttaggttg gttattgttt ttttgatttt 8400 

agatgtttta tttgttgaat ggggatattt gggaatagta tttattttat gaagttatta 8460 

tggagatgaa agagttaatt gtttatatgg gtagtgtaga atgggtgttt ggtgagtgtt 8520 

tagggatgat tttttttggt agttgtttta tagaggttaa tattgtttgt attgtagtta 8580 

ttgtttttaa gtttgtttgg agggaagaga gtaggttatg tttatttgat tttgatgaat 8640 

tagttggttt gggttatgtt ttttagggag aaaatttttg agtttataga gttgtttata 8700 

gatattattg tttgtgtgta attgttgtag attattgagg taggttagag agtagatagg 8760 

tgttaagtat tagtgatatt ttgaggttat ggtatgaatt atagtggggt tttgtttggg* 8820 

ttagtagtgt ttagagttag ggttttttgt tgtttgaggt gttaatatgt ttgtttgtaa 8880 

tgtgtttgtg tatgtgtgtg tatatigtgta tgtgggtaaa tatgtttgtg tatgtgtgtg 8940 

ttgttttttt ggttaggttt ggttgtttta tttatgtgtg tatttagttt tttattattg 9000 

ttatttttga ggtttkgggt tagtattaga gtatttatg'g ttgtttttta attgtagttt 9060 

tttttgttta ggcftggtt.tt gggatatata tagtggtgga gggaagtgat tgttgttatr 9120 

ggattttaga atataaaagt taatattatt atttaatggt ttttttagtg tttttaatgg ! 9180 
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tattattttt ttttttttga tattttaatt gggtattttt ttttatgatt trttggatatt 9240 

ttagttagag gattttgtgg ggaaagtgtt gggtatatag taggggttta tttttttaga 9300 

tatgttattt aaaatttggt ttatttgttt ttttatgtag ggtttagggg atgttgaatt 9360 

ttagggttta gaaagagttt gggataaaaa gaaattttaa ggggatggtt ttgatttggg 9420 

ttgagtttat ttgtgttatt taattggagt tttaagtttt gaggtaggat gtttagatgt 9480 

tttagtgtag ggtttttttg attaatattt gtttttttgt atttattagt aattttattt 9540 

attttatttt taaagtatat rtggtttttg tatttaggag trttgtattt gtagatttag 9600 

taatagtaga tggaaaatat ttagaaaata aattggatgg ttatgttttt attgaatatg 9660 

tgtagatttt gtttttgtta ttatttttta aagaatatag tattatgatt atttatgtag 9720 

tatttgtatt gtattatata ttataaataa tttagtaatg gtttaatgta tatgggagga 9780 

tgtgtatagt ttatatgtaa atattaggtt atgttatatt agagatttga gtatttatgg 9840 

attttggtat ttttggggat tttagaatta attttttatg gatattaagg gatgattgta 9900 

tatatttatt taggaaggtt ttttattgga ggaaaggttt ggtttaggat agatagggat 9960 

attatttttg gattattgtt tatttattta tttatttatt ggtattattt tttaggattg 10020 

ttttaatgat agttttagta agtggagata agaaaaaaga ttggtttaaa tggtatagtt 10080 

ttgaggtttt ggaagatgtt gtattagtat gagaataggg ggtrtagtttt ttttaggatt 10140 

tagaaagtat attaggtagg tttggggaga ggaaaggaat atggtttttt tagtagttat 10200 

ataggtttgt agtaggatgg ggttggggtt ggggttggga ttggggttgg ttttigtttat 10260 

tatttgggtt ttatgagggg aaaagaaaga atagggggta gaggaggagt atgggggtag 10320 

tgggttgttt aaggagaagg tgttttaggg aaggggtatt agtttgtttt tatattgtta 10380 

taaagaaata tttgagttta ggtaatttat aaaggaaaga ggtttaattg atttttagtt 10440 

tagggaattt atagttatgg tagaaggtga aggggaagta ggtatttttt 10490 

<210> 414 
<211> 6416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 414 

♦ 

tttttgattt atttgtgatt gagggtaaga atgttttttt tttagagatt tggtgttttt 60 

ttttgtaaaa tgaggtttat tttgtttttt tttaagggtt ttagggaaat tttagttaga 120 

gtagtgaggt gttaggtaaa ttttgagggg tattttaatt gttgtgtgag gttaggaggt 180 

ttgaattttt ttattttttt tttttaggat aagggtatat aattggtttt gttaagtttt 240 

tttrtttgttt agatgttatg gagttggatt tgtttttatt ttattttagt agttttttgg 300 

aagatttttg tttagttttt gggatttttt ttgggatttt ttggtttttt gatatttttt 360 

tgrttgagga ggtaaagagg ttttagtttt tttttanttt aattartggt aggtatgatg 420 

gggtgtgggg tttgggggag gttagtgttg gataatatat agaggtttgt aggttattgt 480 

tttttttttt atattttttt tttttttttt ttttgttata ggaaattttg agaggaggag 540 

aggtgtgtta tttttttttt ttttattttt aatttttttt ttgagttttg tagttttttt 600 

ttatagagtt taatttttgg gggttttttt agtgtaaggg ggttgttttt ttgtgatgtt 660 

agttgttttt atgtgagttg ttttttagaa gggaagggag ggatgtatgg gttttgggtt 720 

tgtgggagat atatagttgt ttttatggag gtaggggatt ttggttagga gtttttgagg 780 

gtttagtagg tgtggaaagg gaatgaatta ttttttgttt ttttttaagt tgtgtgtaat 840 

ttttggggtg taggtagtaa aggtgtatag tgaggatggg gtttgtaggt ttgtggaggt 900 

ggtagtaggt gttatagttt gttatgtgtg tgaaatgttg gtgtagtgag tttatgtttt 960 

gagtgatgag atttgggggt tggtggagtg ttatttttat ttagtattgg gtaagttagg 1020 

tgtatggaat tatttgggtt gggagatgat gttttgtatt ttgggttagg tatgtggttt 1080 

atttaggaga tttggttgat ttttaataat ttttgttttt tgtttttgtt tttagagtgg 1140 

ggtttggagg attatgagtt tgtggtggaa gtgtaggttg tttggtttgt gggtggagat 1200 

agttgttttg ttttttggaa aaattttgtt aagtatgaat tgtttaagag ttttttagtg 1260 

agtgtatgag ggttgtttgg gtgttgggat gttttgattt ttaatttgga tgttggagtt 1320 

trtgatttttg atatttgttt atttatagta ttttttgttt ttagaaaaaa tggtttttag 1380 

ttgttttgat gtatatattg gtatatttta tgaagatttt atttaggtgg ggggattttt 1440 

tattttattg tagatttatg attttttagt attggttagt gtttttttat ttttaagttt 1500 

tgtgtttttt ttttatgttg tagaaagagt taaattatag ttttgtgtga ttggggatag 1560 

ttattttttt tgtttgagta ttagtttttt ttattaaatg ggggtgagaa atgtatgtgg 1620 

agtatttttt tgtaaaaatt tgagggtggg ttgggtatgg tggtttatgt ttataatttt 1680 

agtattttgg gaggttgagg tgggaggatt atttaagttt agaagtttga gagtttgaga 1740 

ttagtttggg taatataatg agattttgtt tttttaaaaa aaaaaaaaaa aaaaaaaaaa 1800 

aggttaggaa tggtggtatg agtttgtagt tttaggtgtt tgggaggttg aggtgggagg 1860 

attatttgag tttgggaggt tgaggttgta gtgagttgtg attgtgttat tgtattagta 1920 

tgagattttg ttttaaaaag aaaaagaaaa agagaaai:at tttggtggta gaggggaagg 1980 

gaggaggtta gatttgttat ttttttttgt ttttttttgg tttagaattt tttgaatgtt 2040 

ggt agtt ttt t tgaga tt ta gggttt tttg tagt tgtgg g gtttaggatg gaagt tttg g _2100 
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aaatgttttt tttgtttttt gtgttgattt ggtttttatt attttattaa gggtattttt 2160 

aagqtaaggt tttgagggta ttagttttag tttttttagt ttttggtttt tttagaagtt 2220 

gttttttttt tgttggaatt tttgagtttt tttttttttg ggttttttag gttagttatt 2280 

tttagtttat tatttttttt tatatttttg tttagtttat ttgtttaggg aggtagtagg 2340 

agaaaagatg attttagttt aagttttggt tttattttta tttgttgtgt gattttgggt 2400 

atttttttgt ttttttttgg ttttgaaatt tttttatttg gtttgtgggg ggaggtaata 2460 

gtgggtggga tttatttgta ttaagttttg tttatttatg ttgtttagga tttgaggtat 2520 

ttgtagtatg tggtagatgt gaatgagttt aatgtgtatg tggtgatgta gggttgtaag 2580 

ttttatggga tgtttattga ttttggtttt tgtgttaagg tgaagatttg gttaggtttg 2640 

gtttttggtt tggggaagta ggaattgttt aggtttttgg attttgtttg gggtttttga 2700 

gttttaattt agatatttag attttttttt ttgtattttg gtttgttgga aatttttgga 2760 

tttagttttg ttatgtggag taggggtaga tatgtggttt taaaggtaga tatgggatta 2820 

gttaattttt tttttttgta gtttaataag ttttgaaatg gttataaggg gttttggatt 2880 

ttttgtagtg aagatgagta gagttgtatt tgttggttgg ttgttttttg tttttttaag 2940 

gtgagatttt gggagtggta tggggggttg gtttggttag agggattttt agttttgttt 3000 

ttaggaagtt ttaggaatga ggagggtatt atagttttgt tttttgataa tttttagttt 3060 

aagtttgaag ttataggaag tgtttattga aggtagaaat atagttttgg tttgggtaag 3120 

ttttggtttg aggggggtgt tatagttatg tttttaggat tttttgattt taagtttttt 3180 

tttttttttt atttttagta tggggtgtag ttgtataaga attattagta ggtatagttt 3240 

tgttatttgt atttattttg tttgggtttt ttatttttgg tgagtgtgtt taaggggatg 3300 

ggagggtggg tatgtaggtt ttgttttatg ggtatttggg ttttgttttg attttttttt 3360 

ttttttttta tttttagaga agtgttttag ataatatttt ggtggttatg gatttttttg 3420 

gttatgttgg gtgtgttatt gagaattttt gggaggtttt gagtgtggtt ttggaggagg 3480 

tttaggtttg gagggtgagg tttgttgtgt gtgtgtgtgt ttgtgttggg gatttatttt 3540 

ttgggtggga tttttgaaat aggagggagg aagagagggt ggggggaggt ttttggttgg 3600 

gaagaagtgt tttatttttt tgaggtgttg ggtaatgttt ttaagtatgt tttgattttt 3660 

ttgtattttt tattgtttta tagaagaaga taaattattg ttttagtttg tttatgttag 3720 

tttttggtat gagttttagt gtaggtgggt gatggttttg agttttgggg tgggggttgt 3780 

tttaattttt ttttgtattt ttagtgggga gtagatggta taggggtttt ttttttaaag 3840 

tgattgttta tgtttttttt tattatagtt atttattgta tttaattttg gttttatggg 3900 

tgtatttttt gtgaggagag ttagtggttt attggatagt agggtttggt agatgggtaa 3960 

ggggtagggt tgggtaatag atttagggat aagagagatt ggggtttagg tggtagttat 4020 

ggtttttggg tagtaatgtt gttttatttt ttgttttttg gtagtttgtt tttggtttgg 4080 

gagagttagt ggaattttta gggttttgtt ttttttttgt gttatttgta gaaagtgaag 4140 

tattatttta ttttgttggt gagttttttt gtgtttttgg agttttgtaa tgagatatag 4200 

gatttttagt aatttgtttt ttttattagg tttttttaga ggttttttgg tttttagtgt 42 6Q 

gttttttttt tttttgtata agaagtggga ggttgagtgt ggtggtttat atttgtaatt 4320 

ttagtattta ggatggttaa ggtgggagaa tggtttgagt ttaggagttt gagattagtt 4380 

tgggtaatat agggagattt tatttttata aataatttaa aaatgagtta ggtatggtgg 4440 

tgtatatttg tagtttagtt atttaggagg ttgaggtggg agtattgttt gagtttagag 4500 

ggttaaggtt gtagtgagtt atggtggtat tattatattt tagtttggat gatatagtga 4560 

gaaaattgtt ttaaaaaaaa aaaaagaaag aaaaagaaaa agaaaaaaga aaagaataaa 4620 

aggaaatggt ggggttttgg tttaggaggg agttttttaa gttttgggtt tttttttgaa 4680 

tttttatttt ttttattgta ttttagagtg aggaggaggg ttgtttgtat tttagtatgg 4740 

atgatggtta gatttgtttt attgatttgt tgtagtttgt ggagttttat tagttgaatt 4800 

gtggtatttt gttgtgtttg ttgtgttatt gttgtatgtg ggtggttttt tgattaggtt 4860 

gtggattggt ttatgtttta gtttgttttt aggttgtttg ttgttttttt atttatttag 4920 

tggattttgg ggtgtggtta taggggatgg gatgaggagt gggagggttt tgttatttta 4980 

gttttttttt ttgttttttt gtttttttta gatagaaaat agtttttatt ttagtttatt 5040 

tttgattttt ttttttaagg gaaggttttg ggtggttttt tttttttttt ttagttttgg 5100 

aggtgttgtt ttagggtagg gaattatggg agaagtgggg gtagtttagg tggttttatg 5160 

ttttatattt tgtatagatt gagaggttag ttgatttgtt ttgttttata ttagtgataa 5220 

taaagattat tttttgatat atttatgagt tttgtttggt aaggtttggt tggttgaatt 5280 

aagaagggaa ttagagttgg atgtggtggt ttatgtttgt aattttagta ttttgggagg 5340 

ttaaggtagg agaattgttt gagtttagga gtttgagatt agtttgggta atatggtaag 5400 

attttgtttt tataaaaaat aaaaaaatga gttgggtatg gtggtgtgta tttgtagttt 5460 

tagtttttta ggaggttgag gtgggaggat tttttgtttt ttgagtttgg gatgttaagg 5520 

ttgtagtgaa ttgagattgt gttgttgtat ttagtttgag tgatagagtg agattttgtt 5580 

tggaaaaaaa aagaggggag atttggagag gtgggtattt gtggaggttt tggtggaagt 5640 

gttaaatgga ttgaggttat gttttagttt gtttggatgt tttttaaggg taggagtggt 5700 

tatttgagag tttttagtgt ttattatgta gtttgatata tagtaggtgt ttagttattg 5760 

ttadttaagt aagtgaatag ataagagatt attattttag agagattttt tgatagttta 5820 

agtttagaga ggtaattaat agggtttggg agttggagat gagtttgata gtatgtttgt 5880 

tttttgttag ttttggatta gttgatggat agttggtttt agtatgtata tagttatgta 5940 

tttagtttta ttattaagat atagaatggt tttattattt ttaataagtt tttgtgtttt 600Q 

ttattattag ttatgttttt ttttatttaa attgaaattt tttttttgtt ttagttttgt 6060 

tttttttaga aagttatata aatggagttt gttagtattt agttttttgt atttggtttt 6120, 

tatttttaat atatttttta gtatttagag tgattttttt ttaaatataa atttattatt 6180 
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tttggtaggg tattgtggtt tatatttgta attttagtat tttgggaggt tgaggtgggt 6240 

ggattatgag gttaggagtt tgagattagt ttggttaata tggtaaaatt ttatttttat 6300 

«22*™«i a aaaatt agtt gggtatggtg gtgggtattt gtaattttag ttattaggga 6360 

ggttgaggta ggagaatagt ttgaattggg gaggtagagg ttgtagtgag ttaaga 6416 

<210> 415 
<211> 6416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 415 

H«H g iH a "Sjtaatttt tgtttttttg gtttaagtta tttttttgtt ttagtttttt 60 

tagtagttgg gattataggt gtttgttatt atgtttagtt aatttttgta tttttagtaq 120 

agatggggtt ttgttatgtt ggttaggttg gttttaaatt tttgattttg tgatttgttt 180 

SnHSS^J tttaaagtgt tgggattata ggtgtgagtt atagtgtttt gttgggagtg 240 

gtgaatttat atttgagaaa agattatttt gggtgttgaa ggatgtgttg gagatggaag 300 

ttagatgtag agggttgaat gttagtaggt tttatttata tggttttttg graggggtag 360 

?aaaggaggt tttagtttgg gtaggaggga gtatggttga tagtaggggg 420 

J«?S22«S tgttgggggt gatgaggtta ttttgtgttt tggtggtggg gttggitgti 480 

taa "9tgtg tatgttgggg ttaattattt gttagttagt ttaggittag tggaggataa 540 

g ? a HH" t "aattttta ggttttgtta attatttttt tagatttag? 600 

J£!E53£%Z llllZ^Z 9 gatgatggtt ttttgtttat ttatttattt aattagtait 660 

^"gggtgt ttattatgtg ttaagttgta tggtgggtat tggagatttt tagataatta 720 

2*Hg"*5 5S*H g * atat ttaggtaggt tgagatatgg ttttgattta tttgatattt 780 

££222°C "ttataggt gtttattttt ttaggttttt tttttttttt tttttagata 840 

gg gttttatt ttgttattta ggttgagtgt agtggtataa ttttagttta ttgtaatttt 900 

gatattttag gtttaaggaa taagggattt ttttatttta gttttttgag aagttgqqat 960 

tataggtgta tattattatg tttggtttat ttttttattt tttgtaggga tagggttttq 1020 

S1E2S2S ta 993tggtt ttaaattttt ggatttaagt aattttttta ttttgjtttt 1080 

gg J a H a B g gtatgagtta ttatgtttag ttttggtttt ttttttgatt 1140 

I a S5 9tta ggttttgtta gatagaattt ataggtgtat taaaaaataa tttttattgt 1200 

tattagtata aaatagagta gattaattgg ttttttggtt tgtataaagt qtqqgqtqtq 1260 

aaattgtttg ggttgttttt atttttttta taattttttg ttttagagta gtatttttag 1320 

agttaggaga aggagagggg gttatttaag gttttttttt gaggagaggg gttagqaqtq 1380 

S?™? 9 ,^ 9 . ?22fll Q ZZZ tttatttgag ggaggtaaag aagtagagga gaaaattgga 1440 

gtggtggaat ttttttgttt tttattttgt tttttgtggt tgtgttttag agtttattgg 1500 

f£2S3£2S2? 9 2S tgg i ggg tagtttgaag gtgggttgag gtatgagtta gtttatggtt 1560 

ti%lZnf£%% i^r? tg ^tagtaatg gtgtagtaag tatggtagga tgttgtggtt 1620 

^ g " g 9*99 aattttatga gttgtagtag gttagtgaag tgggtttggt tattatttat 1680 

gttgaagtat aggtggtttt tttttttgtt ttggggtata gtaaaggggg tqqaaqttta 1740 

gggaggggtt tgaggtttgg gaagtttttt tttgggttag ggttttatta ttttttttta 1800 

££££££ 3™H tttt: ! tttttttttt tttttttttt ttttgagata gtttttttat 1860 

HH5J»S£ ag 8 t H§ a S t gtggtggtgt tattatggtt tattgtagtt ttgatttttt 1920 

?£?S2£ a "* tag :"" "gagtagtt ggattatagg tgtgtattat 1980 

tatgtttggt ttatttttaa attatttgta gagatggggt ttttttgtgt tqtttaqqtt 2040 

ggttttgaat ttttgggttt aagttatttt tttattttgg ttgttttaag tgttgigatt 2100 

ataggtgtga gttattgtat ttagtttttt attttttgtg tagggaaaag ggagatqtat 2160 

^???ggttag ggagtttttg gaggggtttg gtgaggagga taggttgttg ggagttttgt 2220 

2SS2i?* agga "" g 9 ggatgtaggg aagtttattg gtaggatgag ataatgtttt 2280 

tJH£«Sl a sgtggtataa agagaggata aagttttggg ggttttgttg atttttttgg 2340 

?5H?i??3i a Sfttgttaga agataggaga tggggtagta ttattattta aggattatgg 2400 

" g H a !H 9 9 a H ttagtt ttttttattt ttgggtttgt tgtttggttt tgttttttat 2460 

ttgtttatta agttttgttg tttaataagt tgttggtttt ttttatggga aitgtgtttg 2520 

tggaattaga gttgggtgtg gtggatggtt gtggtgggga aggatatggg tggttatttt 2580 

ggggagggga a tttattttt tattgggaat atlaagagla gttgagg^ag 2640 

3" ttg 3 ttt: aggatttggg gttgttattt atttgtattg aggtttgtgt tggaggttgq 2700 

5828E52? ^? a99tgg 3 Sgtttgtttt tttttgtgga gtagtggglg atatgggaga 2760 

•?JS222S£ 2H3HH g H5! attatttagt attttaggaa ggtagagtat ttttttttig 2820 

^ a 9999ttt ttttttgttt tttttttttt ttttttattt tagggatttt atttagagag 2880 

■j282S tt1: S 9tataaatat atatatatat agtaggtttt attttttagg tttgggtttt 2940 

Sr^SS 9 !* atat ttagag ttttttgggg gtttttaatg atatgtttig tatggttaga 3000 

gaagtttatg gttattaggg tattatttga ggtatttttt tggagatgga aggagagglg 3060 

a ? a ??^nn? a JSSSSre** ggtatttgta agatagggtt tgtatattta tttitttat? 3120 

^ttttgggta tatttattaa ^gggtggggag Jtjttaaataag _atggatgtag atggtgagat 3i80_ 
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tgtgtttgtt ggtaattttt gtatagttgt attttgtatt gggggtgggg agagaaagag 3240 

agtttgaggt tgagaggttt tgggggtgtg gttgtgatgt tttttttaga ttaggatttg 3300 

tttagattag ggttgtgttt ttgtttttga tgggtatttt ttataatttt aggtttggat 3360 

tgggggttat tagagagtaa ggttgtgatg ttttttttat ttttgagatt ttttgagggt 3420 

agagttgggg atttttttgg ttaggttagt tttttatgtt atttttaggg ttttattttg 3480 

aagaggtgga aggtagttag ttagtaggtg tggttttgtt tatttttatt gtagaagatt 3540 

tgaagttttt tgtggttatt ttgaagtttg ttgggttgta gagagaggaa attgattggt 3600 

tttatatttg tttttaggat tatatgtttg tttttgtttt atatagtaga gttgaattta 3660 

ggagttttta gtaggttagg gtgtagggag aggagtttag gtgtttgaat tggggtttag 3720 

aagttttggg taggatttag gggtttgagt agtttttgtt tttttaggtt aggggttagg 3780 

tttggttagg tttttatttt gatatagaaa ttgaagttag tgggtatttt gtagagtttg 3840 

tggttttgtg ttattatgta tatgttggat ttgtttatat ttgttatgta ttgtaggtgt 3900 

tttggatttt aagtagtatg agtgagtagg atttggtata ggtagatttt gtttattatt 3960 

attttttttt ataagttagg tggggagatt ttaggattag agaggggtag gggaatattt 4020 

agggttatat agtaaataga agtagagtta ggatttaaat taagattatt tttttttttg 4080 

ttattttttt gggtaggtga gttaggtaag gatgtaggga gagatggtaa attggaggtg 4140 

gttggtttgg ggggtttagg gggagaaggg tttagaggtt ttagtagaga aggggtaatt 4200 

tttaaaagga ttagggattg gaggggttgg ggttggtatt tttaagattt tattttagag 4260 

gtgtttttgg tggagtaata gaggttagat tggtgtaaga agtagaaaaa gtgtttttaa 4320 

agtttttgtt ttgaattttg tagttgtaga aagttttgga ttttaggaaa gttgttagta 4380 

tttaggaagt tttgagttag aggagagtag aggagagtga taggtttgat tttttttttt 4440 

tttttttatt attaggatgt tttttttttt tttttttttt ttgagatagg gttttatgtt 4500 

agtgtggtgg tatgattata gtttattgta attttgattt tttgagttta agtgattttt 4560 

ttattttagt tttttaagta tttgggatta taggtttatg ttattatttt tggttttttt 4620 

tttttttttt tttttttttt tggagagatg aggttttatt atgttgttta ggttggtttt 4680 

aaatttttaa atttttaggt ttaagtaatt tttttgtttt agttttttaa agtgttggga 4740 

ttataggtat gagttattgt gtttagttta tttttaggtt tttataagga aatgttttat 4800 

atgtattttt tgtttttatt taatagaaga aattgatgtt taggtaggga aagtgattgt 4860 

ttttagttat ataggattgt gatttggttt tttttataat atagagggga ggtataggat 4920 

ttaagggtgg agaagtattg gttaatgttg gggagttgtg aatttgtagt gaaatggggg 4980 

gtttttttat ttggatgagg tttttatggg atatattagt gtgtgtattg agatagttgg 5040 

agattatttt ttttgggaat agggagtgtt gtgggtagat aagtgttagg gattagggtt 5100 

ttagtattta ggttgaggat tagggtattt tagtgtttag atagttttta tgtatttatt 5160 

ggggagtttt tgaatagttt gtatttggtg aagttttttt ggaagatgaa gtggttattt 5220 

ttgtttatgg gttaggtagt ttgtattttt attatggatt tgtggttttt taaattttgt 5280 

tttggggata agggtaaaaa gtaggggtta ttggagatta attagatttt ttggatgagt 5340 

tatatgttta gtttaagatg taaggtatta ttttttagtt tggatagttt tatgtatttg 5400 

atttatttag tgttaggtgg gggtggtatt ttattagttt ttaggttttg ttgtttaagg 5460 

tgtgagtttg ttgtattagt attttatata tgtggtgagt tgtggtattt gttgttattt 5520 

ttatagattt gtaggtttta tttttattgt atatttttat tatttgtgtt ttaggaattg 5580 

tatatgattt gaggggaggt aaagggtgat ttattttttt tttgtatttg ttagattttt 5640 

agggattttt aattaagatt ttttgttttt atggaagtgg ttgtgtattt tttatagatt 5700 

tagaatttgt gtattttttt tttttttttt gagggataat ttatatgggg gtggttggta 5760 

ttgtggggga gtagtttttt tgtattggag gggtttttga gaattgggtt ttgtgaggga 5820 

ggattgtaga gtttagggaa ggggttgggg atagagggga gggaggtggt atgttttttt 5880 

tttttttgaa gttttttgtg gtaagaagaa aaagggagag aggtgtgggg aagggagtag 5940 

tggtttgtaa gtttttgtgt attatttagt attgattttt tttaagtttt atgttttatt 6000 

gtatttgttg gtggttggga tgaggagagg ttgggatttt tttatttttt taggtagagg 6060 

ggtattaggg ggttggggag ttttaggagg ggttttaggg gttgggtaaa ggttttttgg 6120 

agagttgtta agatgaggtg gagatagatt tagttttatg gtgtttgagt aagagagggg 6180 

tttaatggaa ttagttgtgt gtttttgttt tggggagagg ggatgagaga atttagattt 6240 

tttgatttta tatggtggtt agggtgtttt ttagagttta tttagtattt tattgtttta 6300 

attggagttt ttttgaagtt tttggggaag gatagaatga gttttatttt ataaaagggg 6360 

atattgagtt tttgagaaga aagtattttt gtttttagtt ataggtgagt tgggag 6416 

<210> 416 
<211> 6001 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 416 

ttttaatatt agaaggaata aattttgaat atattatttt taagaattgt aatfatttatt 60 

gtgagggttt atggttttat ttttgaagtt agtgagatta agaatttatt aattttggat 120 

ataaggtgat aggttgaggg tggtggtttg gttttgggtt tttttggggt ttttttaggg '180 
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aatgttttgg tatttgttga ttgagttttg ggaggtagtt ttggtatata gttttttgat 240 

atgatttgtt ttttattttg gggtgttata tatgaaggga ggtgattgtt gtgatggtgt 300 

tggtaggatt gttgtttttg atgtggggtg ggttgagtta ggtttgaaat atgggttttt 360 

aggttgagtt ttgttttttt tattatattt agggttgatt gatattttta gttagtttat 420 

tttggttttt tttttatatg ttaggataat gtagtttttg ttattaattt gggtagttag 480 

agxtgggtta gtgggggata tgggattatg ggtaagggta attgatattt gtttagtttt 540 

aatgtatttg ttttaaatgt ggttaggtgg tggggtaagt aggaatgagg taggggtggg 600 

gttgttttga ggaggatgat tttaatgagg gtgtgagtag gggatttaag ttggaattat 660 

if^S 9 .** 11 tatt 9 ta tat tagagttttt ggttagggag taggttgggg attaggtatt 720 

ttattttagt ggggtatagt ataaagtttg tagggggatg gggttattag aaagttgatg 780 

atatgagagt ggttgggttg gggttgtttg gtggttatgg agaagttgaa gtgttgtagt 840 

agggaggtga agaagaggaa gagttttatg tgggttaggg gttttttgag gtatgtatgg 900 

tggtttgtgg ggaggggagg ggtgttagtg agtttggttr ttgggtgata tttttgtaag 960 

attttatgga aggggatagg gagttgggtt ttttataggt atttgttgag aaaggtagga 1020 

aggtttttgg ttttataaag tggttttggg tatttaggaa gtgtttgggg tggaagtgga 1080 

ag . 99 H tttt tta 9 a "tggtt ttatttttta gtattgatga taggttggtg atgagtgttg 1140 

J^i^S" 1 a 99agatgta gggtgagagt ggggattgga ttttaggatg ttgggatttt 1200 

tgttattaaa tatatggggg atatatattg tttggtatat agttggattt tgttaattag 1260 

ttttgtgttt gagaagtttt atagtatttt ttttgatttt atagtagggt gtagttatat 1320 

tttttagagg tatttatatt gttttttttt tttgtaggtg ttgggttttt taatattttg 1380 

gtaggttttg atttgttttt tttattagat tggggttttg gatggatagg ttagttttgt 1440 

ttatattttg gattttttat ttaagtgggg atagttagtg tggtggtatt gaggattagg 1500 

tggttagggt ttttagagtg ggtttatttg gtagtagtta tgttggggtt attattaggg 1560 

gttggtgttg agttggggtg aggagggtgt taggtttatt ttagggatgt ggaagttttg 1620 

tattttgatg ttatgggatg ttatatgggt tatatttagg gggatgatgt ttttaaagtg 1680 

t-tgtattttg tgaattatgg tagtggtgta gggtatgtga gtttggttat ttatttttgg 1740 

* t9 H 9 !: att tQttttatta tgttgttgat tttttgttgg atatggattg gatagatatg 1800 

tgtttttata atgggttagt atttagggga tatttttttt ttttttgtgt tggaggaagt 1860 

taggtttata ggagtttggt tatgtttgtg ttggaagttt tgggtgtttt agttaagttt 1920 

aggggttttt agttgtattt tttttttttt agtttttgtt ttgggtttta gttgggttta 1980 

tgttgtatat ttaggtgtag gattatgagt aggaggtttt aggttagtgt ggttgaggtg 2040 

gttattattt tggtaaggaa taggttattt attattatgt gtaggttttt attattgaag 2100 

ttgtttttag ggtttttttt ggtttgagta gggttgagag gatatttagg ggatagaatg 2160 

gggtagtttt taaatgattt ttaattttgt atttgttagt ttagatgtgg tttgttgggt 2220 

gatgtattgg tttaattttt tgtttagttt ttttttattt tttttgggat gtttaattta 2280 

ttatttttgt tttttattgt ggtagttatt tttatttttt ttttttttgt taggaaggtt 2340 

ttagttaggt tttggggtgg ttgggttggg ttttaggtta ttttgtgttt agttagtagt 2400 

ttatttagtt gggttaggaa agttttttgg aagtgtagga ttttgttagt tagtgttggg 2460 

atgtgtggga ggatggggat agtatttagt atttatatta gatagaatgg ggttttaatt 2520 

ttttttgtgt tttgtgttta tttggattag ttttaggttt tagttatttt taggaagatt 2580 

"?P£H at tattgatttt attaagtttt tttttaagtg ttagttttta 2640 

tttttttttt ttttgtttag aggagaaatt taaaattgaa atttttaatg tggatgggqg 2700 

tatagagttt ttggtttttt ttggtgtttt ttgatttggg tatatttttt ttatgattat 2760 

g "!: 9aga £ 9 ! t IHH t I tt1: ttttaggttt tttttatagt ggggtttttt ggaatgtttt 2820 

tttttaaatt tatttatgta aattttgttt tttggaggtt ttagtttagt tttggtattt 2880 

tttaggagtt tgttttgtag agattttttg gttttttgtt ttgtattttg tgtaggaagt 2940 

IM^HH*" "ttagtttt ttttgagtta ggtttagtag tttgaggaag tgagggttgt 3000 

H tat H°!5 Stggtgtttg taggtgaggg aggtgattat gttgtttatg gttttgttta 3060 

a ?^?gttgtt ggggtgaaag gggtgttttg ggggtgggag atgtgggtaa ggggttgttt 3120 

tttttgtttt ttgttttttt agtttttgtt ttgtgttttt ttgtttatta tttattggtt 3180 

i?2H 9 23§ a a 99t9gtata aaggtaggtg gtttttttgg ttatttattg ttttagtgat 3240 

tttttgttta ggtttaagtt gtgtaaggtg gatatggaga agtgtttttg tttgtgttat 3300 

9"E g fgtttat agtgtgatag gattattttt gggggtggga tgggtatgtg ggtgttgtta 3360 

^ aa 99«tt ggttttattt tttgttattt attttaattt tggtgtttta tlaggttttt 3420 

HKSSSi* a 9tttggttt agttgggtat agggtttatt ttttgtttat ttatattgtt 3480 

tttttgtttg gggtggggtt tggttttatt ttgtttttgt ttattttgat tattttttta 3540 

£H2 9 §* a 2 a t ttt: 9tttg ttttgtttat attgagtttg tagtataggt gtggtttttg 3600 

" a £*?" a * JSjES&E 5 ag "H 9 3 t tatt 999ttt ttggtgggtt tgggtagtag 3660 

JPJgttttt ttttagttta tagatttgta ttttttttgt gtaggtggtt ttttggttta 3720 

£ tat 5£ gtt gfl"tttatt tttgttttat gtttaggatt ttatgttttg 3780 

■S8KSS2 gttattgttt atttggggtt tatggaaatg tggtttttgt 3840 

tttttattgt tgtttgtttt gggaatgtgg tttgaagttt aggatttggt agatgqqtat 3900 

aggtgggtgg ttggttgtgt ttttgttgtg ggttattatt gttttgtgta tggttgttiq 3960 

SKEJi 1 * 9 9tg " ta " ? tlgttgtagg ?tgaa?atgt ttttjSStg 4020 

ISStfSS l?^i?22H a f gS^agggt tttttgttaa gttaggatta ttttagatta 4080 

taggttttag ttttatttga attttggatg atttttgggg ttattaggag tgaataaata 4140 

'SSSSSg? KgSES'SS; 1 ?^ 99tg9999? 9 ggggttata? «??ig?|at 4200 

ggaggaatt t agt ttggatg tgttatttag gtatgatttt gtaagagi-i-a ttaaaattgt 4260- 
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tgagaggttt tagttagtat tttattttta gatgatggtt tatgttggtg agtagtgagg 4320 

tttgaggatt tatagtgtaa aaggtttgaa ttgggttatt gtattttttt tatttttgat 4380 

tttgtgattt aaatggtatt taggattaat ttatttttta tttttaaggt tttttttttt 4440 

ggtgttagta gaagggattt tgtattttat aatatatgtt gtttaatggg tttgtatgtt 4500 

tattgttaag tttagtttta tttttaggtt tttgttttat ttttttttgg tttttggaaa 4560 

atttagtttt ttatgttatg tataaatgtt tttttttagg atgtttttta aatttgtttt 4620 

ttttttttag tttggttttt gatttagttt gtggtttaat ttattattta tgtttgttgg 4680 

tggtggggta tttttaggat ttttgttgtt ttttaggatt ttttttttta tttggttgaa 4740 

gtagtatggt gtgttttgga agtttatatg tagtaaggtt gtttagtttg ggtagtggta 4800 

ggggatttgg tgggtagtgt gtagtttagt gttggtgttg gtgtattagg tttattagga 4860 

gtaggaagat ggttattatt atggttaggg gtattagtgt ttttagtttt atggttgttt 4920 

tattattaat tgggtttttt tggatatatt tggtattttt attttattag gtatagagga 4980 

ttaggtagga tatttttggt atattgagtg tgtgattttt tttttataaa gggagttgat 5040 

gatggttttt gttttttgtt gtgagtgaat ttgttgtgtt gattgtgttg ttagtggtag 5100 

agttaggtta gggtaggtat gggttgtttt agaggttttt gttgttgttt tttgttttag 5160 

gtttttattt agggtagggt ggtagaaagg tttggttgga gaagttattt ttttttttta 5220 

ttttaagttt tttaagttta tataggtttt tgggataatt agggttttag tggatttggt 5280 

tatttatttt ttagttaggt ttatatattt taatgtagtt ataatttttt ttttagaata 5340 

tgattttgtt ttttttttat ttttatttgt ttattttaga gtgattttta gtatttttat 5400 

ttgttattgg tatttatttg gggttttaga gtttttgatg atgagtggta ttatgggttt 5460 

ggttttttta ttttattttg tatttttgat atgtatagat gttatgtata tatttgatgg 5520 

tgtatagatt ttttgtttat ttttagatat ttgtttattt gtttatattt gtagggatat 5580 

gattatatat gtagaaaatt atttatataa agataatatt tatatatata tagatttata 5640 
ttgatattta gggtatatat ttttttttat atatattagt tatatatata tatagatttg . 5700 

gtattaagta ttttattttt tagttatgtt tgaggttttt tggatgggat tttttttgtt 5760 

tagaggttgt ttttggtgag ttttaaagtt gttatatgga ttttagttta gtttatattt 5820 

tgggttttgg ttgggttatg attttttgtt tgtaataggg ttgtttttag agtttttagt 5880 

tggtagtttg aaggtttttg ttttagtttg tgatagtatt ttttagggtt gtttgagggt 5940 

tgttattttt tattgttttt tggtttttat gtttttgatt agaaatttgg tggaaatatt 6000 

a 6001 

<210> 417 
<211> 6001 
<212> DNA 

<213> Artificial Sequence 

<220> • , 

<223> chemically treated genomic dna CHomo sapiens) 

<400> 417 

taatgttttt attagatttt taattagaaa tatggaggtt aggaagtagt ggagaatgat 60 

gatttttagg tagttttgga ggatgttgtt ataggttggg gtaagggttt ttaggttatt 120 

aattgggagt tttgggaata gttttgttgt aaataggaag ttatggtttg gttagagttt 180 

agaatgtggg ttgagttggg atttatgtga tagttttgag gtttattggg agtagttttt 240 

ggataggaga ggttttattt aggaaatttt gggtatggtt gggaagtggg gtatttggtg 300 

ttgggtttgt atgtgtgtgt gattggtgtg tgtgagagag aatgtgtgtt ttgagtgtta 360 

gtgtgagttt gtgtatgtgt gaatattgtt tttgtgtggg tgattttttg tatgtgtaat 420 

tgtgtttttg taagtgtgaa taagtggata agtgtttggg agtggataag agatttgtgt 480 

attattaggt gtgtgtatag tgtttgtgta tgttaagagt gtaaggtgaa gtgaagggat 540 

taggtttatg atgttattta ttattaggag ttttaaggtt ttaggtaagt gttagtgata 600 

gataagggtg ttgaaggtta ttttggagtg ggtaggtggg ggtagggaaa gggtaaggtt 660 

atgttttgga ggaggggttg tgattatatt agggtgtatg agtttagttg ggaggtggat 720 

ggttgggttt attgagattt tggttatttt agaagtttgt gtgggtttgg ggagtttgga 780 

gtggggagag ggggtgattt ttttgattag gtttttttat tattttattt tgggtaaggg 840 

, tttggagtag gaagtagtgg taaggatttt tggagtagtt tatatttgtt ttggtttgat 900 

. tttgttattg gtagtatagt taatatagta ggtttattta tagtagaggg tgaaggttat ^960 

tattagtttt ttttataagg gaagggttat gtgtttggtg tgttgagagt gttttgtttg 1020 

gttttttgtg tttggtgggg tgggggtgtt aggtgtgttt agaggagttt agttggtagt 1080 

'gaggtagtta tggggttaga agtattggtg tttttggtta tgatagtggt tatttttttg 1140 

tttttggtgg atttgatgta ttggtattaa tgttgggttg tatgttattt gttaggtttt 1200 

ttgttrattgt ttgggttggg taattttgtt gtatgtggat ttttagaata tattatattg 1260 

ttttgattag gtgagggagg aggttttgga gggtggtaga ggttttgagg atgttttatt 1320 

attagtaaat atgggtggtg ggttaaatta taggttggat tiagaagttag gttgagaagg 1380 

ggaagtaggt /jttgggggatg ttttggggaa ggatatttat atatggtatg aaggattgga 1440 

;tttt£taaag gttaaggaag . agtagggtaa gggtttggag gtggagttgg atttggtagt 1500 

gggtatgtaa gtttattggg taatatatgt tatggagtat aaagtttttt t ( tgt,tgatat 1560 

t^gaaggaaa ggttttggga atggaagatg agttagtttt gagtgttglut taaattatga 1620 
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aattgaggat gaagggggtg tagtgatttg gtttaaattt tttgtattgt gggtttttqq 1680 
gttttattgt ttattggtat ggattattat ttgggaatgg gatgttaatt ggggtttttt 1740 
ggtaattttg gtgatttttg taaggttata tttgggtgat gtatttaaat tgagtttttt 1800 
B 3 " 3 ! 3 ?. 33 Sgtgtgattt ttatttttgt tttaggatta ggaggttggg tttttttttt 1860 
i a1: IiS ttt: atttttggta g^tttggggg ttgtttaagg tttaaatagg attaggattt 1920 
gjagtttggg gtgattttgg tttgataaga ggttttgatt ttttttttqt agttatqota 1980 
ttgrrrtggg gatgtgttta gtttgtagtt ggtttggatg ttggtggttg tgtttaatgg 2040 
2 l 9 2l2 g l Z Qtgtgtgagg tgatggtgat ttgtggtgag gatatggttg attgtttgtt 2100 
tgtgtttatt tattaggttt tgggttttgg gttgtgtttt taaggtaagt ggtggtgggg 2160 
gatagagatt gtgtttttgt gggttttggg tggatagtga ttgtagttta agtagtgttg 2220 
atagggtgtg gggttttgga tgtgaaatag agataaaggt tagtgagtgg gttgaggata 2280 
^?gg«agg aaattatttg tatgggggag gtgtgagttt gtgggttggg agggggtggg 2340 
gttattgttt agatttgtta gaagtttggt gggtgaggtt gatgtgttga agtggtggtg 2400 
gtggggattg tgtttatgtt gtgggtttag tgtgggtggg atgggtggga ttttttttga 2460 
2H2 a ag H t gg^tagggtg ggtagagatg aggtggggtt aaattttgtt ttaggtaggg 2520 
2 a HH ggtgagtaaa gagtgggttt tgtgtttagt tggattgggt tagggattgt 2580 

gggagatttt gtggagtgtt agggttggag tgggtggtgg agggtggggt taaggttttt 2640 
atggtaatgt ttatgtgttt gttttgtttt taggggtgat tttgttgtgt tatgggtttg 2700 
tgtggtgtga gtagaggtgt ttttttgtgt ttattttgtg taatttgggt ttgggtaaga 2760 
agttgttgga gtagtgggtg attgaggagg ttgtttgttt ttgtgttgtt tttgttgatt 2820 
aagttggtgg gtgatgggta gaagggtata aagtgggaat tgggaaggtg ggggatggag 2880 
aaggtaattt tttatttgta ttttttattt ttaggatgtt ttttttgttt tiitggtttt 2940 
ttggataaag ttgtgagtaa tgtgattgtt ttttttattt gtgggtgttg ttttgagtat 3000 
9a i ga ""! jES**?? 9 g"g«ggat ttagtttagg agggattgaa ggaggagttg 3060 
ggttttttgt gtgaggtgtg gagtgagaga ttgaggagtt tttgtagggt gagtttttga 3120 
gaggtgttgg ggttggattg gggtttttga agggtaggat ttgtatagat gggtttggga 3180 
S 3993 ?: 3 *" taggagattt tattgtaaga agggtttgga ggaggagggg atattttaga 3240 
^ a J 99 "?H ggagaggtgt gtttgggtta gggggtatta ggagaggtta aggattttgt 3300 

atttttgttt atgttggaga ttttgatttt aggttttttt tttgggtaag glgagagagg 3360 

S* 99399 " 9 . gtatttgggg agggatttgg tgaggttagt ggtaaggata ggtaggtttt 3420 

gggttttttt ggagatggtt ggggtttgag attggtttag gtgaatgtag agtataggag 3480 

ggattgagat tttgttttgt ttggtgtagg tgttgaatgt tgtttttgtt tttttgtata 3540 

""agtgtt ggttggtaag gttttatgtt tttaaaaggt ttttttgatt tagttggatg 3600 

a9 " 9 H aa 5 t gagtatagg atgatttggg atttagttta gttattttga gatttgattg 3660 

^9"" ! ggtaaagaag gagaaggtga gagtggttgt tatggtgggg ggtaagggtg 3720 

5* gg91:tgaa tgttttagga ggaatgaggg gaggttgggt aaaaggttgg attagtgtat 3780 

tatttggtga gttgtatttg ggttgatagg tgtagaattg gaggttattt gggqgttatt 3840 

" g £i" a " «ttgagtat ttttttggtt ttgtttaggt taaggggagt tttglgagta 3900 

§HH aat9a tgagaatttg tgtatagtgg tgggtaattt gttttttgtt gggatggtga 3960 

H a H5H at Htgttggtt tggggttttt tgtttatgat tttatatttg gitgtgtagt 4020 

?^ 9a §H ta ? tt: 9gggttta aggtagggat tgagggagga agggtatagt tgggggtttt 4080 

? 99 ?iH ag i tgggatattt ggggttttta gtataggtgt ggttaggttt ttgtaigttt 4140 

aa "*""J SHHS 939 gaaggagagt gttttttggg tgttgattta ttgtggggat 4200 

gtatgtttgt ttagtttgtg tttaatagga gattgatgat gtgatagggt aggtgtggtg 4260 

a J ta9a9atg ggtgattagg tttatatgtt ttgtattatt gttgtgittt atgaggtgta 4320 

ftg"ttggg gatattattt ttttgagtgt gatttatatg atattttgtg atattgaigt 4380 

atagggtttt tgtattttta aggtaggttt ggtgtttttt ttattttagt ttagtattag 4440 

tttttggtga tagttttagt atggttattg ttaggtgggt ttattttagg aattttggtt 4500 

anll?%£lZ lltl^ll^l atatitgattg tttttgtttg gatggggggt ttagagtata 4560 

ggtagggttg gtttgtttat ttagagtttt agtttagtgg ggaagataaa ttaggatttg 4620 

" a H aa 85 ggaggattta gtgtttgtag ggagaggggg tagtgtgggt gtttttgaga 4680 

ggtgtgattg tgttttgttg tggggttgga gagggtattg tggagttttt tgggtgtagg 4740 

a " a SH2 at agagtttagt tgtgtgttag gtagtgtgtg ttttttgtgt gtttggtggt 4800 

aggggtttta gtattttaga gtttagtttt tatttttatt ttgtattttt tgtttaggga 4860 

?3??SJS a ttattaattt gttattggtg ttgaaggatg aggttgtttg gaagaagttt 4920 

""9"ttt attttgaata ttttttggat gtttagggtt attttgtgaa gttggaggtt 4980 

HH t 2 tttt ""agtagg tgtttgtggg gagtttggtt ttttgttttt ttttgtggag 5040 

ttttgtaggg gtattattta ggagttaggt ttattgatgt tttttttttt tttataggtt 5100 

HH gggg3g ""tggttt gtatggagtt tttttttttt tttatttttt 5160 

i§" 9tag H Jtttagtttt tttgtggttg ttggatagtt ttggtttagt tatttttgtg 5220 

ttgttagttt tttggtgatt ttattttttt atgagttttg tgttgtgttt tgttagaatg 5280 

SS&£2H° "H* 39 *" Sttttttagt tagaggtttt aatgtataat alagtaatgt 5340 

ggtagtttta atttgggttt tttgtttatg tttttgttgg gattattttt tttagggtaa 5400 

"ttattttt gttttatttt tgtttatttt attgtttggt tgtatttgag atgggtatgt 5460 

^? 399 H ga S tagatgttag ttatttttgt ttataatttt atgtttttti ttgatttait 5520 

tttgattgtt tagattggtg ataaggatta tattgttttg gtatgtgggg aaggggttag 5580 

9 S^H a ?-2gfS 9lt9t tagttagttt tggatgtggt ggagagggtl ggatttagtt 5640 

-tggaggttta-tattttaggt— t*aat%tagt-ttattttat^ti^ggattg-^ttttff 
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agtattatta 
tgttagggag 
tttttgggaa 
tgtttaaaat 
tggtgagtgt 
a 



taatagttat 
ttatatgtta 
ggttttagga 
tggtgggttt 
tatagttttt 



ttttttttat 
gggttatttt 
aaatttagga 
ttggttttat 
aaagatggtg 



seq_prot. txt 
atatgatatt ttaaaatgga 
ttagggttta gttggtaggt 
ttgagttatt gtttttagtt 
tgattttaag aatgaagttg 
tgtttagagt ttgttttttt 



agataaatta 
gttagaatat 
tgttattttg 
tggattttta 
tgatgttaag 



5760 
5820 
5880 
5940 
6000 
6001 



<210> 418 
<211> 6456 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 418 

taagaatttt 
ttagtttttt 
ttattatgtt 
attaattatt 
aagaattagg 
ggatttaatt 
ttttgatttt 
aagtaagtta 
ggaaataaaa 
taagtattat 
attattttta 
gttttataaa 
attattaatt 
aaagtgtttt 
tattatttta 
aatatttttt 
aaatatatta 
agtataatgg 
tgttatatag 
atatgataag 
tttgttttaa 
ggatgatagt 
aagagtttta 
atttataggt 
ggttgaggta 
gattttttga 
ttattagttt 
tggttttttt 
tttgatgatt 
tattatttta 
gatgagaaaa 
atttaatttg 
aaaaaagaaa 
ttattaagaa 
aggagagagt 
gggtagaata 
aattagttta 
gttttggaag 
gattattagg 
gtaaaaatat 
aaagttttta 
taaaaatttt 
ggttggttta 
tagggagaaa 
ttgatttgtt 
gataaaattt 
ttttgttttg 
ggttttagat 
gtttttaggt 
i ttggt:ttgta 
gttttgaata 



aagttattat 
tggtaaattt 
tttgttagtt 
tatgatttga 
tttaagattg 
ttgataattt 
gtttgagtaa 
tggggattaa 
agggaatgaa 
tgaattttat 
aggatttttt 
aaaggttagg 
tgttgaaatg 
tatatatgtt 
atatgtaaaa 
tgaaatttat 
attttaggag 
tagaaagtgt 

gggtgttttt 

tatttttaat 
aattgtagtg 
tgagttatag 
aaaatggatt 
ttttatatat 
taatgtggtt 
tgttttttag 
gggataagag 
gagtttgtag 
aatgttaaat 
tttttagtgt 
aagatatttt 
tagaagtttt 
aggatgatat 
aatattatta 
aagaattgtt 
tttatgatut 
aagagtatgt 
aaatttttat 
agttartttt 
tatgatgttt 
ttgattatta 
ttaatggtat 
ttttgttttt 
atgtttttiaa 
ttttggtttg 
aatttttatt 
tatntttttg 
ggtgagggtg 
gttttagtat 
tttgtatgtg 
tttttttata 



ttttgtgtat 
attaaaatta 
tagaggttta 
aggtatttat 
taaaattaat 
ttttttggaa 
atatttgtag 
tggttgtttt 
gatttgtagt 
ggtttgtgat 
tggttttaga 
tgaattgatt 
taaataattt 
attttatttt 
ataattttag 
agtatagaat 
gataaattaa 
gtttttggta 
gtagtatttt 
tggatatttt 
gtggaattta 
ggggttaaat 
ttgttgtttt 
aattttattt 
ttaattttat 
tttgagttta 
gagataagtt 
agagtttggt 
tgtttgattt 
atatagtagg 
tgttatatat 
taggtatttt 
aattaaataa 
ttttaagtta 
gtttttttat 
aattatatta 
ttgtttagag 
gatgttgttt 
attatgtaat 
ttgtattttg 
tatggtattt 
ggttatgaaa 
aaaatgtatt 
gttgtatagt 
ttgtttggta 
ttgtaagatt 
ggtttgagga 
tagttttttg 
tgttattttg 
ttttaatgtt 
ggtgtttttt 



gtaggtagta 
aattttaaat 
attatttgtg 
aaaagtattt 
tttaggggta 
agaaaagaat 
tttttaaaat 
aaggtagagg 
ttatgagggt 
tgagagtttt 
gtttgtaatg 
tttttttgtg 
tttagatatt 
atttttatga 
aaaataatgt 
ttaagttatg 
gagaatagga 
taagaatggt 
tatgtttgga 
ataggtaata 
ttttttggga 
atattttttt 
taataaataa 
tttaaaatat 
aattatttat 
aatngtgttg 
aaaatttttt 
attttaaagg 
tttataaaaa 
tatgtatttg 
ttttttaaga 
tttttaatgt 
tgttaattat 
tatattttta 
ttatttgtat 
t:attttggat 
atataaagta 
gagattgggt 
tttaataaag 
ttaatttgat 
ttatttttta 
aatatgtggg 
tatttttttt 
tgtagggaag 
aggttattgg 
aataggttgt 
tttttggttt 
gtttatttga 
tggatggtgg 
atgaaatggt 
ttgattttug 



ttagtgttag 
ttttgaaaaa 
ggggagggtt 
gtttttttaa 
atagtaaatt 
tgttgttaat 
ttagaatttt 
tagaagagaa 
gatattatta 
ttttgatatt 
tttgggtttg 
aagttgtttt 
tgaagtattt 
ttttaatagg 
ttttgttttt 
ggttagttag 
tggattagat 
gtaagttttg 
aggatattat 
gataaaataa 
tgtattaggt 
tttttttttt 
aatatatatg 
ttattttttt 
tgttttattt 
ggaataaagt 
aagaagattt 
ttagttttgg 
tatttagttg 
ttgaatagaa 
ttgagaaagg 
aaagttatgt 
ataatgggga 
tttaatattg 
aatttatttg 
ttatatttaa 
aaatagatat 
attagtattt 
agaaattgag 
ttatatttgg 
gttttattta 
ggttagtggg 
agattttttt 
tttgtttagg 
ttttggtttg 
gtaggatttt 
ggtttttttt 
ttatgttggg 
agagtggtat 
ttttgttttt 
ttatgtattt 



taataataag 
attaatatgt 
tataattatg 
gaaattatta 
ataggtttgg 
tgtttttatt 
tagtttggtt 
gaaataggaa 
ataatttttt 
ttataggtta 
aatatgattt 
atggtaagag 
ttaaatttat 
taggtgttat 
agtttataag 
aaatgtttag 
aaaatttata 
taaataatag 
ttatttatat 
aataagaatt 
gattttttta 
taatattata 
aaaaaaagtt 
ttattagtta 
atatatttat 
attgtataaa 
tgtagatatt 
gtttttttat 
tgtttattta 
gttttaaatg 
taaaattata 
atttttgttt 
aattttagga 
gttattatat 
ggaaaagtat 
ttttttggag 
agggataata 
aaatttttta 
agtatgattg 
ttttatttat 
ttttgttttt 
gtatgtttta 
ttagggtggg 
ttatgttttt 
tggttttgtt 
tttagttgtt 
gtgtgtgtga 
gtgtttgaag 
gttttttttg 
taagttgtgg 
gtattgttgg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
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ttagtgtggt ataggttgtt tttttttatg ttgtttggat gttgtttgtt gtggtgttgt 3120 

tggggtgggt ggggggtgtg tttaattagt tggtggttgt tgtttgttta ttttggggtt 3180 

ttttggagat tggtttagtt ttatttgttt tttgttggtt taggtgtttg ggtgttgggg 3240 

tttttgttgt tgtagaagtt ggatggagag atttttttgt ggggttggtg taattttggt 3300 

gttgttgttg tgaaattttg tttgtagagt tgttgttgtt gttgttgttg gaggagtgag 3360 

ttgatttttt tttttttttt tttgaagtga agtttttaat atagatatga ttatatagtt 3420 

tttgtttaag tgtgttttag tttagaatta ttgtatttaa aggaggagat gggaggataa 3480 

gaagaaagtg taattagata gtttagaagt tttttttggg gtttttttag attttttttt 3540 

ttttttttta tgaagttttt attgttattt tttgtgtgtt tttgtttttt tttaagttgt 3600 

ttaatttttt ttggtttttt tgagaaaagt gaagttattt tttttttttt gtagggttga 3660 

agttgttttt gtgtggagag gttgtagggg ttttttgggg atgagtgagt ggtgtgggat 3720 

gtagtggaat gggagggggt gtgttgagta gtttagagtg tgttgggagt gtgggggagg 3780 

ggaggtgttg ggtatgttaa tgttatggtt taatatttat ttttatatta ttgtgttgtg 3840 

ttgttatttt ttttgttttt tgaggtttgg atgtgtaggg agatggggtt taggggtgtt 3900 

ggagggtgtt taggggttgt gtataagttg gagtggaagg attgtgggtt tatttgtgtg 3960 

gggttgtaga atgtgggtgg gggtttttag gattttgtaa tttgatgttg ggagtgtgtg 4020 

agaaggggtg gttagatatt tgtttagaaa ttgttttttt tttttttatt tttagttttt 4080 

ttatgaggag atatttaaaa atgatttgta tatataaatg gtttttttgg ggtaaaggag 4140 

ttttggttgg aaatggaatt ttttttttgg ttgggaggat ttttttggtg gaaattttgg 4200 

gaagattagt tatttggttg gggagggttg gggtttattg ggggtgggtg agtgagtggt 4260 

tttgggggag gggagatatt gttttttatg ggattttaag ggtgggagaa aggggtgttt 4320 

gttataattt tgtttgttgt ttttttaaag aataatttta ggaaggggat agtataattg 4380 

tgttttgttt ttaagtttta aattgatttt gatttgagta aaatttaatt ttttttttaa 4440 

agggtggggg gtggggggtt aaattttgtg tttttagggg tgagagagaa ggtttttgtt 4500 

tttggggttt ggatttgtga attttttttt atttttagga tttttttgtt gtaggtagta 4560 

gaggtggttg ttttattttt tttttggtta aaggtttggt ttagtttgta gttggatgtt 4620 

tttggagttt tttgtataat aaagtgttag agtagggtat tttaattttt tttttttatt 4680 

tagtagttat ttgggattta gggggttttt tttttttagg gttgttgtgg ttttggtagg 4740 

atttttatta tgatggttgt atgtttggaa tttttggggt attttggaat tatttgtttt 4800 

gtaattgttg gtttgtttgt tttttgttag gaattaaata aatggtgtga tttttatgat 4860 

gagtgggtgg agtggattgt ggtgagagta gagtttttgg tatttattat atttagaatt 4920 

ttttttaaaa gttgaggaaa gagtttttag ggattgtttg ggggtttgag ttaggttgag 4980 

ggttatatag gtagttgggg agttgttgtt ggttgtttaa tttgtttgtt ttgtattttt 5040 

ttaatttttt tttgtttgag tttagttttg ggttttgtag tgaaggaagt gttggggatg 5100 

ttttattttt gtagggaatt gtgtttgttg tttttgttgg gttttgtatt gttttggttt 5160 

tttttttatt tgttttttag atggttggtt ttggtggaga gaaatgggag tagtaatgtt 5220 

tttttttttt ttttttttta tttttttttg ttgttgttgt tgttttttta tgttggttga 5280 

atgtgatttg attttatttt aaaaaaagtt tgatatagtg ttttaatatt tttgtatttt 5340 

tttgtgatta taaaggttgt agttgttttt ttttgttttt tgtttttgtt ttttgttttt 5400 

atatttaatg aaatttttta tgtgtttttt ttgaagtttg ttaagtatta taagggttgt 5460 

tattagtata gtgataagag gttatattgg ggattgtgat ttttgtgtgt gtgtatatgt 5520 

agatgtgttt atatgtatat ggatgggtgt gttgtgggga ttagggttgt ttattgtggg 5580 

gtgtgtgttt aattttaata ttttaaatta taaaagtaaa tttggaaatt atagtaaatg 5640 

tgtgtttttg tggggtttta agtttgggta ttggttggtt gtgtgtattt ttttgtgtat 5700 

aatatgtttg taggaagttt gtgttaatat ttattttgtg gaggggaagg tggagtttat 5760 

atttatattt tagtgagttg gagagggagt ttttgagggg gaaatgtaag tatatttttt 5820 

agtatggagt ttttagaggt agtgtatttt aaattttaaa tgtgaattat tatgtgagtg 5880 

tttgagtaga gttttagttt tgtttggaga aattgtatga ggtgtgtttt gtgtgggtgt 5940 

gtgtgtgtat ggatatgtgg aagtaggatt ttggtgttgt gggtgttttt aggttgggat 6000 

tttttttttt tattagataa ggagaatttt ttagtttttt aaaattggtt tttataatta 6060 

attttgtttt agtttgtaag gggttaatta aagttgattt taaggaaata taggtttttt 6120 

ttattttttg ttatttttta tttattttat ttatagtttt gttttggatt ttggttaatt 6180 

ttttagattt tttttgggat tttttatttt gtagtttatt tatttgagtg tatagttttt 6240 

tatttgaggg gtttagttgt gttgggtttt ttgagggggt tgtgagtgtt agttggtttt 6300 

ttgtatgtgt ttgtggtttt gtggagtagg taattagatt ttggggaagg agttattagt 6360 

a IH tttttg Qtgagggggt gggtatagtg gtggagggtg gagggatggt ggagggtgtt 6420 

gtttgtgttg tttgttgggg gtggatgggg gttgtt 6456 

<210> 419 

<211> 6456 , 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 419 , : -i •• ' ' . ; 1 
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agtggttttt gtttgttttt agtgagtggt gtgggtggtg ttttttgttg tttttttgtt 60 

ttttgttgtt gtgtttattt ttttgttgga gagagtgttg gtaatttttt ttttagagtt 120 

tgattatttg ttttgtgagg ttgtggatat gtgtggagag ttgattgata tttgtagttt 180 

ttttgggagg tttgatgtga ttgggttttt taggtgagga gttgtgtgtt tgggtgggtg 240 

ggttgtgggg tggggggttt tgggggagat ttgggggatt ggttggggtt tgaggtagaa 300 

ttgtgggtgg ggtaagtgga gggtggtggg gggtgggggg gatttgtgtt tttttggaat 360 

tggttttggt taatttttta tgaattgaga tgggattggt tatgggggtt agttttgggg 420 

agttagagga ttttttttat ttgatgagag ggggagattt tagtttaggg atatttgtga 480 

tattgagatt ttgtttttat gtatttgtat gtatatatat ttatatggaa tatgttttat 540 

atagtttttt taaatggaat tggaatttta tttggatatt tatataataa tttatattta 600 

gggtttaaaa tgtattgttt ttagggattt tatgttggag aatgtgtttg tatttttttt 660 

ttgaaaattt tttttttagt ttgttgaggt gtgggtgtgg gttttatttt tttttttatg 720 

agataaatgt taatgtaggt tttttgtggg tgtgttatat atagaaaagt gtatgtagtt 780 

ggttagtgtt taggtttgga gttttgtgaa gatgtatgtt tgttgtagtt tttaagttta 840 

tttttataat ttaaaatatt ggagttaaat atatgttttg tagtgagtag ttttggtttt 900 

tgtagtgtgt ttgtttgtgt gtatatgggt gtatttgtgt gtgtgtatat gtgggagttg 960 

tggtttttgg tgtggttttt tgttgttgtg ttggtggtga tttttgtggt gtttggtagg 1020 

ttttgggaga ggtgtatgaa ggattttatt gagtgtgaag atagagagta gagatagaaa 1080 

gtagaaggag gtggttgtag tttttgtagt tgtggaggaa tgtggaaatg ttgaagtatt 1140 

atgttaaatt ttttttgaaa tggggttaag ttatatttag ttagtgtggg aaagtaataa 1200 

taataataaa aaaaagtgga ggagaggggg ggaaaaggta ttgttgtttt tgtttttttt 1260 

tgttgaagtt gattgtttgg ggggtgagtg gggggagagt tagggtagtg tggggtttgg 1320 

tgggggtagt aggtgtagtt ttttgtgggg gtgggatgtt tttgatgttt tttttgttgt 1380 

ggagtttggg gttagatttg agtgggaggg ggttgggggg atgtaggata ggtgggttgg 1440 

gtggttggtg gtggtttttt ggttatttgt gtggtttttg atttggttta gatttttggg 1500 

tagtttttgg gggttttttt tttggttttt aaggaaagtt ttgaatgtag tgagtgttag 1560 

aagttttgtt tttgttgtgg tttgttttat ttgtttattg tgggggttgt gttgtttgtt 1620 

taatttttgg taaaaggtga gtgggttagt ggttgtgagg taggtaattt tgaggtgttt 1680 

tgggaatttt ggatatgtgg ttgttgtggt gaaagttttg ttgaggttgt ggtggttttg 1740 

gaggagggag gttttttagg ttttagatag ttgttgaatg gggagggaaa gttagggtat 1800 

tttattttgg tgttttgttg tgtagagggt tttgggggtg tttggttgta gattggattg 1860 

ggtttttagt tggaggggga atggggtagt tatttttgtt atttatagtg gggaggtttt 1920 

gggaatggga aaaggtttgt gagtttgggt tttgaggata gagatttttt tttttatttt 1980 

tggaggtgta ggatttaatt ttttattttt tattttttga aagggaggtt gagttttgtt 2040 

taggttgaaa ttagtttaaa atttaaggat gaggtgtgat tatgttattt tttttttgaa 2100 

gttatttttt aaggagatga taagtgaaat tgtggtgaat attttttttt tttatttttg 2160 

ggattttgtg ggggatggtg tttttttttt tttaggattg tttgtttgtt tatttttgat 2220 

aagttttaat tttttttgat tagatggttg gtttttttaa ggtttttatt agggaggttt 2280 

ttttagttgg agagagaatt ttgtttttaa ttgagatttt tttattttaa aaagattatt 2340 

tgtgtatgtg agttgttttt aaatgttttt ttgtagaaaa attgaaagta gaaagaaagg 2400 

aagtaatttt tgagtagatg tttgattatt tttttttgta tatttttagt attgggttat 2460 

agggttttgg gagtttttat ttatattttg tggttttata tggatggatt tatagttttt 2520 

ttgttttagt ttgtgtgtga tttttgaatg ttttttggta tttttggatt ttgttttttt 2580 

gtatgtttgg gttttaggag gtggggaggg tagtggtgta atataatgat atagggataa 2640 

atattaagtt gtgatattga tgtgtttggt gttttttttt ttttgtgttt ttggtatatt 2700 

ttgggttgtt tggtatgttt ttttttgttt tattgtgttt tgtgttgttt gtttattttt 2760 

gaggggtttt tgtaattttt ttgtgtgaag atggttttag ttttgtaggg aaagaaaagt 2820 

aattttgttt tttttggagg aattaggaag gattaagtgg tttgggagag ggtaggagtg 2880 

tgtggagggt agtgatggag gttttgtaaa ggaggaggag gggagtttgg aggaattttg 2940 

aggaaggttt ttgggttgtt tgattgtatt ttttttttat ttttttgttt ttttttttag 3000 

gtgtaatgat tttggattga gatgtgtttg ggtagaggtt atgtaattgt gtttgtgttg 3060 

aggattttgt tttgaggagg gaagaggagg gattggtttg tttttttggt ggtggtggtg 3120 

gtggtgattt tgtaggtgga gttttgtggt ggtggtatta gggttatgtt agttttgtgg 3180 

ggaggttttt ttatttagtt tttgtagtgg tgaaagtttt agtgtttgag tgtttgagtt 3240 

ggtggggagt aagtaaagtt agattgattt ttggggagtt ttggagtagg tgagtggtgg 3300 

ttgttagtta gttgagtgta ttttttgttt gttttagtgg tgttgtggtg ggtggtgttt 3360 

aggtggtatg gagaaggatg gtttgtgttg tgttgattag tagtatgaat gtgtggtgga 3420 

gattggggag ggtgtttatg ggaaggtgtt taaggtttgt gatttgaaga atggaggttg 3480 

ttttgtggtg ttgaagtgtg tgtgggtgta gattggtgag gagggtatgt tgttttttat 3540 

tatttgtgag gtggtggtgt tgaggtattt ggagattttt gagtatttta atgtggttag 3600 

gtgagttagg gagttgtgtt tttgttattt ggggttttgt gtgtgtgggg agggttgagt 3660 

tgggaatttt taggtttaga aaggtgtggg gtagagggtg attggggagg ttttgtgtga 3720 

tttgttggtt ttatagagtg agagttaagt tttgttgata gagttgtgag ttagagttgg 3780 

tgattttgtt agatagtgag ttgaaagata agttaaaaaa atatgatttg agtaggtttt 3840 

tttgtagttg tgtaatttag aagtgttttt tttttgttta ttttgaagga gggtttgggg 3900 

gaagtgggtg tgttttaggg ataggataag ttaatttgag atatgttttg ttagtttttg 3960 

tatatttttt gtggttatgt tgttggggaa tttttgaaag atagagtagg tgagattgga 4020 

gaatgaggat gttgtgtgat aattagtggg aattttatgg gtggagttaa gtgtaaatta 4080 

sefte 401 



seq_prot.txt 

ggttaataag atgtagaaat attataatat ttttgttagt tatattttta gttttttttt 4140 

attaaagttg tgtagtaaga atgatttttg gtggtttaag gagtttgagt gttgatgttt 4200 

aattttaaat agtgttataa aaattttttt taaaattgtt atttttgtgt ttgttttatt 4260 

ttatattttt gggtggatat attttttaga ttaatttttt aaagggttaa atgtgagttt 4320 

aagatgtggt atggttagat tataaatgtt ttgtttatgt tttttttaga tggattgtgt 4380 

gagtgggtgg ggagatagta gtttttattt ttttttgtgt ggtaattagt attaaatgaa 4440 

. aatgtgtgat ttaggataat ggtgtttttt tagtgatttt ggaatttttt tattgtgtgg 4500 

ttgatattgt ttggttgtgt tatttttttt ttttttaaat agagatgtat agttttatat 4560 

tgaaaaggaa tgtttagaaa tttttgtaaa ttgggttgtg attttatttt ttttgatttt 4620 

gagaaaatgt gtagtagaaa tatttttttt tttatttatt tagggttttt gtttaataaa 4680 

tatatgttta ttgtgtgtgt tgaggatgaa atgatgtaaa taagtatagt taaatatttt 4740 

tgtaaaaagt taaataattt aatattagtt attaaagtgg agagatttaa gattgatttt 4800 

tgaaatatta aattttttat agatttagag agattaaata tttatagaat ttttttggga 4860 

ggttttaatt tgtttttttt tattttaggt tggtgatttg tgtggtgttt tgtttttaat 4920 

gtagtttaga tttaagttag aaggtattaa agaattataa atatataaat aaggtagtaa 4980 

atggttataa agttaagatt gtattatatt ttggtttagt tagtgaagga gaatgagtgt 5040 

tttagaagat gagattatat ataaaaattt atagatgatt tttttttatg tgtattttat 5100 

ttgttaaaaa taataaaatt tgtttttggg gttttttgtg atattggagg gaaaagggga 5160 

gggtatattt agttttttgt aatttagttg ttgttttagg ggaattattt aatatatttt 5220 

gaaggatgaa ttttattatt ataattttaa gatggaagtt tttgttttat tttgtttgtt 5280 

atttatggag tatttagtta agggtgttta ttatgtatat gaatgaatag tgttttttta 5340 

aatgtagagg tattatagag gtatttttgt ataatattat tgtttataaa gtttatgtta 5400 

tttttatatt agaggtatat tttttgttat tgtgtttgta gattttgttt gatttatttt 5460 

atttttttga tttatttttt tgagattgat gtgtttttag atgtttttaa ttgatttatg 5520 

atttggattt tgtattgtga attttaggag gatattttta tgaattgaag ataaaggtat 5580 

tgttttttaa aattattttt atgtgttaag gtgatgatgg tatttatttg ttaaagttgt 5640 

gagaataaaa tgagatggtg tatgtgaaag tattttgtaa atttgaaagt attttagata 5700 

tttaaaggat tgtttgtgtt ttagtagatt gatggttttt tgttatggag tagttttata 5760 

gggaggggtt aatttatttg attttttttg tagaataaat tatatttaaa tttagatatt 5820 

ataggttttg gaattaagaa gatttttgag gataattagt ttgtaagatg ttagaaaaga 5880 

tttttagtta tgagttatga aatttagtga tatttgagga aattattgat gatgttattt 5940 

ttatgagtta tagattttta tttttttttt gttttttttt tgtttttttt tttttgtttt 6000 

tgttttgaaa taattattgg tttttatgat ttgtttggtt agattaagaa ttttggattt 6060 

tagaggttgt gaatatttgt ttagataggg ttaggaaata aggatagttg atagtaattt 6120 

tttttttttt aagggagagt tgttaaagtt gggtttttaa atttgtgatt tattattgtt 6180 

tttgggatta gttttatagt tttaagtttg gttttttaat ggttttttaa aagggtagat 6240 

gtttttataa gtatttttaa gttataagta attgattata attataggtt tttttttata 6300 

gatagttaag tttttaagtt aataggagta taataaatat attagttttt ttaaaagttt 6360 

aagatttaat tttgatggat ttgttagaaa agttgatttg ttattgttgg tattggtatt 6420 

gtttgtatgt atagggataa tgatttgaaa tttttg 6456 

<210> 420 
<211> 4499 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 420 

tttagaggtt attatttgaa ttttggttaa ttgtgaggaa aggttggaag aggagtttgg 60 

agatggattg tggtaatttt ggaagattta tgttttggaa tttgagagaa agagaggagg 120 

ttatattatt taggagagag gggttgttga ggtggggtga gttggaattt gtttttataa 180 

gttatttttg ttttttggtt tttagggatt tgggggtttt tggttttttt ttttattttg 240 

gtttaggatt aagttttttt ttaatttttt tttttgtatg tggattttta gatttagttt 300 

ggtgttttag ttagagttta ggttatagtg gggagtgtga gtgaggagta tgttttttga 360 

tgggtttttt agtgagggtt gaaaggtaga tgttgaaaag gtgatttgtt tagatttatg 420 

ggattagttg gggtagtttt gggattttta gagtttttaa agatgatggt ttaaaaatta 480 

ggagtagtgt gagtggttgt tgagttttat tttatttgta agtttttgtt aagtattggg 540 

taggaaatat gatagttaag aggattggag tgttgatttt tttttttggg agtgatattt 600 

gggatagttt ttaaggttga agggtttagg ttttttttgt tgtttttatt tggggatgtt 660 

"""S 9 "" ""gttgtt gtgtatatgt attagttggg tagtaggaga ttagttaatg 720 

agattgtaaa tgttgtttgt ttttatttat ttatttttga gattgagttt tgttttgttg 780 

JSiSJHS SS^ISS^S t 9 t 9 att "g atttattgta agttttgttt ttagggtata 840 

32K52 SSK5S8£ «tttgagta gttgggatta taggtgtttg ttattatgtt 900 

^JSS^iKJXEE SgSHgg "f? 9g ?H3B *tgtgttagt flaggatggtt 960 

-ttgatttttt -^attttgtta-tttgttttt t ttggtttttt aaagtgttga gat^ataggt 1020" 
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gtgagttatt gtgtttggtt tgttgtttgt ttttagttat tgattgttaa aagattaagt 1080 

tgttggtaag ttgatattaa taattttttt ggaaaaattt tttaattaag ttaataatag 1140 

aaaaatattg gttgtaattg tagatattag ttattttgaa gatgtgtgag agggaggtat 1200 

agtgggtttt gggtaggttg taggggtgtg tgtgtgtgaa tgtgttgggt ttgggagatt 1260 

gagagtgtgt gtgtaagatt ttgggaggga gttttttttt tggggtgagg agtgtggatt 1320 

gtagtttttt taggttagtt tttgtttaat attttttata atttttaatt tttaagtttt 1380 

gtatatttgt tttgttttag gttggatagt tatttttagg gatttttttt tttgttttat 1440 

ttagaaggag gtagttgtga gttgggtgtg gtggtgggtg tttgtaattt tagttattag 1500 

ggaggttgag gtaggagaat tgtttgaatt tgggaggtgg aggttgtagt gatttaaaat 1560 

tgtgttaatg tattttattt tgggtgatag attgagattt tgttttaaaa ataaataaat 1620 

aaataaatga agttgttgtg agtgtttttt tttttttttt tttttgatta gttgagttgt 1680 

attgattaag gattatttgg tttgggtggt gggataggga gggtttgggt gagttttttt 1740 

tggagttttg gagtttgtag gagatttgtg ggggtgggtt gtgtttttaa gagagtgagt 1800 

gtttttaaga gagtgagttt tttgttatgg gtggtgttta taaaatgtgt ttttgtgggt 1860 

gatagagggg tgtggtgggg tttaggaggt ttgtgggtga ttttgagtat tgggttggtt 1920 

ttgagtgagg ggttagtttt ggggttggga ggaaggtgag aggtagtgtt attgtttttg 1980 

attttttggg tgattttatg aaggaaggta gagggttgag ttttagtgtg gattagttaa 2040 

atttgatggg tttttgttgg ttatttgttt ttaattttgg ttgggttggg aggggttttt 2100 

ttgtaagttt ttttgtttgg tggaggtttt ttttttttaa tttttgtgat gggtgtatgt 2160 

tttaagttgt ttttgtttgt tgggtttgtt tgttgttttt agggtttggt ggtttgtggt 2220 

ggtttgtgtt tattgtgttg gttttattga ttttgattaa tgtaggtttg agatgtggtg 2280 

gtttttgttg agtgggtgtg gtgtttgggg ttttttggga gtaggtgtga aggatgtgtt 2340 

tggagtttgt tgtttggatt tttggttttt gggtgtgtag taattttttg tggtaggatt 2400 

gtaattgtgg aggtttgtag ttgtgattaa atttgtgttt tttttgttgg ttttgtggtt 2460 

tgtagtttat ttgggttaat ggtgttaatt ttggattttt tttaatgttg ggttttgtgg 2520 

tatttgggaa agtttttatg taggaaggtt tagggttttg tgggtgtata tatgggaagg 2580 

tataaaagtt atagatattg tttgtttaaa ttgaagtttg gtggaaagtt atgttttttt 2640 

attttttaaa aaatttaatt ttgttgaagt atagtttata tgtaataaaa tgtatttatt 2700 

ttaagtggat atttagatat atattttgat atagttatat tttattatta tgattaagat 2760 

atagaatatt tttatgtttt taaaaattat tttgtgttgt tattttagat tattttgtgt 2820 

ttttttattt attttatttt atatatttga gttttaggta attgttggat tgtttttttt 2880 

tattatagat tatgtttttg tattttttgg agttttatat taatggaatt atatattatg 2940 

agttttgtgt ttgattttta tttagtttaa tgttgttgag attttttttt attgttggaa 3000 

agattagtag tttatttttt tgtgtgtgag atggagtttt gttttgttgt ttaggttgga 3060 

gtgtagtggt gtgattttag tttattgtaa tttttgtttt ttgggtttaa gtgatttttg 3120 

tgttttagtt ttttgagtag ttgagattat aggtatttgt tattatattt ggttaatttt 3180 

tgtatttttt taatagagat ggggttttat tatgttggtt aggttggttt tgaatttttg 3240 

attttaggtg atttatttat tttggttttt ttaagtgttg ggattatagg tgtgagttat 3300 

tgtgtttagt ttttttattg ttgatattgt attgtttggt tgtattgtaa tttgtttaat 3360 

tttttttttg tttatagata tttggatttt tttttagttt gggttatttt gaattaagtt 3420 

gttaggaata tttgtttaat tttgtgtgat tttatgattt tatttttttt gggtgtatat 3480 

atagaaatgg aattggtggg ttatattaga aaatatattt ttaatgttgt aagaaattgt 3540 

aaaattattt tttaaagtgg ttgaagtatt ttatattttt attaatagta tatgaaagtt 3600 

ttagttattt ttatatagtt attaattttt agtattgtta gtttttttaa ttttagttat 3660 

gttagtgggt gtttagtggt attttattgt atttttaaat tttatttttt aaataattat 3720 

gttgagtatt ttatgatgtg tttgttttaa aaatgtgtat attttattta gtgtaatgtt 3780 

tgtttaaata ttttatttat ttttattagg ttatttgttt tatattattg agttgtaaaa 3840 

gttttttata tatttaggat ataatttttt tatttatgtt ttgtatatat ttttttatag 3900 

tttgtggttt gttttattat tttttttttt tttttttttt tttttttttt tttttttttt 3960 

tgagatggag ttttattttt gttgtttagg ttggagtgta atggtgtgat tttagtttat 4020 

tataattttt gttttttagg tttaagtgat ttttttgttt tagtttttta agtagttggg 4080 

attatggtta tatattatta tgtttggttg attttgtatt tttagtagag atggggtttt 4140 

tttatgttgg ttaggttggt tttgaatatt tgattttagg tgatttgttt gttttggttt 4200 

attaaagtgt tgggattata ggtatgaagt attatattta ggtttttttg ttttgttttg 4260 

ttttgttttt gtttttgttt ttttaagata gagttttgtt ttgttattta ggttggaggg 4320 

tagtggtatg attttaggtt attgtaattt ttattttttg ggtttaagtg atttttttgt 4380 

tttggttttt ttagtagttg agattatagg tgtgtgtt&t tatgtttggt taatttttgt 4440 

gttttttagt agaaatgggg ttttgttatg ttggttaggt tggttttgaa tttttgatt 4499 

<210> 421 

<211> 4499 ^ 
<212> DNA 

<213> Artificial Sequence 

<220> /' - * : ; • ! . : , ' ? : ; ' /. : ; V"' 

<223> chemically, treated genomic DNA CHomb sapiens) ! 

<400> 421 "< 1 ' ,h 1 1 ' ' : ' " '* ' : ^' 
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ggttagaagt 
taaaaattag 
taggagaatt 
tttttagttt 
aaaataaaat 
ggttaaggtg 
aaattttatt 
ttagttattt 
tgagttgaga 
aaaaaaaaaa 
tgtgagaaaa 
tttatagttt 
atattgtatt 
tagttatttg 
taattaaaat 
gtttttatat 
tagtttttta 
tatgtattta 
gtttaattta 
ttaaataaat 
tggtttatat 
agaagtttga 
aaattagttg 
gagaattgtt 
ttagtttggg 
tttaataata 



ttgagattag 
ttaggtatgg 
gtttgaattt 
gggtgataga 
aaaaaagttt 
ggtggattat 
tttattaaaa 
gggaggttga 
ttatattatt 
aaaaaaaaaa 
tatatgtaaa 
agtaatataa 
aaataagata 
agaaatgaaa 
taaaaagatt 
attattggtg 
taatattaaa 



atagtgtatg 
aagaaagtag 
tatagggtag 
ttttgattat 
gtgggtgtat 
agaaggtata 
ttttttatgt 
tatagggttt 
gttgtgaagt 
attttgttgt 
atatgttttt 
gttgtatttt 
ttgtgggtta 
tatgtgtttg 
aagatttttt 

tggttgattt 

aggagtagtg 
attgatttgg 
tttatagagg 
tttgtttttt 
gagggatttg 
tagtttagtt 
tttgtttgtt 
ttttgaatta 
tagtagttgg 
taaagtagga 
gggtttgggg 
gtttgtattt 
gttttttgga 
tatttttttt 
tattattaat 
tttaattttt 
tttgtaattt 
attatttttg 
gtgtggtggt 
gtattttgga 
gatagagtga 
attgattggt 
atgtttttag 
ggtattattt 
jfctagtatttg 



agagaaatga 
aaatagttta 
tgtggtataa 
ttgtaatttt 
gattagtttg 
ggtgtgatgg 
tgaatttagg 
tgatagagtg 
gggaggggtt 
attttattga 
tttggtggtt 
tttgggatga 
ggtagtggag 
tttattgtat 
gttttttatt 
gtatatttgt 
ggtattgaga 
tggtggagga 
gaggggttgt 

tgtatttgtt 
gaatttgtgt 
ttaggttttg 
ttgtgggggt 
ttagtttaat 
gtgttggggt 
gtgttgtttt 
tgtttggaat 
tgtattttgt 
tggagatgta 
tttaggtttt 
aattagaggg 
tttgagatgg 
ttgtaatttt 
ggttataggt 
ggaagagttt 
gttaggggtt 
tttattttgg 
tttaatatgt 
ttatatattt 
ttagttaaaa 
taataattag 
tagtattttg 
ttaatatggt 
gggtgtttgt 
ggtggagttt 
gatttagttt 
tttttgttgt 
gtgggagtga 
ttaggagaaa 
gtgagagttt 



tttggttaat 
tggtgtatgt 
aggaggtgga 
gtaagatttt 
gggtgtggtg 
ttgaggttaa 
atataaaatt 
ggtaggagaa 
gtattttagt 
aaagagagag 
atataaataa 
aataaataat 
tatatatttt 
tttaaaaata 
ggtaatatta 
ggagtgtaaa 
aatatatttt 
aattataaag 
aattgggaaa 
ttagataatg 
agtatttagg 
gttaatatgg 
tggatgtttg 
aggtggaggt 
agattttatt 
ttaataatat 



tatgaaattt 
gtttagaatt 
tagtataagg 
tgtgattgtg 
gtaaattgta 
gagttttaat 
aggattttgg 
ggaatttaaa 
ggtgtgagtt 
tgtgtattta 
tttggaggat 
taattggggt 
ggagtgatag 
tgggaaggag 
tagggttagg 
ttagtttttt 
ttgttttttt 
tgtttatgag 
ggatattgtt 
gtttgttttt 
ttttattttg 
aaggagagga 
agttttggtt 
tgttttttag 
gtttgttatt 
ttgggagtaa 
gtggggagtg 
agaaaaagtt 
ttgtgtgtat 
ttgaaatgat 
gatttttttg 
taattaaaag 
ggaggttgag 
gaaattttgt 
agttttagtt 
gtagtgagtt 
taaaaataaa 
ttagttagtg 
tggagagaat 
aagttaatgt 
a^gagtggaa- 



atgataaaat 
ttataatttt 
ggttgtagtg 
attttgaaaa 
ttttatgttt 
gtgtttaaga 
agttgggtat 
ttgtttgaat 
ttgggtaata 
aaaaagaaaa 
gggagttata 
ttgataaaaa 
taaaatgaat 
tagtaaggta 
aaaattggtg 
atgttttagt 
ttgatatgat 
ttatatagaa 
aagtttaaat 
taatattagt 
gaggttgagg 
tgaaattttg 
taattttagt 
tgtagtgagt 
ttatatataa 
taagttaagt 
tagaaaatat 
taggtgtgtg 
taatttttga 
ttaaaatatg 
ttttaataaa 
ttgaataaat 
gtttttttgt 
gttgatatta 
tagttatagt 
gaggttagag 
tttgaatgtt 
tggtgggatt 
atgagtttgg 
aagtttttgt 
gatgggtgat 
gttttttttt 
tttagtttta 
ttttttaggt 
tatgataggg 
gtaaattttt 
ttgtttgggt 

ggggatgttt 

tgttgtttag 
gtttaagtag 
atgtttggtt 
ttgtttagtt 
ttgggtggaa 
tttttttgaa 
atatttttgt 
tagtatttat 
gagggattat 
taaataatgg 
gagggtggat 
ttttattaaa 



tttattttta 
agttattggg 
atttgagatt 
aataaaaata 
gtaattttag 
ttagtttgat 

ggtggtgtgt 
ttgggaggtg 
agagtgaaat 
tgatgaggta 
ttttaaatat 
tgagtgaaat 
atgttataag 
ttattgggta 
gttatgtgga 
tattttagaa 
ttattaattt 
ttgaataaat 
gtttatggat 
aataaaaaag 

tgggtggatt 

tttttattaa 
tatttgggag 
tgagattgtg 
aaaggtaaat 
gaaagttaga 
aaaaatataa 



atttgggagg 
gagattgtgt 
taaataaaag 
tgtgtgtgtg 
ttgggttttt 
tttagttttt 
tggaatttgg 



§gatggggtg 
ggatatggaa 
tatttaaata 
gttgaatttt 
aatatttgta 
gtggagattt 
ttagtttggg 
tgtaggtttt 
gtttagatga 
gtgtttattt 
gatatgatgg 
tgggtgagag 
tgggtgagaa 
tggtaagggt 
gtggagttgt 
gggttggttt 
tttattgtat 
agtttgtttt 
tgtggatttt 
tggatggttt 
gtagtgattt 
ggtggagtgt 
tttttttgtt 
tgtagttgtt 

tggggtgggg 

gttggtttgg 
attttgtata 
agtttatttg 
aattgtaatt 
tgatgttggt 
gttggatgtg 
aatgaggtta 
aatagaaaaa 
ttgaggtagg 
tattgtattt 
taaataatgt 
gtggtggaga 
tagttttgaa 
ttggttatta 
tagttgttta 



ttaaaaaata 
gaggttaagg 
atgttattgt 
aaaataaaat 
tattttggta 
taatatggag 
gattataatt 
gaggttgtgg 
tttgttttaa 
agttatagat 
atagaaaatt 
atttgaatag 
atgtttaatg 
tttattagta 
gatgattgga 
aatagttttg 
tatttttata 
atttttaata 
aagagaaagg 
"gggtgtgg 
atttgaggtt 
agaaatataa 
gttgaggtat 
ttattgtatt 
tattgatttt 
tatagagttt 
tttatagtag 
agtagaggga 
atgttttgta 
tttatttaaa 
ttaaaaagta 
gtttttgtgt 
ttttgggtgt 
tggattgtgg 
tgtggttgta 
tagattttag 
ggtggaagtt 
atgtgggtta 
tagtttaagg 
gatttgtgag 
ttgttgagtt 
ttggggagtt 
tttatttggg 
ttttttgttg 
tttggagatg 
aaggttttgg 
ttgattgatg 
tgtttgtttg 
attggtgtga 
ttagtttttt 
tttttttgag 
tgagtgtgtg 
gagagttgta 
tatattttta 
ggatttattg 
aatgtttttt 
ttattagtag 
gtggtttatg 
ggagattgag 
aattagttgg 
agaatggtgt 
tagtttgggt 
ttgtagtttt 
gttagaagag 
ggttgtttta 
tgttttttgt 
tgttgttttt ~ 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020— 
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ggtttttgga ttattgtttt tgggaatttt gggagtttta gagttgtttt gattggtttt 4080 

gtgagtttaa gtgagttatt tttttagtat ttgttttttg atttttgttg aaaaatttat 4140 

tagaggatat gttttttatt tgtgtttttt attatggttt aggttttggt tggagtatta 4200 

ggttgggttt gggaatttat gtgtagaaaa agaagttaaa gagaaatttg attttgggtt 4260 

agagtgaaag gaaagattaa aggtttttaa atttttggaa gttaaagggt aggaataatt 4320 

tatgagagta gattttagtt tattttattt tagtagtttt ttttttttag gtggtatggt 4380 

tttttttttt ttttttgaat tttaggatat gggtttttta gaattgttat agtttatttt 4440 

taaatttttt ttttagtttt tttttatagt tgattaagat ttgaatggtg gtttttaaa 4499 

<210> 422 
<211> 4500 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 422 

ttatgtgagg atatagggta atgatagtta tttatagtta aggaagagag gtttttaata 60 

gaattaattt tgtttatatt tggattttaa atttttagtt tttagaattg tgtgaggaga 120 

aatgtttgtt gtttatgtta tttggtttgt ggtagattaa tatggttatt tattagttgg 180 

ttggttttgg gtaggttttt tatgtttttt gtgttttagt tttttttatt gtagaatggg 240 

atagtaatgt gtttattttt tgggttgtgg tgagttggtt aagatgtgaa gagtttggag 300 

taggttgagt aggggtattg taaatttgtt tttgttttta tattttttta tgatttagag 360 

gttggggata ttgggtattt tgtttgtagt ttttatgtta gtttattttt aaatatttta 420 

attagagttt ttataaatat agttaaatgg agtagaggta ggtagggttt gtttttgata 480 

gtttttgtta aggatgtttg attaaggttg gtaaggaagg tggttagggt ttagagttta 540 

ttgttttttg ggaggagaaa ttaaaatata aatgaaaaag ggttttgggg ttgagtaggg 600 

tatggaggtg gttaggatag gagtttagtt gtaggtatag ttttaattta tgtgattttt 660 

agggatttgg aaagggattt ggtttagttt atgtggtagt tattttttta attttttaat 720 

tgtggtttta ggatggttgt tgggattttt agagatggat ttttatggtt tttaggggtg 780 

taaaggaagg atagtgtgga gtgtgttgta agttttaatt gatattgttt gagtatttgt 840 

tgtatattaa gtgtttttat atagtgaagg tgaggttgta ggttatgttt agtttataga 900 

tagagatatt aaggttattt aggtattaag gtgatgtagt tattgaggtt atatgagaag 960 

taagagggta tttttgtggt aggtgggtag tgggagttag tgtgagttgg ggagtggaga 1020 

tttaaagagt tgtgttgggt ggttagggag gttttgattt aatgtttagt attaggtgag 1080 

ttgatttggg ggtgtttatg tgtttattta tgggagatta ggaggtagag gtaggttggg 1140 

atgagagttt tgggggtaga gttggtttgg gtggtttttt tttatagatt ttatagttgg 1200 

agatggagat ggtttagggg tatgtttgtt tgtggttttg gttttttaag agttttgaga 1260 

ttaatgtttt tggtttagtt taagttgtat aggaaagtta ttttgtggat taagtttggg 1320 

tatatagaga gggttggatt ttatatgtat ttgtggtagg tttttggaga tttttggaat 1380 

atttttgtat ttttttttta tttttaagta atattttttt gggaaaatgt ttatttggtt 1440 

ttagttgaat ttaggaaatt gattgttatt tatttatata tttagagttt aattggggtg 1500 

gtagggatat atgtatagga ttgtgtttat gtttgggttg tatgtttgta gttgggggat 1560 

atatatatag agtgatgtgt atttattatt tatatatata tattggttat atatatgttt 1620 

atattaaatt atttgtttat aattatttat atatatatta tttttatatt tatattttat 1680 

agttgttgag gagggtgagg ataggttgtt atttttattt tttttatagg atgagttgta 1740 

aaatggttta gaggttttag tgagggtgta tgtgtgtttg ttgagtgttt tagtgttttg 1800 

ggatagtttt ttttagtatt gttgatatag tttttattgt tttttgaagt gtagataaaa 1860 

tggagggtag ttttgtttag ggtttttttg gttttaggaa gggttaatag gtagtaaggg 1920 

ttgttttttt taggaatgtt ttgtttgagt agtaagtgat tgtttagttt tggttttggg 1980 

ggattttttt gagatttgtg agtagatgaa gttggggatt agtagatata gttttaggtt 2040 

gggaggaaaa atttatgtgt gatttatgat gtttttttag atgtggggtg atgtaggtgg 2100 

tttggagtta gggttgatgt ttaggatttt aatatttttt tgtatttaag gggtggattt 2160 

gggggttatg taggtattaa ggtaaggtat gagtgtgtga gtgtgtagta gatagtatgg 2220 

tagtgtggtt ttattttaag ttgtagggta gagtatgaga gatttaatgt ggtttttgga 2280 

tggggtttta attattttgt atggttaagg tttaagtttt aagagttggg tagttttttg 2340 

tatgtggggt ggttataagt ttgtagttgt tttttttgtt gtatatttat tttgagttta 2400 

"ttggagta ggtttgggtt tttggggtat ttttatattt tatttgttta tttttaaata 2460 

aptttaggt ta gaaaattt tttgagaagg gatagtagga tttgggattt ttttatttat 2520 

ttatttattt atttatttat ttttttttta ataaatattt tggtgagtgt agtggtttat 2580 

gtttgtaatt ttagtattta gggaggttaa ggtgagtgga ttatttgagg ttaggagttt 2640 

9 2S»Ji«2 t1: SBdttaatat ggtgaaattt tgtttttatt agaaatataa agattagtta 2700 

22«i22 t9a t 9 tatat t:tg taattttaat tatttaggag gttgaggtag gagaattatt 2760 

H^ tta9 2 a 99tagaggt tgtaatgagt tgagattata ttattgtatt ttagttttgg 2820 

^!^agt gagattttat tttaaaaaaa aaaaaaaaaa taaaaaagtt ttgtgggttg 2880 

ggtatagtgt taattgttag taatattta§ gtaaggaaga tgtttatttg gtttttgttt 2940 
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attaggaatt tggaggttag aaatatagtt ttrttgtaga taatttgtag taggatagaa 3000 

aataaagttt ggttatggtg gggttattta gtttgaggta tttaggggag gttttttgga 3060 

ggaggtggtt tagttaaagt tttgaaggat gtataagagt gagtgagtta aggaaggggt 3120 

tggtatgagt taagttgtag agtttagggg ataggttagg ttggagaagg gatgttgaga 3180 

. ggttttatga agtttgtttt ttgttgttaa tgaatttttt atggaggttt ggaagtattg 3240 

tatggtttta gtatatattt tgattttatt atttttgttt ttgtatatta gttggggatg 3300 

ttgttttagg aagtgatttt attttttaaa tataatgtgt ttttttgtaa tttaattttt 3360 

ttgggggtgt gtggaggttt ggaagttggt ttgtttgttt ttgtagttgg tggttgaagt 3420 

gtttttgttt ttttggttat ggtttttttg tttttttttt gtgtgtgtgt ttttttgaga 3480 

tagggtttta ttttgttatt taggttggag tgtagtggtg tgattttgtt ttattgtaat 3540 

ttttgttttt tgggtttaag tgattttttt gttttagttt tttgagtaga tgggattata 3600 

ggtgtatatt attatatttg gttaattttt gtattttttt agagatgggg ttttattatg 3660 

ttggttaggt tggttttaaa tttttgattt tttgatttgt ttgttttagt tttttaaaat 3720 

gttgggatta taggtatgag ttattgtgtt tggttatatt tatattttgt ggggattagg 3780 

gttttttgtt ataattaatt tataatagtt gaaggaggtt atgaggtttt agtgggattt 3840 

ttgggggata gggatgggag gaaatttagg attagataga ttataaaatt gtttatagat 3900 

ttttgttttt agttgttttt gttgagaatt gggagttaga aaaaggaggt taaattttag 3960 

ggtaatgttt atgttttttg taatttttgg ttttattaaa ttagtagtat aattttagtt 4020 

ttattttgta gagttagata taatggggta ttagaaggga attgaatttt tttttaatta 4080 

taatggttta tagatattgt ttaatttgta atttgggagt taatgtggtt tttttttttt 4140 

gtagttatgg ggttgtttgt atatatatgt ttttatttat ttggatggag tatggggttt 4200 

attttgtaag gtgagatgtt ggataaggtt gtttttagtt tggtgagtgg agtgttgtgt 4260 

ttagttttgt tttttttttg tttttagtta ggtttattat atagaggatg ttagtgtggt 4320 

tgttatatgt tatatggagt atatatgtat ggagttattt taataggtag gtaggagtat 4380 

aatttttttt ttaattatat atttagtatt tttgagaaaa tgagatttta gtaatttaat 4440 

tggattgaga agtggggtta taataaaatg gaatttgtaa taattgtgta gttaaattag 4500 

<210> 423 
<211> 4500 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 423 

ttaatttaat tgtgtgattg ttataaattt tattttatta tagttttgtt ttttagttta 60 
attgaattat taaaatttta tttttttaga agtgttgaat atgtaattag aggaaaaatt 120 
atgtttttgt ttgtttatta ggatggtttt atgtatgtgt gttttatgtg atatgtgata 180 
gttatattga tgttttttgt gtggtgggtt tggttggagg taggagagag gtagggttgg 240 
gtgtggtatt ttatttattg ggttgggggt ggttttgttt gatgttttat tttgtagggt 300 
gggttttatg ttttgtttgg gtgaatgaga atatgtgtgt gtaggtggtt ttatgattgt 360 
aagaggagaa aattatatta atttttagat tataaattaa ataatgtttg tgggttattg 420 
taattaaagg gaaatttaat ttttttttaa tgttttatta tgtttggttt tgtaaggtaa 480 
agttgggatt gtgttgttag tttaatggga ttagaggtta taaggagtat gaatattatt 540 
ttgagatttg gttttttttt tttggttttt aatttttagt aggggtagtt gggggtagaa 600 
gtttgtgagt ggttttgtaa tttgtttagt tttgaatttt tttttatttt tgttttttaa 660 
aagttttgtt ggggttttat agtttttttt gattgttgtg ggttgattgt ggtaggaggt 720 
tttggttttt atagagtgtg ggtgtggtta ggtatggtgg tttatatttg taattttagt 780 
attttgggag gttgaggtag gtggattaag aggttaggag tttgagatta gtttggttaa 840 
tatggtgaaa ttttgttttt aggaaaatat aaaaattagt tgggtgtgat ggtatgtgtt 900 
tgtagtttta tttatttagg aggttgaggt aggagaattg tttgaatttg ggaggtggag 960 
gttgtagtga aataagattg tgttattgta ttttagtttg ggtgatagag tgagattttg 1020 
ttttaaaaaa atatatatat aaaaaaaaaa tggaagagtt ataattggaa gagtaagggt 1080 
attttaatta ttagttgtaa gagtaggtag gttagttttt gggtttttat atatttttga 1140 
gggggttaag ttgtaggaag atgtattgta tttaaaaggt gaggttattt tttggagtag 1200 
tatttttagt tggtatgtgg agatggaagt gatgggattg gggtgtgtgt tggagttatg 1260 
tggtgttttt aggtttttat gaagggttta ttggtagtag aggatgggtt ttatggggtt 1320 
ttttagtatt ttttttttaa tttgatttgt tttttgggtt ttgtgatttg atttgtgttg 1380 
gttttttttt tggtttattt atttttgtgt attttttaag gttttagttg ggttattttt 1440 
tttagggagt tttttttgag tgttttaggt tggatgattt tattataatt aggttttgtt 1500 
ttttgttttg ttatagatta tttgtagagg ggttatgttt ttaattttta aatttttggt 1560 
gggtaaaaat tagatagata ttttttttgt ttgggtgttg ttagtagtta gtattgtgtt 1620 
tagtttatga gatttttttg tttttttttt tttttttgag atggagtttt gtttttgttg 1680 
ttgagattgg agtataatgg tgtgatttta gtttattgta atttttgttt tttgggttta 1740 
agtgattttt ttgttttagt tttttaagta gttgggatta taggtgtgtg ttattatgtt 1800 
tggttaaiitt: _tl:g:tatt^^^ 186a 
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gaatttttga ttttaaatga tttatttatt ttggtttttt taagtgttgg aattataggt 1920 

atgagttatt atgtttgtta agatgtttat taagaaaaaa atgaatgaat gaatgaatga 1980 

atgaatgggg gaattttaag ttttgttgtt tttttttaga gagttttttg gtttaagagt 2040 

tgtttgagag tgggtaggtg ggatatgggg gtgttttggg aatttaggtt tgttttaggg 2100 

tgagtttggg atggatatat agtaggaggg atagttgtag gtttgtgatt gttttatata 2160 

tgggggattg tttagttttt agaatttagg ttttggttgt ataggataat tgagatttta 2220 

tttagagatt atgttgggtt ttttgtgttt tgttttgtag tttgaggtgg gattatattg 2280 

ttatgttgtt tgttgtatgt ttatatgttt gtgttttgtt ttagtgtttg tatggttttt 2340 

aggtttattt tttgggtgta gagaaatgtt ggggttttag gtgttagttt tggttttagg 2400 

ttatttgtat tgttttatgt ttgggagggt attatgaatt atatatggat tttttttttt 2460 

ggtttggagt tatgtttgtt aatttttagt tttatttatt tatagatttt aggagaattt 2520 

tttggggttg gagttaggta gttatttatt atttgagtgg agtgtttttg gggaaggtag 2580 

tttttgttgt ttgttggttt tttttggggt tagggaggtt ttgagtagag ttgttttttg 2640 

ttttgtttgt attttagagg gtagtggggg ttgtattaat ggtgttgggg agaattgttt 2700 

tgggatgttg gaatatttgg tggatatgta tgtatttttg ttgaagtttt tgagttgttt 2760 

tgtagtttat tttatgaggg aggtgggagt agtgatttgt ttttattttt tttagtagtt 2820 

{Jtagggtgtg agtgtggagg tagtgtgtgt gtgagtggtt gtgagtaaat gatttagtgt 2880 

ggatatgtgt gtgattaatg tgtgtgtgta agtgatgggt gtatgttatt ttgtgtgtgt 2940 

gttttttagt tgtgggtatg tagtttaggt gtgggtatgg ttttgtgtat gtgtttttat 3000 

tattttagtt gaattttgag tgtgtgagtg ggtggtaatt aattttttgg atttagttga 3060 

gattaggtga gtattttttt aaaggaatat tgtttgagaa tgaagggaga atgtaggggt 3120 

gttttaggag tttttaggag tttgttatag atgtgtgtgg agtttagttt tttttgtgtg 3180 

tttagatttg gtttatagga tggttttttt gtgtggtttg ggttaagttg ggggtattgg 3240 

ttttagagtt tttggagagt tagggttgta ggtaggtatg tttttgagtt atttttattt 3300 

ttggttatag aatttgtgag gaggaattgt ttgggttagt tttattttta ggatttttat 3360 

tttaatttgt ttttgttttt tggtttttta taagtggata tatgggtatt tttaggttag 3420 

tttatttggt gttgggtatt gagttagagt ttttttggtt atttagtata gttttttggg 3480 

tttttgtttt ttaatttata ttaattttta ttatttattt gttatagaga tgtttttttg 3540 

ttttttgtgt ggttttggta gttgtattat tttggtgttt gggtggtttt ggtgtttttg 3600 

tttgtgagtt ggatatggtt tgtagttttg tttttattgt atgaaagtgt ttagtatata 3660 

gtaggtgttt aagtagtgtt ggttggggtt tatagtatat tttatattgt tttttttttg 3720 

tatttttgga agttgtgaga gtttattttt gggagtttta gtagttattt tggggttgta 3780 

gttaggaagt tagagaagtg attgttatgt gggttgagtt aggttttttt ttaggttttt 3840 

aaaaattata taaattgggg ttgtatttgt agttgggttt ttgttttggt tatttttatg 3900 

ttttgtttag ttttggagtt tttttttatt tatgttttaa tttttttttt tagaaaatag 3960 

tgagttttga gttttagtta ttttttttgt tagttttggt tgggtgtttt tagtaagggt 4020 

tgttgggagt agattttgtt tgtttttgtt ttgtttggtt gtgtttgtag gggttttggt 4080 

tggggtgttt aggagtaagt tggtgtgaga gttgtaagta aggtgtttag tgtttttggt 4140 

ttttgagttg tggggaagtg taagaatagg agtaggtttg tagtgttttt gtttagtttg 4200 

ttttaggttt tttatatttt aattggttta ttatagttta agaggtgggt atattgttat 4260 

tttattttat agtagaggaa attgaggtat agagggtata ggaagtttgt ttagggttag 4320 

ttgattagta agtagttgta ttggtttgtt atagattagg tggtataaat aataggtgtt 4380 

tttttttgtg tagttttgaa ggttgggagt ttgaggttta ggtgtgggta gggttggttt 4440 

tgttggaggt tttttttttt tggttgtgga tggttgttgt tgttttgtgt ttttatatgg 4500 

<210> 424 
<211> 6499 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 424 

tattgtatat ttttatttat aattggtagt taattattga atatatatag atataaatgt 60 

attttaatat attattattt ttatttagat tttgtgtaat atttaattat ttaattatat 120 

taataaataa aagtttttag aaattgatgt atttagtata attaaggggt aatttgttta 180 

tgattatagg agttatttta tttatattaa tttgatgtga ggagataatt aaattttttt 240 

ttttttataa gaaaaattta attttgggag gtttaggtag gagaattttt tgaatttagg 300 

aggtagaagt tgtagtgagt tgagattgta ttattgtatt ttagtttggg taataagagt 360 

gaaattttgt tttaaaaaaa taaataaata aaataaaata aaaagtaaaa tttaattttt 420 

atattttttt tttaaaaata ttatattatt gaaatagaat agaattttgg attttaatgt 480 

ttttatgifctt tttgataatg taaagattat attaagaata atattgattt tttgtgtata 540 

tgtgtgaaat tatgttgagt tgatgtattt agaaaagaaa ttgtagtgat attgggtata 600 

gaatttttaa ttttaatttt ttttttgttg tggaatagaa atttagtttt attttttttg 660 

tttataatat ataagagtag ataattagat atgaggtgat taaatatata aattgatttt 720 

atagttatta tggaaagatt ataatatgtt tggatgaggt gggggaaaga gttttttttt 780 
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gaattgagat ttgaataggt tttagataga ttaaaaattt atggtttgtt ttttttaatt 840 

ttttattatt gattatttgg gagaattaaa atggtttgga ttataaagtg attataatta 900 

gtagggtagg tatattaaat aaaaatgggg tgagtgtgtt tggatgggaa aaattggggt 960 
gtagagaaat attttttgtt ttggaggttt tgtttatagt gagtttgagg atgtagatga - 1020 

tatggaatat gagttttatt aggatatata tttttttgta tagttttttt tgattagtgt 1080 

tgtttattgt tttaggtatt atttttagtt attttgtgtt aggtatttag gatagttttg 1140 

tttaaagtat gattgttttt tggttagtga agaaatagta aaqaaatqaa aatttatttt 1200 

tgaagttttt gtagtaattt aatatttttg taatgttgtg gtatatgttt tatatttttt 1260 

aaaagtattt atttgtagat tagaaataaa agagtttttt agtatagata gtttgagaga 1320 

tgttattgta gagtgttttg ggaggatggg atttttttag atattgatgg ttttttggag 1380 

tggtgttgga gtttgttgag atagggatgt ggggattgtt gtggaatagt atgttgtgaa 1440 

aataggtata gatgggttgt tgttatgtat ttagtatttg gtaggtttag gtattttgtt 1500 

ttggggagat agagtaatag ttgttatttt taaaaatgtg atataaatat gatttttgtt 1560 

ttttattgta tgtgtatgta tgtgtgtatg tgtatgtgtg gttggtgtgt gggtaaataa 1620 

tattggtgag ggagttagta gtgtagagat gtagagatga atttaatgaa tttatatttg 1680 

gaggtttttg aaaggagttt tgtgtatgta taattaggaa gtagaagtgt agttgtaggt 1740 

tagttttatt ttttggtgga aatagtaaat tatagtttta gtatttttta tttgtagtaa 1800 

aagaaatgta tgttgtattt gggaggggta ggagaatttt ttagaatttg gtgggtttat 1860 

tagttagggt tgtgtagaga aatagaatta atgagataaa tatgttagat agatgataga 1920 

tagatagata gatagataga tagatagata gatagataga gatgatagat gatatagata 1980 

gatgatagat gatagataat agatgatata tagatagata tagtaaatag atttatagat 2040 

atgatagaga gagaatgaga gagatttata ttaaagaatt ggtttatatg attgtggggg 2100 

ttgtaaattt tatatttgta gaatttgttg gtaggatgga aatttaggta agagtggata 2160 

ttgtagtttt gagttgaaag tttatagggt agtaagttgg aaatttaggt aaggttttta 2220 

ggttgtagtt ttaagaagaa ttttatattt tttaggaaat tttagtttgt gtttttaggg 2280 

tttttaattg attgaatgaa gtataaatat tatggaagga aatttggttt atttaaagtt 2340 

tgttggatta agtattaatt ttaattaaaa aatgttttta tagtaatatt tagattattg 2400 

tttagttaag tatttgggta ttatagttta gataagttga tgtataaaat ttattattat 2460 

aagggaggaa tttatatata tattttattg tagtaaaatt tttatagttt aatgttttgt 2520 

atttttttta gaaggaaatg ttttagtgta gagttgaata ttgtattttt ttgtagttta 2580 

tataaatttt atataatttt gatttgttga gttttttttg aaaattggat aatttaagtg 2640 

tttatgaaag gttttaattg tttaagaaaa tagattgttt gtgtgaatta taaagaaaaa 2700 

gggattttag aaggaatatt ggtattttgg gaagtaggtt ggggtaaggt ttgtataagt 2760 

gaattagaag ttttaggtat gaagttagta tttttgttat ggtttatgtt agttgagttt 2820 

ttatttttgt tttgtgtttg gtttttagat tttgtgagtt tgttttggga tagggttatg 2880 

gttaatttag tagagatttt ggtaaagtat tttgggaatg agagtgagaa aggtttaaag 2940 

tagttaaggt ttaaagataa tttttgattt tattttttta aatgttagtt tttaagattt 3000 

taggtttttt ggattttaat ttttatgtta ttttaaaggt tttaaatttt agttttagta 3060 

gttttagttt atagttttag gtttttagag attattttag taatttttaa atattatttt 3120 

agggatttag ttttgatagt taagattatt ttttttataa aataattgta gatattaggt 3180 

gaagattttt aaagttttaa ggaaatttta aattttattt tgaggatatt gattttatta 3240 

aagttttgag gaaattttaa attttatttt gaggatatta gttttattgt taggaattgg 3300 

gattttaatt tatagtattt ggattttgag aatagaggtt ttggggttaa atgggttgaa 3360 

tttagtattt tatttttatg tttttgggtg gatagtaatt tttttttatt gtgttttttt 3420 

gtgggtttta ggttttatat ttgagggatg tggttttttt tttttatatt atgttggtta 3480 

agaatgattt atatagttat tatggaatat tgtatggaga taaggagtgg ttgtgttttt 3540 

gtttgaaggg tttattttat ttatgaggta ggatgatttt taaggtagtg ttatttatta 3600 

gagttaaagt ttaggtaagg tgtttagggg ttagttgtaa tatgaagtat attagttaag 3660 

gtagttttaa agagtattgt tagaggaaga atttatattt ggttgtttaa tgaggttgaa 3720 

aggtaaattt aggttaagta ggttaagtaa agaggttagt ttagtatttt gggaggtaga 3780 

ggtaggtgga ttatttgagg ttaggagttt gagattagtt tggttaatat ggtgaaattt 3840 

tgtttttatt aaaaatataa aaattagttg ggtgtggtgg tatgtgtttg taattttagt 3900 

tatttaggag gttgaggtag gagaattgtt tgaattaggg aggtggaggt tgtggtgagt 3960 

taagattagg ttattgtatt ttagtttggg tgatagagtg aaattttgtt ttaaaaaata 4020 

aataaataaa taaataaaaa taaaaaagaa gttagaaagt ttatgaaaaa tatttggagg 4080 

gaagaagtta gtttaagaat aaaagtatta ggttagggtg tttaagattt ttaaggattg 4140 

gttttggttg tgtagatatt taaagagatt gtgtaaggtt ggataggagg gtgagttggt 4200 

taaaaaggtg atttgttaat tgtggaatta attattgttt aatgttttgg ataattgagt 4260 

ttgatttgta atggttaggt ttattattgt aaattagttt tgaagtataa ataagtttat 4320 

taaattattt tttttgttta tatttttttt tattttttga tttatgtaat ttttgtagtt 4380 

tatatttaaa taaaaaataa taatatggat ttgttattat gtatgttgtg gtttattttt 4440 

aattgtaaag tttttgatta tttttttgtt atgttgattt ttatgttaga agaaaatgtt 4500 

ttgttattta ttatgagtgt agaaaattgt ttattagaaa tgattttaaa tagtaaagga 4560 

tgtatatttg aaataatagg gttgtaatga aaaagtatat attttttaat tttatttaat 4620 

atattgtatt tgtagagtaa aagatataga aaagtatttt atgttttaaa aatgtaagtt 4680 

tgagagtgaa aaatagagat aataaaaatt tttttttaaa tattattggt ggaaaagttt 4740 

H-S 9 ^*?*? aaaagtatat ataaaatgtt tgtgaatttt tgtgtaataa aattatatat 4800 

_ a _tatatg_tgt„^ 4860- 
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tttgagatgg agttttgttt tgttgtttag gttggagtgt agtggtggga ttttggttta 4920 

ttgtaagttt tgttttttgg gtttatgtta tttttttgtt ttagtttttt aagtagttgg 4980 

gattataggt gtttgttatt atgtttggtt aattttttgt atttttagta gagatggggt 5040 

tttattgttt tagtggggtt ggttttgatt ttttgatttt gtgatttgtt tgttttggtt 5100 

ttttaaagtg ttgggattat aggtgtgagt tattgtgttt ggttaatgta tgtatatttg 5160 

taatttagat ttgtaaatgt agttatattt atatttatat attaaatgta ttaaaaagat 5220 

ttattaagtt ttttatagtt agtttataaa tttttagttt ttttatatat ttgtattttt 5280 

tgttttaata tgatataata gttaaaaaaa tttaaaaatt aataattaat taagtttttt 5340 

ttttttgtag attatattag ttattattta gaaatattaa tttattgttt taaaaataat 5400 

ttttttagag atgtttggta tttagtatat gaaattattt ataattgaat aatataaata 5460 

aaatagaaaa aagaatttat ttttgggata tttgaagaaa gtgatttata aatttagttt 5520 

atttttgagt tttattattt ttaggaaaat ggtttttatg ttatttttat gttatgtttg 5580 

atttgttttg tttgagtttg ttttttagaa agattgtgat ttttggtttg ttgaatatga 5640 

aatttatatg attatttttg ggaaaaattt tttttttagt gattgtaagt taggttttgt 5700 

ggttgagaag tattagttgt atttggttga ttgattgtat tattttgttt ttatgttgtt 5760 

tataaagata tatttgagat ttggtaattt ataaaagagg tttaatagat ttatagtgtt 5820 

aagtgggtgg gggtgtttta taattatggt ggaaggtgaa aggtatgttt tatatggtgg 5880 

tagataagag aagagagttt gtgtaggaaa attttttttt ataaaattat tagattttat 5940 

gagatttatt tattattata agaatagtat gggaaggatt tatttttatg atttaattat 6000 

ttttaattag gattttttat aatatgtggg aattatggga attataattt aagataagat 6060 

ttgggtgagg ttatagttaa ttatattatt gatttttatt atattataaa gaattatttt 6120 

gtagagagaa tttgtttttt atttttaata atttgtttat tttggatatt ttatataaat 6180 

agagttatat aatatttgtt gttggttttt ttatttagtg taatgttttt aaggtttatt 6240 

tgtgttagag tatgtatttt tgttttattt ttttatatta ttaaataata tttgattgta 6300 

tggatgtgtt atattttatt tgatttttta ttagttgata gattgttggg tttttttatt 6360 

ttttggttat tatgaatgat ggtattatat gaatatttat atataagttt ttatgggtat 6420 

atgtttttat tttttttggg tatatattta ggtgggggat tgttgttata tggtaatttt 6480 

atgtttaata ttttgagga 6499 

<210> 425 
<211> 6499 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 425 

tttttaggat gttaaatata aagttattat atgatagtaa ttttttattt aggtatatat 60 

ttaagaaaga taaaaatata tatttatgaa aatttgtgtg tgaatgttta tatagtatta 120 

ttatttgtaa tagttaaaaa gtagaaaaat ttaatagttt gttaattgat aagagattaa 180 

ataaaatgtg gtatatttat ataattgaat attatttagt gatgtaaagg aatgaagtaa 240 

aggtatatat tttaatatag atgaattttg aaaatattat attaaatgaa aaagttagta 300 

ataaatatta tataatttta tttatgtgaa atatttagaa taaataaatt attagagatg 360 

gaaagtagat tttttttgta agatagtttt ttataatatg atgaaagtta gtgatatggt 420 

tggttgtgat tttatttaaa ttttattttg aattgtagtt tttataattt ttatatgttg 480 

tggagggttt tggttggagg taattgaatt atgggggtgg gttttttttg tgttattttt 540 

gtgatagtga gtaagtttta tgaggtttga tggttttata agaggaaatt tttttgtgta 600 

agtttttttt ttttgtttgt tattatgtga gatatgtttt ttattttttg ttatgattgt 660 

gaggtatttt tatttatttg atattgtgag tttgttaaat tttttttgta aattgttaag 720 

ttttaggtat gtttttatga gtagtatgaa aatagagtaa tatagttagt taattaagta 780 

tagttgatgt tttttggtta taggatttgg tttgtagtta ttaaaaaagg aatttttttt 840 

aagagtgatt atgtaaattt tatatttaat aggttaagga ttataatttt tttggaaaat 900 

agatttaagt aagataagtt aaatataata tagagatgat atagagatta tttttttaaa 960 

aataatgaga tttagggata aattgagttt gtaggttatt ttttttagat gttttaaaag 1020 

taagtttttt tttttgtttt gtttgtatta tttaattata aatgatttta tatattaagt 1080 

gttaaatatt tttagagaaa ttatttttaa aataatgaat taatgttttt gaataatggt 1140 

tgatatagtt tataagaaaa aaaaatttgg ttaattatta atttttaaat ttttttaatt 1200 

attatattat gttaagataa aaaatgtgaa tatgtgaaga agttgggaat ttatggattg 1260 

attatgagga atttagtgaa tttttttggt gtatttgatg tgtggatatg aatgtggttg 1320 

tatttataaa tttgagttat aaatatatat atattggttg ggtgtggtgg tttatgtttg 1380 

taattttagt attttgggag gttgaggtgg gtagattatg aggttaggag attgagatta 1440 

attttgttaa aatggtgaaa ttttgatttt attaaaaata taaaaaatta gttgggtgta 1500 

gtggtgggtg tttgtagttt tagtt^tttg ggaggttgag gtaggagaat ggtgtgaatt 1560 

tgggaggtgg agtttg^gt gagttgagat tttgttattg tattttagtt tgggtgatag 1620 

agtgaga^tt tgttttaaaa aaaaaaaaaa aaataaaata aaataaaaaa aataaatata 1680 

tatatatata tatatatata tatatgattt tattgtataa aaatttatag gtattttata 1740 
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tatagggagt gggttggggt ggtatttgag gtgtatatag tgttatatat tgtgtttttt 1680 

ttatatgaat gtttgatgtt ataagtgtga tatgtttttt gtttgtatgt agtgtagtga 1740 

ttgattaaat ttaatttggt tttttttatt tgtttatttt tgtttattag attaataaat 1800 

tattagtagg ttaagaaaaa agaaaatagt aattaaaagt tgtattgtta agttgtgttt 1860 

tttgtgttta tagtagtttt ttattaaatt tgggattgtg aagggattat aaagaaggtg 1920 

attaatatag ttttttttaa ggttatgtga ttttttttta ttttattttt ttttttttta 1980 

ttttatttta ttttatttta gatgaaagtg gaaagattat ggtaatatag aataagtggt 2040 

tattgtagtg tttttttttt ttaaaagatt taattgttgt tgaggtagaa attgaatgtt 2100 

ggtgtttttt agtttttttt ggtagatttt tgtgaggatg tttttttagt ttttttttgg 2160 

aatatataaa gggtatagtt atgatgttta attgttatat tgtgtgggtt gtttttgtta 2220 

atagtgtaga ggtaatttgt ttttatagta tattatgttt gttttaaaag aggatgatga 2280 

tgattggtga gtttttattt ttagggtttt ttaaattgta attgaggaat taaggtgttg 2340 

ttttttttta gagtggttat tagttattgt tgatttaagg tttgatatta tggtggttgg 2400 

ttagtaaatt taattgtttg ttgtttagag aatgttaggg taatgtatgt ttatttatta 2460 

attagaaaaa aagtaggaat ggataaattt tttttgatag tatgttattt attagtttta 2520 

aaaattggta tataagaaaa atagagttag ggggattttt ttttaaggta taaatgaatt 2580 

tattttttag atattggtaa taatagttgg tatttttaga gtgtttatta gatgtttagg 2640 

tattttttgt taatattttt gttttatatt tgagaaaatt aaggtataga taagtaatgt 2700 

gtttaaaatt atagggtgtg tgggattagg atttgggttg ggttggtatt ggtgttaata 2760 

tttgtgtgtg atttttatat tattgtgtag aggtagtgtg agagtagggg tagagttgat 2820 

aatgagtttt tttaaattta tttttttttt tttgtattta ttaaagtata ttttgagata 2880 

gtgatttaag tttggtgtta attgtttaat tttaatatta atttgtattg attaaatatt 2940 

tgatggtggg ttagttaagt agagaaaatt taatatatat gtttttgtaa ggtttttttt 3000 

ttatataaat ttaatttttt tttttttttt tttttttttg attttttatg gttttggaat 3060 

ttgttggggt ttttgtgagt tggtttgagt tgtaaagttt tttttttttt tgtttattta 3120 

ataaatgtgt agtatttttt aggatatggg tggtaaagtg gagataataa atatttttga 3180 

taaaagtgtt gttagtgaga ggatgatttt tattaaatgt tgtatattgt tttataagat 3240 

ggtatgtgaa aaagatttgg gggaagaggt aaatatttat ttttgtagta gattgagttt 3300 

ttttaagaat ggttagttta aaatagttta taatagtttt tatattttaa tagatttgat 3360 

tttttatatg gttttatttt gtttgggtgg ttaattgata taatttatat tttttttttt 3420 

tttttaggtt attatagtat attatttagt aaagtatgtt ttaaagtttg tttgtattat 3480 

atgattaaat ttttatttat tagtgtttga aatattttag ttgtgaaggg gtttttaaat 3540 

gttattatgt gtgatgtttt tgtgtaaaag tagtaaatgg gagaaaaaaa ttttataaga 3600 

atgggtattt atatatttat atgaatgtat atatatatat atatatatat atatatatat 3660 

attttatttt gggttttaat tataggttat tgtatatttt aaattttttg taaaaggttt 3720 

taaatatttt tttttttttt gaagattgag tgagttgtaa attttgggaa taaatatata 3780 

tataaatgaa tggattatag tttgaaaaat ataattatgg tagtattaag ttatgagatt 3840 

ttatattggt tttgtttagt ttgttttggg ttttgtggga aaagtggaat gtatgaggtt 3900 

attttttttt taatttttaa tttttaaatt gtgggtttta tatatttaag tagtaggttt 3960 

atattttttg attaggtaat aggttatgat tttgagattt tttgtggagt tttggttata 4020 

aaggaggatg tagtatgatt aaagttatgg gtagatgaaa gagagatgag ttaaagatta 4080 

gttgttttgt gattttgttt ttttatttaa tatttaaatt ttagaaagtt attaagttgt 4140 

taggtagata tatattttgt ttgtttttag atgatttatt ttttattttg atatgaggtg 4200 

tggatttagg tgattatttt gaaatagatt atggtataaa agttagtaag tatattttat 4260 

ttttttttat ggttaaaaaa ttgatagaat ttagttaagt tggtaaattt tttggattgt 4320 

gttgtaatta tttaattttt aattgtttga tttttttata ttatttggaa attaaggtat 4380 

ttttaaattt ggaaggaaat agaattttta ggttatgaat ttaataaaga ttttgtatta 4440 

9 4441 

<210> 428 
<211> 4343 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 428 

tttaaagaat tattaaatta ttttttgggt ttttgaatta gatggagttg aattattttg 60 

gttgttaatt gagtgaaggg tgttggggaa aagtgttatt gtgtaagtta aggaaagggt 120 

tagagaattg aaaagatggt agttgtatga aagggttttt gtgttgggat ttttttttta 180 

agttatttaa aggtataaat taagaattgt aattttggag atatgataaa tgaatataat 240 

tttggttttt tgttttattt taaagtgaaa ttttattgtt ttttaagaat ttgtatatga 300 

agttttatai; attttttatg ttagtttttt tttttgttat ttttgttttt tagtttgttt 360 

ttttttaata gttggatata ttatt;gtttt atttttttat tagttgtgat tgaagtagtt 420 

tgaggttaat tattttttta gtaattgata aagaattttg atgaagagga aagtgatatt 480 

_gt«;tttga^t^ 5.40- 
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ggtagatttt ttttttgtaa ggagtttagt agggtggtgg tggtggggtt tgtggaagtg 600 

tgtgggtaat atatggtaag tggagaaaat atttatattt atggtgaaag agtaggagaa 660 

gatttaaaga aggttatttt atagagattt taagtaagag atattgattt gatattgtat 720 

ttttgatttg gatataatag agaatttaaa gaaagagttt ttttttgtgg tttttttgtt 780 

attgttattt ttaataattt gaattattgg taatttaaaa aagaaaagat tttgtttatt 840 

aaatttgata aaagtattat attggttttt ttttgggttt tggttaggtt gttatgtttt 900 

tttttttttt tttgtgttag gtttttaaat ttagaatttg gagttgtggt tattgagttt 960 

gtgttttggg tttgtgggtg gtgtttgttg ttgtgattgg gtggttaggt gagatggagt 1020 

gtatgggtag agtgtgtatt tggggtgatt ggtttttggg ggaagatata aataatgaat 1080 

tggtgttaga ttattttttg atgagatata tgaaattttt agtgttttta gtaggtattt 1140 

tgtattatag ttttttaaga aaggtaaaag gttttttttt ggttttttag tggtaaaatt 1200 

tgtagttgtt gtttagtatt ataattaaat gtgtgtgggg ttgtgggata gaggtgtttt 1260 

tgtgtatttt tggatagatt tatagttgtt tggtggatat atatgtgtgt gtttggttga 1320 

ttggattagt agttgtttgt gttttttttt ttgtgtatgg tggtggtttt gtattaagat 1380 

ttgttgagga aatgaagagt agggttattt tttattttgt tgttgaaagt ttattttatt 1440 

ttttgggtgt gtaggaagag atggtgtgag ggtaggaagg tgtgaaattg gggttttgtg 1500 

ggagatttat tgtggtttta tatttatttt tgtttgaaaa ttgttagtgt tttatttttt 1560 

ttaaatttta gtttttgagg gggaggtagt gagatgggat ataggtgtgt ttgggttttt 1620 

gtttgggagg ggttttttta ttttgggatt ttttgtggag tagtttataa ttgattggtt 1680 

tgttgtatag aggttatgtt taatggattt ttgtagataa aagggtttgg ttattttttt 1740 

tttttataat ggtttttgag gttaatgttt taaagaggaa ataaagggat tgttttatag 1800 

tatataaggg gggtggtggg aggaataggg agaggaggag gaaggggtat tggatatgtg 1860 

gagtgggagg aggttttttg ttttttaatt ttttattttt tgtgattgtt taggtttttt 1920 

gagttttgta tttttattta ggatttgaaa tagtggggag gaggtgatag tgtgtggagg 1980 

gttttgtttt gtgatttgat ttttagtttt ttttattttt ttttttgttt ttttaaagaa 2040 

tttttgaaag ggagaatgga aaagatgagg ggatttatat tttgtgagtt gtagttttga 2100 

aaagttagtt ttagaggggt tgttttttgt gaggtttggg tgtttatata ggattgatgt 2160 

gtggtagtgt tatttgggtt tggattttgg gattgtggga ggtgggagtg tttggggtta 2220 

ggatttgggt gtatgttgag ggtatttggt gaaggttggg gaaagtgggg tagttttgaa 2280 

atgtggtttg tatgttgttt tgggagttgt ttttgggttt ttttgtagat atagttttta 2340 

ggtgtgagtg gtgtgtgtgt tggttggggg tgttttttat aggaagttta tatttgtaaa 2400 

ttgggtggtg gggggggggg ggggtggggt gtggtgggag agagagagtt ttagtggttt 2460 

ttgttttttg gtttttttgt tttggtttta ggaaaagatt aaaataatag ttaaatagta 2520 

attgaagata agtgatgtaa gaaataaata atttgataat aaaaatgttt atagggatgg 2580 

ttttttaaaa ttttttatta tggaaattag agtatatgta taaaaggaga gagaaaggta 2640 

ttgtgtagtg ggttttaaat ttgattaagt atttgaattg tggagtgaaa ttttttggag 2700 

ggtttgttaa aatatagatt gttgggtttt agttttagag ttttttgttg tgtattttta 2760 

agtagatttg gggttagttt ttttttgaga atttgtaatt tgttagagtt tttaggtggt 2820 

gttgtttttg gtttggaaag gatattgtta tttattggtt taataaattt ttgtgtattt 2880 

ttttttattt ttaatatgta taatattttt ttattttgtt tattttattt ttttttatat 2940 

tttgtttggt ttgttggaga attataaagt aatttttagt tattttatta ttttatttat 3000 

atagatgatt tttgatttgg tggttattta agttgagagg attgtaggtt tttttgtggt 3060 

tattttatat tggggtagaa tttggtattt ttttttggga aatttatgtt tatattttgg 3120 

ttttgagatt attttttggg agtttttttt tagtagattg ttatatgttt aaatattatt 3180 

attgggtttg ttggtaagtt ggtttggttt ggttagggta ttatttggtt tttatttttt 3240 

attttttgtt tattgttttt aatttgttat tatgtatttt tataattatt gttttttttg 3300 

aggtattttt ttaataaggt gttttttttt tagggtggag ttttgagtgt ttagggatgg 3360 

ggtggtattt ataatagtgt aagtttatag gattatttgt attagttagg tttaggattt 3420 

atattttaag gggagttttt ttgggttatt tgattttttg tataattttt gattggaaaa 3480 

ggggatattt taggtatagt ttttgggaat ggttattatt gatatttgtt gtaatttttt 3540 

ttgattgttg ggttttagtt tagtttggtt atttgtgttg tttttgtttt agagaggtgt 3600 

taagtgtttt tttagatttt atttggttag ttttaagtgt gtagttttta tttatttttt 3660 

tgtttttttt ttgggttgtt ttgaaataag tttttttttt tttttttttt tttttttttt 3720 

attttggttt aggatattat ttgttttttt tatagttttt ttattttagt tttttttagt 3780 

tttattttta ttttaatttt tagaggtttt ttgggttgtt ataaagttta gatatttttt 3840 

tttatttttt aaatttttag agatggggtg ttatttattg tgttatttag gttggagtgt 3900 

agtggtgtga atttggttta ttgtagtttt aattttttag atgtaagtga tttttttatt 3960 

ttagttttta agttattggg attataggta tgtattatat aggtttgtta attgttgtat 4020 

tttttttttt ttttgtagag atagggtttt gttttgttgt ttaggttggt tttgaatttt 4080 

tgagtttagg taatttgttt gttttggttt tttaaagtgt tgggattata ggtatgagtt 4140 

attgtgtttg atttagattt ttttattgtt tttttttttt ttttgatttt ttatgaagta 4200 

agtatattat atgaattttt tattttttta tttgttgtta aagaaagttt tggggatgga 4260 

gtaggttttt ttattttaga gttttgtttt tatttttata ttgttttatt tttttatgtt 4320 

tttttttttt tattagtttg gaa * "* 4343 

<210> 429 • 

<21]> 4343 • - - . .: ? ' . J 1 /i 

<212> ' DNA " l " 5 :*?■ 
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<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 429 

ttttaggttg gtgaaggaag aaaaatatag gaagatgagg taatatgaaa ataggaatag 60 

agttttgggg tggaggaatt tattttattt ttggggtttt ttttaatagt agatggaaaa 120 

atgaagaatt tatatagtat gtttgtttta tgaagagtta ggaaaagaaa gagaataata 180 

aaagagttta ggttgggtat agtggtttat gtttgtaatt ttagtatttt gagaggttaa 240 

ggtaggtgga ttgtttgagt ttaggagttt aagattagtt tgggtaatag ggtaaaattt 300 

tgtttttata aaaaaaaaaa aaaatataat aattaatagg tttatgtggt gtgtgtttgt 360 

agttttagtg atttaggggt tgaggtggga ggattgtttg tgtttgggag gttgaggttg 420 

tagtgagttg aatttgtatt attgtatttt agtttaggtg atatagtaag tgatatttta 480 

tttttaaaag tttaaaaaat aaaaaaaaat gtttaaattt tgtaatagtt taagagattt 540 

ttgagggttg aggtggggat aggattggga ggggttggga tgagaaagtt gtggaaggag 600 

tagatggtat tttaaattag gatggggagg ggaggggagg gaaagaaggg aagtttgttt 660 

tggagtggtt taaaggagag gtaaggagat aggtagaggt tgtatattta ggattagtta 720 

ggtggagttt aagagagtat ttggtatttt tttggaatag aagtaatata gatagttaga 780 

ttgagttagg gtttgataat taagggaggt tatgatagat gttagtgatg gttattttta 840 

gaagttatgt ttagaatgtt ttttttttta attaggaatt atgtggaggg ttaagtagtt 900 

tagagaagtt ttttttgggg tgtgagtttt ggatttaatt gatgtaaata attttatggg 960 

tttgtattgt tgtgggtatt attttatttt taggtattta gaattttatt ttggagggga 1020 

gatgttttgt taggaaaata ttttagggga aatgatggtt gtgggagtat atgatgatag 1080 

attggggata atgagtagaa aatgaaagat ggggattagg tggtgttttg attaggttag 1140 

gttagtttgt tagtagattt agtggtgatg tttgggtatg tgataatttg ttggaaagga 1200 

atttttgaag ggtggtttta gaattaagat atgggtatgg attttttaga gagaggtatt 1260 

agattttgtt ttaatgtgaa atgattataa ggaagtttgt agtttttttg gtttaagtaa 1320 

ttattgaatt aaaaattatt tatataggtg aaatgatagg atggttggga attgttttat 1380 

aattttttag taaattaaat aaaatgtggg aggggataga atgaatagag tggggaggtg 1440 

ttgtatatat tgaaggtggg agagggtatg tggagattta ttggattagt ggatagtagt 1500 

gtttttttta aattagaagt agtattattt gagggtttta ataagttgta aatttttagg 1560 

agggagttga ttttagattt gtttaaaaat atataataaa aaattttagg gttagggttt 1620 

agtaatttgt gttttaataa gttttttaga ggattttgtt ttgtggtttg gatgtttgat 1680 

taaatttgag atttattgta taatattttt tttttttttt ttgtgtatgt gttttgattt 1740 

ttataataaa gggttttaaa aaattgtttt tatgggtatt tttgttgttg agttgtttat 1800 

tttttatatt atttgttttt aattgttatt taattattgt tttgattttt ttttggagtt 1860 

agaatagaaa agttagaaag tagaagttat tgagattttt ttttttttgt tgtgttttgt 1920 

tttttttttt ttttattgtt tagtttgtaa atataaattt tttgtaagag atgtttttgg 1980 

ttggtgtata tattgtttat atttgaaggt tgtgtttgta gaggggtttg ggaatgattt 2040 

ttagggtgat gtgtaggttg tattttggag ttgttttatt ttttttggtt tttattgggt 2100 

gtttttagtg tatgtttagg ttttggtttt aggtgttttt gttttttgta gttttagagt 2160 

ttgggtttag gtggtattgt tatgtgttga ttttgtgtgg gtatttgggt tttgtgagag 2220 

atgatttttt tagaattgat tttttagaat tgtagtttat gagatatggg tttttttatt 2280 

ttttttgttt ttttttttag aagtttttta aaaagatggg gaaggaggtg aggagagttg 2340 

gaggttgagt tatagggtgg ggttttttat gtattgttgt tttttttttg ttgttttggg 2400 

ttttggatga gggtgtggag tttgggagat ttgggtggtt gtaggaggtg aagggttaag 2460 

aagtgggagg tttttttttg ttttgtgtat ttaatgtttt tttttttttt tttttttgtt 2520 

ttttttgtta ttttttttgt gtgttgtaaa ataatttttt tgtttttttt ttaagatatt 2580 

agttttaggg gttattatag gaaagaaagg tagttaggtt tttttatttg tgagaattta 2640 

ttaagtatgg tttttgtata gtaggttggt tggttatggg ttgttttatg aggggtttta 2700 

aagtgggaga atttttttta agtgaggatt tagatgtatt tgtattttat tttattattt 2760 

tttttttggg aattggaatt tagggggagt ggaatgttgg tagtttttaa atgagagtga 2820 

atatagagtt atggtgggtt ttttgtagaa ttttagtttt atattttttt gtttttgtgt 2880 

tatttttttt tgtatatttg ggaaatggga taaatttttg atggtaagat gggaaatgat 2940 

tttatttttt gtttttttag tgggttttgg tgtggggttg ttattgtgtg tgggagaggg 3000 

gatgtaagta gttgttggtt tggttgatta gatgtgtatg tgtgtatttg ttagatagtt 3060 

gtgqgtttgt ttgagggtat gtgaggatgt ttttgttttg tagttttgta tgtatttgat 3120 

tgt^gtgtt^ ggtgatagtt gtgaattttg ttattggggg attgaaagag ggttttttat 3180 

tttttttagg aagttgtgat gtaaaatgtt tgttgaaagt attggaggtt ttatgtgttt 3240 

tattagaaaa taatttaata ttgatttatt atttatattt ttttttaaga attagttgtt 3300 

ttaaatgtgt attttgtttg tgtgttttgt tttatttagt tgtttagtta taataataaa 3360 

taLttgtttgt aggtttaagg tataggttta gtgattgtgg ttttgaattt tggatttggg 3420 

gatttggtgt agggggaggg gagaaggtgt ggtagtttgg ttagggtttg gggaagggtt 3480 

agtgtgatat ttttattaag tttagtgggt ggaatttttt tttttttaaa ttgttaataa 3540 

tttggattgt tggaagtaat ggtggtagga, aagttataaa agaaaatttt ttttttaaat 3600 

tttttgttgt^ttj^ -3660- 
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tgtggaatga ttrtttttgg attttttttt gtttttttat tatgaatata aatatttttt 3720 

ttatttatta tatgttattt atgtattttt atagatttta ttgttattgt tttgttgggt 3780 

tttttgtaaa aggaaagttt gttttgtttt ttttttttag ttgggagggt agggggagga 3840 

gggagtagtg ttgtttaagg gataatgtta tttttttttt tattaagatt ttttgttaat 3900 

tgttggggaa ataattgatt ttaagttatt ttagttataa ttaatagaaa aatggaatgg 3960 

tgatgtgttt agttgttaag gggaggtagg ttggaaaata aaggtggtaa ggaagggggt 4020 

tgatatgaaa aatatgtaaa gttttatgtg taaatttttg ggaaatagtg gaattttatt 4080 

ttaaaataaa ataaaaaatt agaattgtgt ttatttgtta tatttttaag gttgtggttt 4140 

ttgatttgtg tttttggata gtttaaaagg agaattttag tatagaagtt tttttatata 4200 

gttgttgttt ttttggtttt ttgatttttt ttttaattta tataatggta ttttttttta 4260 

atatttttta tttagttagt agttagaatg atttagtttt atttgattta ggggtttagg 4320 

gaataatttg gtagtttttt gag 4343 

<210> 430 
<211> 4476 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 430 

tttttaattt ttattttaag aagatttaga gtaataatgt agaaaataag tggtttgagg 60 

aggattggga gttagggtgg aaagttagta gtatagttaa ggtgtttgtt agttttttgt 120 

ttggtttgga gaggaagaag attattttaa ttttttttat ggtgagttgt attttttgtt 180 

tttttttttt tttttgtggt ggggtttgtt ttttttaggt agtttttttt ttaatttaga 240 

ttttgttttt tttggttatt atttttgttt atagtaggaa gttttgtttt tagtagtaaa 300 

tgtagaattt tttttttaat ttattattgt ttgttttagg tggaagggat aggaagtttg 360 

ttttatgaat ttggggggag aatttggttg tagattattt tggttttttg atagaatgtt 420 

tgttttttat tttttataga atagtgtttt ttttattagt ataaattgaa gtaggaattt 480 

tttatttttg gagtgggtta gttttggtta ggtttttatt tagaatggta aagataggtg 540 

agagatttgg gttttgtttg ttttattttt taggagttat gttttatagg gtgatgtttg 600 

ttagtagtat tgttttttgt ttttgttagt gtattgtttt ttgttttttg gagttttatt 660 

ttggttgtgt ttagtttagt tttttttttt tttattttat gttattgttt tttttttatt 720 

tttttttgat tgttatttgt agtttgttaa gtgtggggtt gattgtggtt attttagtta 780 

tatgtttgtt ttttgattat ggttagggta tggtagttgt ttttttttag atatgagtag 840 

ttaaggtttt gtgttggggg ttttagttta gggtagaatt aagagatgtt tattttgagg 900 

ggtgtatata tagagggtga ttttagttat ttttatgaga ttagagtttt ttagttttta 960 

ttggttgtat tttttggtgt tgtatttttg gttttatttt ttttgagttt atgaaagttt 1020 

ttttttagta atattttatt ttttttgtag aagaaatttt tttgttttta aaatttttag 1080 

gaggttagtg tagrttggag gtagtggttt tttgtttgtt tttttttatt tttgattttt 1140 

ttttttaggt attgatttat tttgttgttt tttgatttta tttattttta ttttttagtt 1200 

ttgtattttt tagttittgat ttgtattttg ttgttgttta ggtttgattt ttattttgtt 1260 

agagtttttt agttttggtt ttttttttgt ttttgttttt taatttaggt tttttgtttt 1320 

tatttatttt taatatgggt ttttttgttg tttttgtttt ttagtttaat tatgttaggg 1380 

ttttgggttt ttatggtttt tgtttttgtt gtagtttttg tggtttggtt ttgttagtat 1440 

tagaaattta tgttggtttt tgtgtatttt aataaggttt ttgggttgtt ttttatggag 1500 

agtaattgtg aggtgttgtg gtttaggtaa gtgtgttggg gtagttggtg tatttgttgt 1560 

ttttagttta ttttattttt ttgttttaat aatttttttt ttttatttgg gggttttgtt 1620 

gtgtttttgt tattttagtt ataagaaatg ggtttgtttt ttgtggttag gaagtggagg 1680 

gaataaaaaa gagtattaat gttttttttt tttagttttt ttttttagaa taggtatgta 1740 

ggaagttgtt ttaaggtttt aaagggaaat ttttttgttt tgaatttttt agggtttttt 1800 

tagggatttt ggggatagtt ggtattatag ggatttaatt tttaagggtt ggtttttatt 1860 

gttgttttga gggtttagtt tgtttggttt ttagggagtt tgttgttttt agtttaaatt 1920 

atattttata taggggtttt tttttgtttt tttttttttt tttttaaatt attttttttt 1980 

atttttatga gatttttttt ttattattgt ttttagtagt tatatttttt tttttgtgtt 2040 

tttgtgatgg ttgtttttgt tttagtattt ttttttttgt ttttattata gggtgtttag 2100 

gtgtttagtg atggttgttt tgtattttaa tttgtttttt ttaattttat tttttttttt 2160 

atagtatagt tttttagtgt gtttttgttg tttttttatt tgtttataat attagtagta 2220 

gtggtggagt tttagattga attaattttt tttggggtgt gtttagttga agtatttttt 2280 

atgttgggta gttttgatag gtgtgggatt agtagaattt gttttatggt gtgattttag 2340 

tttttttttt tggttatagt tagggttggt ggggggtttt tgggagtatt tttagtaagt 2400 

ttatttttaa gtttgtatgt aggtaagtaa ggagttttgg gtggtagaga ggtttaggtt 2460 

aggttttttt gttttatttg gggtgggttg ggggttgggg gttggggttt gggatatttt 2520 

gtattggtat tgggttttgg ggttagaaga ggttttagga agtat^agaa attaggtttt 2 580 

tgttaatatt ttatgtgttt aggtttattt ttttrtaggtt tttttttaat tttttatagg 2640 

tattttttat ttttgggttt taagtagatg gttgatgttg tttttttttt aggagagjigt 2700 
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gaatttagat gttaaaataa aagttttttt tttttttttg ggtttttatg gaaatttata 2760 

tttggtgatg tagttgtaaa gttatgaggt atgagttagg ttggggttag taaggaaaat 2820 

tttgttttgg ttttttgttt ttttgattgt tttttttatt agttggtttt gtttttggtt 2880 

gtaggtgtag ttatgttttt ttgtttgggg gttttagggt gaaatttata aatgaaatta 2940 

ttggtgaggg tttatagtgg tttttttttt aatttaattt tgatgtgtta aaggtttttt 3000 

gtgttggatt tagggtgggg atttttttat ggggtttttt atatttgagt ttttagttat 3060 

tatagaggtg tagtttgaag tgtatttagt taattggttg gttttttggg atgttttgta 3120 

tttttatttt gttatttttt tttttgtttt ggagtagtag tttaggaagt agtaggggtt 3180 

ttgagagaat aggtttgttt gttttttttt tatgttttat tttattttgt tggaggagtt 3240 

aaagttattg ttttatttga gttttagaat gtaagtgtga ggtttgtaga gagtgtgggt 3300 

*2 91:1r H2£ aa gtttgggatt ttagttttgt tgagtggttt ttttgaaaat gggatgattt 3360 

ttgaatttgt aaagttattt tttttatttg tttatttata tattgttttg ttggtttttt 3420 

ttttattttt ttttttattt tgtttttttt ttgtttgttt ttttagaaat ttgtttttta 3480 

ggtttgttag aaggtaggtg ttgagtttgt tgtgtgtgtg tgtgtgtgtg tgtgtttgtg 3540 

ttttgtgttt tgtttttatt attaattttt tttttttggt ttttattttt ttttatttgt 3600 

ttaattaagt ttgtgtggtt tttttttttt tgttatttta aagggatgaa gattgttttt 3660 

gattgggtat tagtataagg tttgtttttt gtgtttttag tataaatagg tagggttaag 3720 

aggttatatt ggtttattta gggtaaatgg ttttaaaaat gagggttttt tttgggttta 3780 

t 525HHSi ttgtttttaa gtaaggggag attgttttag ggaagttttt ttttaagatt 3840 

gttttttttt atttatttta ttttattttt attttttttt attttaggtt ttggttataa 3900 

gtgttgggat ttttttttgt tttttttttt gttatgtgta tgtgttatga ggaagtttta 3960 

gggttataag tgtatttggg atggtatttt gtttgttgtt ttttgggttt tttgaatttt 4020 

^gjtatgt gatttttttt ttgtggttta tgggattgta ggattttgga gatattatgg 4080 

ggattttggg gttttaaggt tttagtttgg tttttttaga ttttaggagt ttttgtttta 4140 

taaatggagt atagtatttt tttttggtag tttttgtaga attagtttta tttatttgta 4200 

tttttgtttt agtatttttg ttattagtag tatttggata aattaagttt tttttttttt 4260 

aggttgtttt ttttagatat ggtttgtttt ttttaaagtg tggggagttg ggatatttta 4320 

gggtaatggt tattttattt taagttgtag taaaataata ggagattttt gtatttttat 4380 

ttaggggttt ttttttatag tttttttttt ggttttttta tttttggttt tatgtttatt 4440 

ttttttgtag gttttgggga ttaaatatta agttga 4476 

<210> 431 
<211> 4476 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna CHomo sapiens) 
<400> 431 

*H 9 53 tg9t atttggtttt tgaggtttgt agagaggatg agtataaggt taggggtgaa 60 

a? 39 " 399 * a 9gggattgt gaagggagat ttttgagtaa gggtgtagag attttttgtt 120 

gttttgttat ggtttggggt agaatggttg ttgttttaga atgttttagt tttttgtatt 180 

ttggaagaga taggttatat ttggagaaaa tagtttagag gagaagaggt ttgatttatt 240 

^ a 9atgttgt tggtggtagg ggtgttgggg taggggtgtg ggtgggtggg gttagttttg 300 

^SS^Hi* ta99a ggg9g tgttgtgttt tatttgtgag ataaaagttt ttaaggtttg 360 

ggggagttag attgaaattt tggggtttta gggtttttgt agtgttttta aagttttgtg 420 

attttatagg ttatggggaa gaggttatat agttttgaag tttaggaaat ttaggagata 480 

ataaataaga tattatttta gatgtatttg taattttgga gtttttttat agtgtatgta 540 

tatgataagg gaaagggtag gaaaggattt taatatttgt gattaaaatt tggggtgaga 600 

Sff a9 HHH2 §32 999 i 999 gtgggtgaga ggagataatt ttaaagggag gtttttttga 660 

gatagttttt ttttatttga ggataggtgt taattgtggg tttaaggaag gtttttattt 720 

ttaaagttat ttgttttgag tgggttaatg tggtttttta gttttgtttg tttgtattgg 780 

gggtatggag ggtaggtttt gtgttgatgt ttaattagag gtagttttta tttttttaag 840 

atggtagaga gaggagggtt atatagattt ggttaagtag atggaggaga gtaagagtta 900 

gaaaagggga gttagtgatg gaggtaggat ataggatata gatatatata tatatatata 960 

HHH?H 9 Stttaatgtt tattttttgg taaatttgaa agatagattt ttaaaaaaat 1020 

aaataaaaaa aagataaaat aaaaaaagaa ataaaaaaaa aattaataag atggtatgta 1080 

S2J22S t: 5 gB t99 "g a g 9t ggttttatag gtttaagggt tattttattt ttggaagagt 1140 

HiJi? 9 i?i gattagagtt ttaggttttt agatttgttt atattttttg taggttttat 1200 

a I5 t9tattt: ^Sggtttgg atggggtagt ggttttggtt tttttagtag agtggagtga 1260 

aatgtagagg aagggtaggt gagtttgttt ttttaaagtt tttgttgttt tttgagttgt 1320 

; a ^335 a fi 9 tagggagagg aatggtagga tggggatgtg gagtatttta ggaggttagt 1380 

SE2££52 aa 2H 9ta J3 3 tt9t " t9 g ttgggggttt Sggtgtgaga 1440 

aa iH5 9tga agagattttt attttgggtt taatatagga aatttttggt atattggagt 1500 

iffiHHSSS aagaggttat tgtaggtttt tgttagtgat tttatttata ggttttattt 1560 

.taaaattttt-gggtagaagg- -gtacgg«g-^^«i^i»g-%t--agaaatagaa---t:raai:ragi:g 1620— 
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ggagaggtag ttaaaggggt aagagattag gataaaattt tttttgttgg ttttagtttg 1680 

gtttatgttt tatgattttg tagttgtgtt attaaatata agtttttatg ggaatttagg 1740 

agagaagggg gggtttttat tttagtattt gagtttatat ttttttagag aggaggtggt 1800 

attggttatt tgtttggggt tagagaatga ggggtgtttg tgaggagttg gggagaggtt 1860 

taagaggggt gggtttgggt atataaggtg ttgataaaga tttaattttt tgtgtttttt 1920 

gaagtttttt ttaattttag gatttaatat tggtatagag tgttttaaat tttaattttt 1980 

aatttttaat ttattttgag taaagtagga aggtttggtt tgagtttttt tgttatttaa 2040 

agttttttgt ttatttgtgt ataaatttgg aggtgaattt gttgaagatg tttttagagg 2100 

ttttttgttg gttttggttg tggttagagg gagaggttgg ggttatattg tagggtaaat 2160 

tttgttggtt ttgtatttgt tggagttgtt tagtatggaa ggtgttttgg ttggatatat 2220 

tttgggggaa gttagtttgg tttggggttt tattattgtt gttgatgttg tgggtggatg 2280 

999aggtaat agggatatgt tggggggttg tgttgtggga agagaagtgg ggttgagggg 2340 

gatgaattag ggtataggat agttattatt gggtatttgg atattttgtg gtggggatag 2400 

ggagggggat gttagggtag aggtagttat tatgaggata tagaaggaag ggtgtgatta 2460 

ttgaggatag tggtgagaag gaagttttgt ggaagtggaa ggagatggtt tgggaaggga 2520 

gggagggagg taaggaagga tttttgtgtg gagtgtggtt taggttggag atagtgggtt 2580 

ttttgggagt tgggtagatt ggatttttaa ggtggtaatg gggattaatt tttgaggatt 2640 

gagtttttgt gatgttgatt atttttgggg tttttaagga aattttggaa ggtttagaat 2700 

aaaaaggttt ttttttggag ttttaggata gttttttgta tatttgtttt gagagggaga 2760 

attggaaaag aggggtatta atgttttttt ttgttttttt tattttttgg ttgtagggga 2820 

tagatttatt ttttgtggtt aagatgataa gaatatagta ggatttttaa gtgggaagaa 2880 

gaaattgttg gggtaagggg gtagggtggg ttgagggtag taggtgtatt agttgtttta 2940 

gtatatttgt ttgggttgtg gtattttata gttatttttt gtggggggta gtttagaggt 3000 

tttgttgggg tgtatggagg ttgatatgga tttttggtgt tggtggggtt gggttgtaga 3060 

gattgtggta gaagtagaag ttgtggaggt ttgggatttt ggtatggtta ggttaggaaa 3120 

taaaagtagt ggagaggttt gtgttagggg tgagtgaggg tgggaggttt ggattaggaa 3180 

gtaagggtag gggaagggtt agagttgggg agttttggta gggtagaagt taggtttggg 3240 

tagtagtggg atgtaagtta gaattgagga atgtggggtt gagaagtgga ggtgagtgag 3300 

gttgggaagt agtgaggtgg gttagtgttt gggaagagga attagaaatg aaggagagta 3360 

gataaggggt tgttgttttt aggttgtatt ggttttttaa gaattttaag aataggagag 3420 

ttttttttat agaaagagtg gggtgttgtt gaggggaaat ttttataaat ttagaagaga 3480 

tagagttagg aatgtaatat taagaaatgt ggttggtgaa ggttggggag ttttggtttt 3540 

atggggatgg ttggagttgt tttttatgtg tatatttttt aaggtgggta ttttttggtt 3600 

ttgttttgag ttgggatttt taatataagg ttttagttgt ttatgtttag aggagggtag 3660 

ttgttatgtt ttggttgtgg ttaagaagtg agtatgtagt tgagatggtt atggttagtt 3720 

ttatatttgg taaattgtaa gtagtagtta gaggaagatg gggaggaggt aatggtgtgg 3780 

aataggggag gggaaagttg gattggatat agttaggatg gaattttagg gagtagggag 3840 

taatgtgttg gtagggatag gagatggtgt tgttgataga tattattttg tgagatatgg 3900 

tttttaggga gtgaggtagg tagggtttgg gttttttatt tgtttttgtt attttggatg 3960 

gaggtttggt tgaggttggt ttgttttaaa agtggggaat ttttgttttg atttgtgttg 4020 

gtggagagga tgttgttttg tggggaataa agggtaagtg ttttattaga aagttaaagt 4080 

ggtttatagt taggtttttt ttttaggttt atggaatagg ttttttgttt tttttatttg 4140 

aagtagatag tggtaggtta aggaaaggat tttgtattta ttgttgggag tgaggttttt 4200 

tgttataagt agaagtagta gttaaaggga atagagtttg aattaagaga agagttgttt 4260 

ggagagggta ggttttattg taggaaggaa ggagagagtg gggggtgtgg tttattgtgg 4320 

agggggttgg ggtggttttt ttttttttta gattgggtag ggggttggta ggtattttgg 4380 

ttgtgttgtt ggttttttgt tttgattttt ggtttttttt aggttgttta ttttttgtgt 4440 

tgttattttg agttttttta gagtaagaat tagagg 4476 

<210> 432 
<211> 6435 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 

<400> 432 ' * 

attttttttt ttattttttt atagtttttg aatttgtggg atagattgtg gtagtaatgg 60 

taaattgatt tgtaattagt tttaggtgta tatatatata tatatagttt gttratatttt 120 

tttagttata gttatttgtt ttttttagtt tttgttgtag ttttttatgt ttgtttttgt 180 

atatataatt gtattttttt gtatttttgg ttttggttat tggtttggtt ttttagttat 240 

ttttttttgt ataattaagt ttgtatttat atagttttat agttttagtt gatattatag 300 

ttttaagtaa atagttttgt atattattta ttttttaaat gttgagttgt ttatagtgag 360 

gtttgttatt tatataattt ttgttataat ttttgataat ttttattgta gttttatgtt 420 

tatgtagttt tttat&tgtg gttttggtta tttagtttgt tttgtttgtt attaaatagt 480 

gatagggttg tagtttttgt tttttttttg tagatattat tttgtgtata ttattatagg 540 

seite 418 



seq_prot.txt 

tttttgtata gtattttgat tatttattta ggtttagtta ttttttgaaa tatagttata 600 

attttttata tagatatatt attaaaagtt tgtggttata atttttgtat atttttatat 660 

ataattatag ttttttatgt ttttgatata tatatggtta ttgatatgta gtttttatta 720 

ttgtttttta tatatgttat tatattgttt tttttataat tgatttttgt tttatatttt 780 

ttatatatag ttatgtgtgt tatgtttagt tagttttgat aatattagag aaaaaqtqtt 840 

agtagttttt agtattattt tgaaaaatat atatttatag gaataatttt ttattttttg 900 

ggatttttta gaaaaaaaaa aggtttattt tggggaagta aaataaatag tggagatgag 960 

tgtagtattg ttttttaaat tattaatttt taggttttaa ggtttgggtt gggttagtgt 1020 

taataggtga gtttaggagt tttttgaatt tatttttttt tgtttagaat agagatagga 1080 

taggttttat gttttttatt tttttttttt aattttaggg atattgaaag ggttttttgt 1140 

atttgtttgt aggaaattgg ggggttggga gggagggaat tgaaaattta tgtttggtat 1200 

taaaaataaa tatttggagt gggggaggtg gtaggaagat ttttttttaa attttttttt 1260 

ttttatttat tttattaaat ataggaagag atgatttttt tttttttatt gaaaagtttt 1320 

atttaaaaat agattatatt attaaaatgg tagtggggga gagataggga gatttggagt 1380 

attggttgga ggggtttttt agatggggat tattttttaa aaaaattttt tattgggagg 1440 

aaataggtag gattttaggg aggttggtag ataaaggaat ggttttttag ggggaagaat 1500 

aaaagggata tttttttttg gttaaaaagt tggttgaaaa ggataagttg tttgagagaa 1560 

aggttgggga ggtggaaatt tttattttaa gggtgtgatt tttttttggt taagattttt 1620 

aatttattaa atggtataaa ttttttttgg atttttgagt atggttagga ataataaaat 1680 

aaaataaatt aattttttta tttatattta aaaatgtaga tagggttttt ttattttttt 1740 

aagggtaggt tgagaaataa agaaagaaag gttgttttta agttagattt tgattgtttt 1800 

gttttgggga aaaaagtggg gaggtggggg agatgtttta attattttaa aaggttaggt 1860 

taggtttttt ttagtggttt ttaaaaaata aataaataaa ttgtttttat tattgaagta 1920 

ggagatgtgt aagggtggtt attgggggat tgatagaaaa tagagaagtt gtggaagttg 1980 

tgtgtttttt gtggaggaga taatgagagt tatttttggg agtaattttt tttaaggatt 2040 

tttattttat tataattaga gtatgagttt tttagttgaa ttatgttttt ttttgagatg 2100 

gttggttgga gagagtatgg atggagtgat tttagaaggg gtggtggtgg tggtggtagt 2160 

agttatggtt tttggggttt tggggagagt attatggggg ttgagaggtt atttatgttg 2220 

atagaaggga tgtgtatttg ggtgttgtta ttggatggaa attgggggta aaaagaggga 2280 

gaggttatag atgggggtta tttttgggtt ttttattttg tttaaaatgg ggaaagagga 2340 

ttttatttgg aaggagagtt tggttgggtt atggggtgta atgggattta gggtgtttta 2400 

gaagtgaatg ttaggaggta gtgagttggg tgttagtagt attggggttt gtggggagaq 2460 

ggtggttgga gtttaagtga gtagaaagta gatgtgggat gttggaaggg atttggtttt 2520 

ggttttatta tagatttttt atttatattt ttagggttat tttttaaagt gtatatattt 2580 

agatggtaat tatggaggtt atgtttatga ttttttaaat tttgtagata ttttttattt 2640 

tttatttttg agagtttatt tttagttgtt gagattttat tggttaagat tatttataaq 2700 

attttattta aatgttgttt ttaaattttt taagttatta ttttaatagt ttagggttaa 2760 

tttttaattt atgttttatt tatgaaattt tatatgtata ttttagatta aaaagttttq 2820 

tttaggtttt ttaagttata ttttaattta gaaatagttt tatagaaaga tttttggttt 2880 

tatttataaa agtttttagt tttgtttttt attaattaaa ttttattttt ttagttagta 2940 

ttagtttagt taattatatt taaaggtttt atttttttta ggaataatta tatttatttt 3000 

tttttgattt ttgttagtta tttgttatat gttaatttta ttggtgttgg tgtttatata 3060 

£"J^H tt:t "gtaggtat ttattaatgt atgtgtagga agtagggatg gggttagtgt 3120 

tggttttgga gtttggaggt tggagttatt ttttgatttt ggggtttgtt tgggttttgg 3180 

tttatttagt aggtttgggt tgagtggaag ttttaggttt tggggttttt ggtttttttg 3240 

tggttgggag attttagggt tggaagttgt atggttgttt gtttgttttg tggtgtttat 3300 

aataattgtg ggtagttggg ttggtatgtt tttttgtgga gggatttttg gtttggtttt 3360 

ggtggttttt ggtataaatt tttttttttt tgtaattttt aatttttttt tgggtttttt 3420 

tatttttatt tttgggggta aagtttggtt taaatttggt tttatattaa gtttttttga 3480 

ttttaggttt ggggttttgg gtggggttag gatgatttgg tagaggagta ggtataaaga 3540 

gggtgtgtaa ggattgttgg gattatagga tttttttgtg ggttatttag tgattttttt 3600 

gggaaggtgg tatttggtat ttagggtatg attgtatatt agagtaagtt ttttaatagg 3660 

atataggatt ttgttaagga agtagggttt tttttggggt aggttttttt atgggatagg 3720 

ggatttgggg tgagataatt tgaagtgggt aggaatttga gggatattat ttttatttat 3780 

taaaatagtg gtttttaaaa atttattttt ttttggatta gtaatatttg ttttatttgg 3840 

taataagtta aaaatgtaaa ttattgggtt tttttttagt tttataaatt tagaaatttt 3900 

gggggtgtgt ttggtaattt gtggtttaat aagatttata ggtgattttg atgagtatta 3960 

gagtgtgaga attagtgtat taaatatatt ttaggtattt atatgtttgg tagttattaa 4020 

ttgtagtata gtattttttt ttttttgttt taggatttgt gtagtttttt gttttatttt 4080 

tttttttaag aggatgtttt ttttttatta ttattttttt taatgggata gttatgtttt 4140 

ttttattagt tttttttgtt atttggggat tatatatttt ttttttttat tgtttttttt 4200 

tattgaattt ttatatttta gtttttattt gtagaggtaa gttggttttt ttagttttta 4260 

gataggtttt taaggggttt ggattggtgt ttttgttttt tgaggggggt gttgttgttt 4320 

ttgtaggagt tgtattgggg agtattatag taggttgagg atgagggggt ggttgtggtt 4380 

gtagagattg gatggtgaag gtggaataga ggtttgagtg tatgtatttg ttttggttgt 4440 

tgtgtgttaa attgtttggg tttgttattt ttgtattttt gggtgtgttt ggttttttag 4500 

agatagtgtt ttttggggtg gtatgtatag tagtaggggt tatatttggt atggtggagq 4560 

taataga.tat__tt.ttggttgg-_ggt^ ^620- 
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aggatataaa tttgtagatg aatttttggt tgtttatttt gtggatgatg ttttaggaag 4680 

ggaggagagg aatagagggt tttagaggaa aggggtaggt tggaggtagg gtgttagggg 4740 

ggaggagggg ggtggagtgg tatttttttt tttttagtgg tttttttttt atttgttttt 4800 

ttttggtttt tttgtttttt ttttgtaagt ttttttttta gtttattttt ttttttattg 4860 

gtttttattt tttttttttt ttgttagatt ttttttattt atattttttt ttagtttttt 4920 

tttttttttt ttttttttta taataatttt ttttttaagt ttttttttat ttattttttt 4980 

ttaagttttt ttttgttatt ttttaggttt gtttatttga tttttttatt ttttttatat 5040 

tgtagttttt ttttataatg gggtttatga agggtagtag gtgtgggagt aggttttttt 5100 

tttggtttta agaattagag gtttataggg ttaggttttg tgggtgattg ttaggatata 5160 

gttttggaga ttagtaaatg taataagata agaaaatgaa gtaattgata taaatgttgg 5220 

aagaaagtaa attaatgtag atggatgggg tgatgaatta ttaatataga ggattatttg 5280 

aagttaatga tatagaagtg gaatagggta aggttattgt agaggagagg gtgaggaggt 5340 

tttattatta atattattgt ggaggttgaa gttaaaatgg taagaaaatg aagtaattaa 5400 

tagtaattag ataagaatat taaaaatatt ggtataaata ttgggagggg gtttatatta 5460 

aggttggttt tgttgggtta tagagtatgt ttggggtttt ttaggtttgg tgggttttag 5520 

agatgattta gtgtttttag agggtagagt gagatagttt tattattaat attgttttgg 5580 

agggttgagt ttaaggaata aaatatgaaa atggaataaa tgttataaat tatgaagggt 5640 

gtgtagtata tgattagggt agttttggga gtgtggaggt attagggtat tttgtggaag 5700 

ttttagtgtt ggttagatat gttatagggt gggtttgtgt tttttggaag aggaataaaa 5760 

agggatattt atgaatattg tttgggaaat tttagtagaa ggattaagaa aatgaaataa 5820 

ttggtataaa tattgggggg aggagaaatt tagtagtggt ttgggggttt gagagggtag 5880 

gatgggggaa attttaaatt ttagttgtag gaataagata atgaaaataa ttgatataaa 5940 

tattggggta gattagtgtt tttttgggaa tttaagaaga tagtgtgtgg aagttggtat 6000 

taaatatgtt gttttagagg ttttagttta aggaataaga gaagaaaata attagttaaa 6060 

aatgttggga gttttagagg atgaagtaag taaaagttgg tataatattg ttagggaggt 6120 

tttgatttta ggaatatgaa atattttgga ataagagaag attatgaaat aattagggaa 6180 

gtttagtatt ggtttagggg ttttagagga tagaatgagg attagttatt attaattgtg 6240 

tttttaagat gttggttata ggaatatggt aggaaaatga aatgttgtgt atggtagggt 6300 

agtattattt tggggatttt agagtgggag tagaaagagt tagtattaat aatgtagtgg 6360 

tggtttttgt taaagtagta agataagaaa gtaggtgaga tagtgtttag gggtttaata 6420 

ttggttttag ggtat 6435 

<210> 433 
<211> 6435 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 433 

gtattttaag attagtattg ggtttttaaa tattatttta tttatttttt tgttttattg 60 

ttttggtgaa aattattatt gtattattaa tattgatttt ttttattttt attttgagat 120 

ttttaaagtg atgttgtttt gttatgtata atgttttatt tttttgttgt atttttatag 180 

ttaatatttt gaaaatatgg ttaataatgg ttggtttrta ttttgttttt tgaaattttt 240 

ggattagtgt tgggtttttt tagttatttt atagtttttt tttattttag aatattttat 300 

atttttagaa ttaaaatttt tttgatagta ttatattaat ttttgtttat tttgtttttt 360 

gggattttta atatttttaa ttaattattt ttttttttta ttttttaagt taaaattttt 420 

gaaataatat gtttaatgtt aatttttata tattgttttt ttaaattttt agaggagtat 480 

tggtttattt taatatttat attaattgtt tttattattt tatttttata gttaaaattt 540 

aaaatttttt ttattttgtt tttttagatt tttaggttat tgttgggttt tttttttttt 600 

-taatatttat attagttatt ttattttttt aattttttta ttgaaatttt ttaaataata 660 

tttatgaata ttttttttta tttttttttt gagaaatata ggtttatttt gtgatatatt 720 

tgattagtat taagattttt atagaatgtt ttagtgtttt tatattttta gaattgtttt 780 

ggttatatgt tgtatatttt ttatagttta tggtatttat tttattttta tattttattt 840 

tttgagttta attttttaga ataatattag taatgaggtt attttatttt gttttttggg 900 

aatattaggt tatttt,tgag atttattaga tttgaaaaat tttaaatatg ttttgtgatt 960 

tagtaaagtt agttttgata taggtttttt tttaatattt atattagtat ttttaatatt 1020 

tttgtttaat tgttgttagt tattttattt ttttattatt ttagttttaa tttttatagt 1080 

ggtattaata atgaggtttt tttatttttt tttttgtagt gattttgttt tgttttattt 1140 

ttgtattatt agttttaaat aattttttgt attaataatt tattatttta tttatttata 1200 

«aatttatt tttttttaat attt^tatta gttattttat ttttttgttt tattgtattt 1260 

gtnagttttt aaaattgtgt tttgatagm atttataagg tttggttttg tgaatttttq 1320 

gtttttggag ttaggaagaa gatttgtttt tatatttatt gttttttatg ggttttattg 1380 

tgaagaggag ttatagtgtg gagggagtgg gaaaattaga tagatggatt tggagagtga 1440 

taaagggagg tttggaaagg ggtggatggg aaggggttta aagaaaaaat tattgtggag 1500 

agaagagaga aaggggagga attggggagg gatatagata aggggaattt ggtgggagag 1560 
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gaaagaggta gaaattaatg ggagaggaaa tgggttgaga gagaaattta tagaggaggg 1620 

atgggaaagt tggagaggag taagtgggaa gaaagttatt aggagaagag aaatgttatt 1680 

ttattttttt tttttttttt gatattttgt ttttagtttg tttttttttt ttaaggtttt 1740 

ttgttttttt tttttttttt tagaatatta tttgtaaggt gagtggttag aagtttgttt 1800 

ataagtttgt gttttatttt gaggttgtag ggtgttttat tgaggattgt ttgttttagt I860 

tagaggtgtt tgttattttt attatgttaa atgtggtttt tgttgttata tatgttgttt 1920 

taggggatat tgtttttgga aagttaggta tatttaaggg tgtaggaatg gtaggtttag 1980 

gtggtttggt atgtagtagt tggaatgagt atatgtgttt gggtttttat tttattttta 2040 

ttatttagtt tttgtagttg tagttatttt tttatttttg gtttgttgtg gtgtttttta 2100 

gtgtagtttt tgtaggggta gtagtgtttt. ttttggggag taggagtatt agtttaagtt 2160 

ttttggaggt ttgtttggag gttgaagagg ttggtttgtt tttgtaggta aggattggaa 2220 

tatagaaatt tagtaagagg aggtagtaaa gaggagggat gtgtaatttt tggatgatga 2280 

gggaggttgg tagaagagat atagttgttt tgttaaagga gataatggtg gaaggaagat 2340 

atttttttag aaggaggggt ggggtagaga gttatataaa ttttgagata gaaaggggag 2400 

aatattgtgt tatagttagt agttgttgaa tatgtaggtg tttggaatat gtttaatgta 2460 

ttggtttttg tattttagtg tttattagaa ttatttgtag gttttgttaa attatagatt 2520 

gttaggtata tttttagagt ttttgaattt gtggggttgg ggaggggttt aataatttgt 2580 

atttttaatt tgttattaaa tgaggtagat attgttggtt tagaggaagg taaatttttg 2640 

agaattattg ttttaatgaa tgaggatggt attttttaga tttttatttg ttttaggttg 2700 

ttttatttta agttttttgt tttataggag gatttgtttt agaggaagtt ttattttttt 2760 

gatgaagttt tatgttttgt tggaggattt gttttgatgt gtagttatgt tttgagtatt 2820 

aagtgttatt tttttgggga aattattggg tggtttatag gagaattttg tggttttaat 2880 

aatttttata tatttttttt gtgtttgttt ttttattagg ttattttgat tttgtttgag 2940 

gttttaaatt tgaaattgga agagtttaat gtggagttgg gtttgggttg ggttttgttt 3000 

ttagaagtga aagtagaagg gtttaaggaa gagttggaag ttgtggggga gagagggttt 3060 

gtgttagaaa ttattaaggt tgagttagaa gtttttttat aggagggtgt gttagtttgg 3120 

ttgtttgtgg ttgttatgga tattgtaggg taggtgggtg gttatgtggt ttttagtttt 3180 

gagatttttt agttgtagaa gggttggaag ttttgggatt tagagttttt atttagtttg 3240 

agtttgttag gtgggttggg atttgaatgg attttaggat tgggaagtgg ttttggtttt 3300 

taggttttgg ggttggtgtt gattttattt ttgtttttta tgtatatatt ggtgagtgtt 3360 

tgtggagggg tggggtgtgt gggtgttagt attaatggga ttggtatgtg gtgggtgatt 3420 

gataggaatt agaggagggt gagtgtggtt gtttttgaag agggtggagt ttttgagtgt 3480 

gattgattgg gttgatgttg gttggaggga tggggtttga ttagtgaagg gtagggttgg 3540 

ggatttttgt gggtgggatt gagggttttt ttgtgggatt gtttttaggt tggggtgtgg 3600 

tttaaggaat ttgagtaggg ttttttggtt tgggatgtgt atatggggtt ttatgggtgg 3660 

gatatgagtt ggaagttgat tttgagttgt tgggatggtg atttgagagg tttgagagta 3720 

gtatttggat agggttttgt gggtggtttt gattaatgag gttttagtaa ttgagggtgg 3780 

gtttttggga gtggggagtg gggggtgttt gtagagtttg agagattata ggtatggttt 3840 

ttatggttat tatttaggtg tgtgtatttt ggggagtgat tttgggagtg tgagtgggaa 3900 

atttgtggtg gggttgggat taagtttttt ttagtatttt atatttattt tttatttatt 3960 

tgagttttga ttattttttt tttatagatt ttggtgttgt tgatatttag tttgttgttt 4020 

tttagtattt atttttggag tattttgagt tttattgtgt tttgtagttt ggttaagttt 4080 

tttttttagg taggattttt tttttttgtt ttggataggg tgagagattt aagaatggtt 4140 

tttatttgtg attttttttt ttttttgttt ttagttttta tttagtggta gtgtttaggt 4200 

gtatattttt tttattagtg tggatggttt tttgattttt gtggtgtttt ttttagggtt 4260 

ttagaagtta tgattattat tattattatt attatttttt ttggggttat tttatttatg 4320 

ttttttttag ttagttattt taaggagaaa tatagtttaa ttgaaagatt tatgttttga 4380 

ttgtggtggg gtggggattt ttgggaagaa ttatttttaa gagtaatttt tattattttt 4440 

tttatagaaa atatatagtt tttataattt ttttgttttt tgttagtttt ttagtggttg 4500 

tttttatatg ttttttattt taatggtagg ggtggtttat ttatttattt tttgaaggtt 4560 

attgggagga gtttgattta attttttagg gtggttagga tatttttttt atttttttat 4620 

tttttttttt aagataagat aattgaggtt tggtttgaga atgatttttt tttttttatt 4680 

ttttagtttg tttttgggga gatgagggag ttttgtttgt gtttttggat gtgagtagaa 4740 

gagttagttt gttttgtttt attatttttg gttatattta ggggtttagg aagaatttgt 4800 

attatttaat gggttgggag ttttggttaa ggaagaatta tatttttgga atagaaattt 4860 

ttattttttt aatttttttt ttagatagtt tatttttttt aattaatttt ttggttaggg 4920 

aggaatgttt tttttgtttt tttttttgag aagttatttt tttgtttgtt aatttttttg 4980 

gggttttgtt tgtttttttt taatggaggg tttttttggg gggtggtttt tgtttggggg 5040 

gtttttttag ttagtatttt aggttttttt gttttttttt tgttgttatt ttgatagtat 5100 

aatttatttt taaatggggt tttttaatag gggagaggga gttatttttt tttatatttg 5160 

gtggggtggg tgggaaggaa gggatttggg ggggaatttt tttgttgttt tttttatttt 5220 

aagtgtttat ttttgatatt aaatatgaat ttttagtttt ttttttttta gttttttaat 5280 

tttttgtggg tgggtataaa ggattttttt aatgtttttg gagttgggag ggaggaatgg 5340 

gggatataaa gtttgttttg tttttatttt aggtaagaga gagtgggttt aaaagatttt 5400 

tgggtttatt tgttagtgtt ggtttagttt aggttttggg atttgggggt tggtgatttg 5460 

ggggatagtg ttatatttgt ttttattgtt tgttttattt ttttaaaatg gatttttttt 5520 

ttttttaaag agttttagag aatggggaat tgtttttgta aatatatatt ttttaaagtg 5580 

atgttggagg ttgttggta t tttttttttg atgttgttag gattagttga gtgtgatatg _ 56 40 
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tatgattgtg tgtggagggt gtgaggtaga ggttgattgt gagagaaatg gtgtgatggt 5700 

gtgtgtgaga aataatagta gggattgtat attggtaatt atgtgtgtgt tagagatgtg 5760 

gaaagttgtg attgtgtgtg agagtgtgta gaagttgtga ttatagattt ttgatggtgt 5820 

gtttatgtgg gaaattgtgg ttgtgtttta gaaaataatt gagtttggat gaatagttaa 5880 

ggtattgtat aagaatttgt gatggtatgt ataaaataat gtttatgaga agggaatggg 5940 

gattgtagtt ttgttattgt ttgatggtga gtaagataga ttgggtgatt aaaattatat 6000 

gtgagagatt atgtgggtat gaaattatag tagaggttgt tagggattgt ggtaaagatt 6060 

gtgtaggtga taaattttat tgtaggtgat ttaatgtttg agaaatagat gatgtgtaaa 6120 

attgtttatt tgaggttgta gtgttagtta aagttgtaag attgtatggg tgtgaatttg 6180 

attgtatggg agagaatggt tgggagatta aattagtgat tgagattaaa gatataagaa 6240 

ggtgtaattg tgtgtatgaa agtaggtgtg agagattgta gtaaagattg agaaaaatga 6300 

atggttgtga ttgggagagt gtgataggtt gtgtgtgtgt gtgtgtatat ttgaaattag 6360 

ttgtaaattg atttattatt attattatag tttattttat aggtttaggg attgtgagaa 6420 

agtaggagag aaaat 6435 

<210> 434 
<211> 4406 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 434 

ttagtttgga ttttttatgt ttggggaaaa aggttgtatt tggagtttga ggtatttatt 60 

atttgtttat tttaggtaag ggttgtttat atttaggggt tgaggatatt tttgatgtag 120 

ttattttagt ttatgggagt tggaggtttt ggttttggtt ttttagatat ggtattgtag 180 

ggttttgtgt tttgtaagtt tttaataaat gttggtttta tgggtatttt aagatagaag 240 

agagattgga atttattttt tttatattta tgtaggttat tttttttagt tgttttttgt 300 

ttttagggta gtggttatta ttttatagtt tggtttgtgg tgttgttgtg gatggttgtg 360 

gtatttgttt ttatgtttat tattggttgg ggttttattt taggttatat agttattgta 420 

tagtagggtg ggttgtttag gtggtagagt ttgggatttt taagtttttt tatggttaag 480 

ggttttatat ttattagttt tatgtagatg atgaagggtg gttgggtggt ttggaagtgt 540 

aggatgggga tttggggttt gggttttttt tgagagtgat agtggttatt agttttgttt 600 

gagtttggag tttgtttttg gagagatgtt tttagggtgt atgtagttat taagttggtt 660 

tatataggag ttagggtttg ggtagtagat aatgtgttag ggttatagat tgtggtaggt 720 

gtttagagta ggaatttggt ttttgttatt tttgtttgat atttgattgt agaattgatt 780 

agggtgggtt attttaaggt ttagatttat gagggatagt ggggtttttt ttttaagaat 840 

gggggttggg ttttttggag gttttgagga gttagtatgg tttaggtgga ttttgttaag 900 

ggggaaggtt tggagtagtt gttttttagt tttggagtag gtggttgatt ttaagggtga 960 

tattatgtgg ttttgtatat ttgagagttt ttgtggtaat agaagttttt tttgattttg 1020 

tttttgttta tgttatagaa tgttattatt tggggataag tatagaaggg tagttttgga 1080 

taggttgggg ttagggtttg tgttgtggga tagtatgggg gtagggtagg tagtgttgga 1140 

gggtggtagg atgtttattt tgtgttggtt gttggtagtt aagttgtggt atagttggtg 1200 

tttttgtttg ttagtgtttt agatggtgaa ggttgtttag tggggtaggg tttgtttttt 1260 

taggaagtgg atgtggtggg tttagatggt tgttttgtgg tggagaagag ggtgataggt 1320 

aggttggagg ttttatagat gtatatagat gttgatatat agggatatga tgtgtgttta 1380 

gtttatgttg ttagtggagt ggatatttgg tttaatgttt aagagagatt ggttgtttta 1440 

tagttaggtt atttgtttta gttggatgta gttatatgtt taggtgggga tttatttttt 1500 

ttattagatt ttaataggtg gggatagatg agaagttgtt atttaagaat ttagaaggtt 1560 

ttggaaggta gaatttttgt aggatgtaga ggattggatt tgatttaagt ttagtttttt 1620 

agagaaattt ttgtttataa ggatatatgt aattatttta taaaatttta gtgtttatat 1680 

ttggatatga ttttttgttt tagggttgtt gttttgaatt tttttatttt tttgggaatt 1740 

tggtttttgt tttttttttg ttgtgataag ttgattttag gttgttgttt tggggggagt 1800 

aggttttttt ttatgatttg ttgttttgtt tagtgtttag ttttattgtt gtttgggttt 1860 

tttttagttt tgtttttttt ttttttattg tagttttttt ttatgtgttt tattgttgtt 1920 

tgtttgtgtg tgtttttttt ttaattttgt ttttggagtt gtttgggttg agtttgtttt 1980 

aaagttttag gagttggatg ggttagaagt tagaggggta ggggatgtag aggttaatgg 2040 

ggttagaatg gtgtgggatg tgttttagat gttgtttttt ttaatggttg tgatgtagtt 2100 

atgttgaagt ttggtatttt atagattttg ttttggttga gttttgggat gttttttgtt 2160 

gtgttaaggg agttgtgttt gtgggagttt tttttggggt ggggtttggg ttagatataa 2220 

ttaatattgg ttggtgatgg gttttttttg agtagggttt ttgtgggttt agttttggga 2280 

ttggggtagg attttgttgt tttatgattt tgttgttgtt tttgtttttt gggataaatt 2340 

tgtgattttt ttttaatttt gtggtatttt ttgtttgtag tagaattttt ataggtaatg 2400 

ggatgggggg gatgggggga tatggggttt tttttgttag tttagggagt tgtagggttt 2460 

aggtattgtt agggaggttt tgtttggagt ttattttttt ttttttaatt tt^tgtagtt 2520 

attttaatag tttggatttt atttgggata tagggagatg tttattttag ttagtgattt 2580 
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ggggatattg aagtatgtgt gtgtgtgtgt atattgtgtg ggggtgtatg gtgtqtgtqt 2640 

gtgtatattg ggggtgtatg gtgtgtgtgt gtgtgtatat tgggggtgta tggtgtgtgt 2700 

gtgtatattg tgtgggggtg tatggtgtgt gtgtgtgtat attgtgtggg ggtgtatgtg 2760 

tgtgtgtgta tattgtgggg gtgtatggtg tgtgtgtgtg tatattgtgt gggggtgtat 2820 

gg 3 g 3 g * g * 9 3 atat1: 9tgg gggtgtatgg tgtgtgtgtg tgtatattgg gggtgtatgg 2880 

tgtgtatgtg tgggggtgta tggtgtgtgt gtatattgtg tgggggtgta tgttgtgtgt 2940 

gtatagtgtg tatgtgtggt gagtgtatgt gtatatgagt atgtgtatag tgtgtgtagg 3000 

tgtttggagt atatgtttgt atgtattgtg tggatgtgtg tatatgtatg tgtatltgtg 3060 

tgtgtagtga gggtttgtgt atgtagtatg agtatatgta tgtagtgagt gtatatgttt 3120 

gtatgtattg tgtagatgtg tgtatatgta gtgtgtgtgt atgtgagtgt gtgtgtgtag 3180 

tgagtgtgtg tagtgagtat gtgtgtatat ttttgtgtat gtatgtgtgt gtatgttgta 3240 

tatgtggttt gtgtgtatgt gtagtgtgtg tgtgtgtgtg agaatttggg tgaggatagt 3300 

ttttgtagat ggttgtgggg atttttttat taggtgatgt agttttgttg ggttttggta 3360 

*fZtflll 99 23S t9tt9 S fl ^sttttgggg tggggggatg agtttgattt 3420 

tttt ifH ta tt g atat t a g tttagggatt ttagggggtt tatgtgatag gaaagtagtt 3480 

agggttttag aagaaagtgg ggggtttttg tggttgtggg tggtatgtga ggtttggggg 3540 

gtttttgata ttggtttttt tgatggtttt ttgttttttg ggttttttta ggggtttttg 3600 

§5H 9 f? aaa i g"g 9 H tat t g ta ggaagg taaaatgtag aagtagttat gatgtttgta 3660 

HJ2S tttat ttttattttt tatattttaa aaagttttta tttaattagg gagaataggt 3720 

tt 3 t 3 ag99t gttttatgtt ttttttttta tatttttaga ggaagttgtg gttttgtttt 3780 

agtattggga aaggtggaaa tttagttttt tttaaaagta taagatagta tttttggaaa 3840 

tgttgtaaag gagttgaagt gttagggata tgttattgta tgaggaggtg gttaaggtgt 3900 

tggtgggtat aagggagagg agtgtggatg tgggaggttt tagtatgtag gagtgaggat 3960 

gatgttttta atggggaagt ttgtggtttt tttgttaatt ttttagtttt attttgtgat 4020 

tttggatttt agattttttg gggagttggg attttgttga gagttttaat tttattatgg 4080 

taggtttggg attattgtgg tttttatatt tatatatgat tttagttggg gttttgagat 4140 

tttgggggtt ttgagtattt gtaggggttg agatgtaagt tttggatggt ggtttgtttt 4200 

ttaatttaat gttgtggtat gtggtaggta ggtggttgag atttttgtgt ttttatagtt 4260 

ttaaggaggg aattgtttgt ttggatttaa ggaaggaatt ttatatgtag ttttattttq 4320 

atttgtttta tttttgttat ggttttaatg tagttatgtg gtatttttag atttgggttt 4380 
atgtggttgt tttttatagt gatatg u yy« L "- t 

<210> 435 
<211> 4406 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA CHomo sapiens) 
<400> 435 

59aggagtgg ttatgtgggt ttgggtttgg gggtgttgtg tggttgtgtt 60 

ggagttgtgg taggggtgag gtaaattggg gtgaaattgt atgtagaatt ttttttttag 120 

atttgggtag atagtttttt ttttagaatt gtgaggatat agaggttttg gttatttgtt 180 

^lli^E?! * g H gt9t H til tggaa - ag taggttgttg tttgaggttt gtgttttggt 240 

ttttatgggt atttagggtt tttggagttt taagatttta attagggtta tgtgtgggtg 300 

* 9ggag 5 t9t Satggtttta ggtttgttgt gatggggttg agatttttgg taagatttta 360 

gttttttagg gggtttaagg tttaaggtta tggggtgggg ttggggggtt ggtaggaggg 420 

ttatgagttt ttttattggg ggtattattt ttatttttgt atgttggggt tttttgtgtt 480 

SESSSS «*gg«gtt ttttta?e y g gtgatgtgtt 54 o 

*H ga H a 5H 5 ag 3H£H3 Qtaatatttt tagaaatgtt gttttgtgtt tttgggaaaa 600 

attgggtttt tatttttttt ggtgttaagg tggggttatg gttttttttg aggatgtaaa 660 

aagagaggta tagggtgttt tggggagttt gttttttttg attaaatgag aattttttaa 720 

aatgtgaaaa atgaagatga attaaatata gatattatgg ttgtttttat attttatttt 780 

tttatgatgg gttaatattt ttagattaga gatttttagg aaagtttagg agataaqqqq 840 

ttgttgagga ggttggtgtt agaggttttt taaattttgt gtgttattta tggttataga 900 

gg "rH gt "ttttttgg agttttgatt gtttttttgt tatgtgaatt ttttggggtt 960 

|H aag H ga iSr^S* 99 * t 9 a 9g a gg a t taggtttgtt tttttgtttt gagatattta 1020 

atatatttgg gttgttaagg ataagatgtt aaaatttggt gaggttgtgt tatttggtgg 1080 

SSJSG:*?* gj a gttgttt gtaggagttg tttttattta ggtttttata tatatatata 1140 

3 a 5H3 a * g 3 gtatratagat tatatgtata gtatgtatat atatatgtgt atgaaagtat 1200 

atatatgtat ttattgtata tgtttattgt atatatgtat ttatgtgtat atitattgta 1260 

H*?*? 1 ?*" tt:t g ta tggt atatgtaggt atgtgtattt attgtatata tgtatttata 1320 

ttgtatgtat aggtttttat tgtatatatg tgtatatatg tgtgtatata tatttatata 1380 

g * a * a 5 a H g g tat g ta1 :tt tgaatatttg tatatattgt atatgtgttt atgtatatat 1440 

gj a ii tatta tatatgtata ttgtgtatat ataatatata tttttatata gtgtatatat 1500 

atattatata tttttatata tatatattat atatttttag tgtatatata tatatattat 1560 
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atatttttat agtatatata tatattatat atttttatat agtatatata tatatatatt 1620 

atatattttt atagtgtata tatatatata tatattttta tatagtatat atatatatat 1680 

attatatatt tttatatagt atatatatat atattatata tttttagtgt atatatatat 1740 

atatattata tatttttagt gtatatatat atatattata tatttttata tagtatatat 1800 

atatatatat atgttttagt gtttttaagt tattgattgg gataagtatt tttttgtgtt 1860 

ttaggtgggg tttggattat tgaggtggtt gtagagggtt ggggaggaaa aggtagattt 1920 

taggtaggat ttttttgata gtgtttaagt tttgtagttt tttgagttga tagaggggat 1980 

tttgtgtttt tttatttttt ttattttatt gtttgtgggg gttttgttat aaataggggg 2040 

tgttataggg ttggggggga gttataagtt tattttggga ggtggaggta gtagtagaat 2100 

tgtagaatag tgaaattttg ttttgatttt gaggttgaat ttatgagggt tttgtttagg 2160 

ggaggtttgt tattagttag tgttggttgt gtttggttta gattttattt tggaggagat 2220 

ttttatggat atagtttttt tgatatagta gagggtattt taggatttgg ttagggtagg 2280 

gtttgtgggg tattgggttt tagtgtggtt atattgtggt tattggggag gatagtattt 2340 

gggatatatt ttatgttgtt ttggttttgt tggtttttgt attttttgtt tttttagttt 2400 

ttggtttgtt tagtttttgg ggttttgaga tgggtttagt ttgggtagtt ttaggagtag 2460 

ggttgaggga gaggtatata taggtagata gtgatggggt atgtggggag gggttgtggt 2520 

gggaggagag agggtggggt tgggaagggt ttgggtagtg atggggttgg gtgttgggta 2580 

gagtagtagg ttatgggaag gagtttgttt tttttgggat agtagtttgg ggttgatttg 2640 

ttatagtaga gggagaatag agattaggtt tttaggggag tggaagggtt tggggtagtg 2700 

attttgaggt agaaggttgt gtttaagtgt gaatgttggg attttatggg atggttgtat 2760 

gtgtttttat ggataaaggt ttttttggga aattggattt gggttaagtt tagttttttg 2820 

tgttttatag aaattttgtt ttttagaatt ttttagattt ttaagtgata gttttttatt 2880 

tgtttttatt tgttggggtt taatggaagg ggtggatttt tatttaggta tgtggttatg 2940 

tttagttggg gtaggtggtt tgattgtagg atagttagtt ttttttaggt attgggttag 3000 

gtgtttattt tattggtaat gtgagttgag tatatattat gtttttgtat gttagtattt 3060 

gtgtgtattt gtggggtttt tggtttgttt attatttttt tttttattgt aggatgatta 3120 

tttggattta ttgtgtttgt tttttggaag agtaggtttt gttttgttgg gtggttttta 3180 

ttatttggaa tgttggtaag taggggtgtt ggttgtattg tagtttgatt gttggtagtt 3240 

agtgtaaggt gagtattttg ttatttttta atattgtttg ttttgttttt gtattgtttt 3300 

atagtatagg ttttggtttt ggtttgttta gagttgtttt tttgtgtttg tttttaggtg 3360 

gtggtatttt gtgatgtgga tgagaataag attaggaaag gtttttattg ttatgaggat 3420 

ttttaggtgt gtagggttgt atggtgttat ttttggggtt aattatttgt tttggggttg 3480 

agggatagtt gttttgagtt tttttttttg gtagagttta tttgggttgt gttggttttt 3540 

tggggttttt agggagttta gtttttattt ttggggggag aattttgttg ttttttgtgg 3600 

gtttaggttt tggggtagtt tattttggtt agttttgtag ttgggtgtta ggtgggggtg 3660 

gtgggagttg ggtttttgtt ttgaatattt gttgtagttt gtgattttgg tatgttattt 3720 

gttgtttaag ttttggtttt tgtgtgagtt aatttggtgg ttgtgtgtat tttgagggtg 3780 

tttttttggg gataagtttt aggtttgggt aggattgatg gttattgtta tttttaggaa 3840 

agatttaagt tttgaatttt tattttgtat ttttgagttg tttggttatt ttttgttatt 3900 

tgtgtgaagt tggtgagtgt ggggtttttg gttgtggaag ggtttgggga ttttgggttt 3960 

tgttatttgg ataatttgtt ttgttgtgta atgattgtgt ggtttgggat gagattttag 4020 

ttgatggtgg gtatggggat aggtgttata gttgtttata atagtattat agattaggtt 4080 

gtggggtggt ggttgttgtt ttgggggtag gaggtagttg gagagggtgg tttgtatggg 4140 

tgtggggggg atggattttg gttttttttt tgttttaaaa tatttataaa gttaatattt 4200 

gttagaaatt tgtaaagtat gaggttttgt ggtgttatgt ttgaggggtt aaggttagga 4260 

tttttagttt ttgtggattg aggtggttgt attggggatg tttttagttt ttgagtgtga 4320 

atgatttttg tttggggtgg gtaggtgatg ggtgttttgg gttttaaatg tagttttttt 4380 

ttttaagtgt ggggagttta ggttgg 4406 

<210> 436 
<211> 4417 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 436 

gaaattttgt ttttattaaa aatatgaaaa gttatttggg tttggtggtg ggtgtttgta 60 

ggagaatgta gtgaatttgg gagtgggagg ttgtagtgag tagagattgt gttattgtat 120 

tttagtttgt atgatagagt gagattttat tttaaaaaaa aaaaaaaaag aaagaaagaa 180 

agaaaaagaa aagagggtgg agatggggga tgatatttag tttaggaggt gtttatggtt 240 

tggttttttg tggggagaag gaaggttata tgattggtgt ggtttagggg gtagggtttt 300 

agtattttaa gttgagtttt tttttttatt ttttgaggtt ggtattggta gtttaggtgg 360 

gggtttgggg atttgaatgg gatgaatggg ttaaggggga tgttattttt tttatttttt 420 

tttatttatt gttttttttt tttttttttt tttttttttt tatttttggt ttatttattt 480 

tttattttag ttaggagttg ttatttaagt agaaaagggg tttttggaaa gggggtgggg 540 
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ttttgatttt tgggtatttt gtgtttgaag aggaattttg ggaaggggtt gtttaqggat 600 

ttggtttttt ttttatttgg ttgtataatt tttttgtttt tttttttttt tagttggttg 660 

tttttttttg ttttgagatg ttaggaaaga gggggttatt tgtgtttttt atagtggttt 720 

ttgaagtttg gggtttttag ttttagagtt tagaggtgaa ggaggtgtta tagttttggt 780 

gattattggt tgttttttag ttttttttat gttatttttt tttaaaataa taaaataaat 840 

aaataaaatt gggtttggtg tagtggttta tgtttgtaat tttagtattt tgggaagttg 900 

agatgggtgg attataaagt taggagatta agattatttt ggttaatatg gggaaatttt 960 

gtttttatca aaaaatataa taaattagtt aggtgttttg gtgggtgttt gtagttttag 1020 

ttatttggga ggttgaggta ggagaatggt aggaatttgg gaggtggatt ttgtagtgag 1080 

ttgagattgt gttattgtag tttagtttta ataatagagt gagattttgt tttaaaaaaa 1140 

aaaaaaaaat tgattggaat attttttaag atgtaagatt tttagttttt tagagtttta 1200 

taggaaggat ggtagagtgt agttgtttag agttgaagtt ttatttttgt tatttgttgg 1260 

ttgtgtgatt aggtataaat tattaatttt tttgagtttg tattttatta tttgttgtta 1320 

ttgagtaata gtagtggatt attttttatt tttattttta ttttatttat tttatttttt 1380 

tttgagatgg agttttgttt tgttatttag gttggagtgt agtggtgtga ttttggttta 1440 

ttgtatttat gttttttggg tttaagtgat ttttttgttt tagttttttg agtagttgga 1500 

attataggtg tgtattatta tatttagtta attttttttt ttttttttag tagagatagg 1560 

gttttattat gttggttagg ttggttttga agttttgaat ttaggtgatg tatttgtttt 1620 

ggttttttaa agtgttggga ttgtaggtat gagttattgt gtttggatga atgtttttaa 1680 

ttatattatt attattatta ttattattat tattattaat ttttgagatg gagttttgtt 1740 

tttgttgttt aggttgaagt gtaatggtgg gattttggtt tattgtaatt tttgtttttt 1800 

aggattaaga gatttttttg tttaagtttt ttaagtattt gggagtatag gtatgtatta 1860 

ttatgtttga tgatattttg taattttagt agagatgggg tttttttatg ttggttatgt 1920 

tggtttggaa ttttttattt tagttgattt atttattgta gtttttgata attataggtg 1980 

tgagttatta tgtttagttg tttattttgt tttttattag agttttgtat tgtgattttg 2040 

tataaaatag tttggaagtt ggatttattt tgtgtgtgtg attgttttga gtttatagaa 2100 

agatattttt agagtgtgga ttgagaagtt tttattgtgg gaggattggg gtgttttagg 2160 

gttttgggag atgggatgga tttggaaggt tggggggagg ggtttttgag gaagaggagt 2220 

tttggaagtg ggggttatta taggttaagg ggtggttttt gggatttttg tagttagtgg 2280 

tgttgtggtg gtagagtgta tattgatagt tgagagttat ggtgtaggag attatggggt 2340 

ttatgttgtg tgggtagtta gggagttgga tggatttgaa gtgtatattg tggtagttgt 2400 

atattatttg aggtagggtt ggtaggattt tttgtagtat gtgggtttgg aagttgtgtg 2460 

agtgggggaa tgagtatgtg tttggggtta gtgttttagt tgagtttttt agttgttttt 2520 

ataggtttta gatttagggg tttaggaagt tttttgtttt tgttttttta tattttattt 2580 

tgtagttttt gattagagag gtagattatt tttttttttt tgttgttttt gtgggtttgg 2640 

ttttgaggtg gtagtatttg ttttggtttt gggtagttta ttatggtggg gtagtagttg 2700 

gtatagatgg tggtgttgat ggtgatgtat atggggtagt tttttttttt tatagttagg 2760 

gtggtattga tggggtggta ttgtggttga agtggttttt tggatgttta tgttttgttt 2820 

atgtttagta gtagtaatag tagtagtttt tggggtaagg atattgtttt atttgggttt 2880 

gagattgtag tttttagttt tggttttttt attttgtatg gtatattatt tataaagatt 2940 

t ^?5 g fIH t ttt1: 9g tat * ttttatttag gatttattat ttggatattt gttttttaga 3000 

gtttatttta tagtttagag gatttgagat attttaatat tttagatttg tatttttagg 3060 

aattgattta tttgaagttt attgggggtt atgttttttt agaaagaggt ttttttttat 3120 

??JES2S?5? 22H^H!£ "tttatgtt agtgatggtt tggaaggagg tggaaggtgt 3180 

ttaggggttt tgtagttttt tttggaatat ttttattttt ggtgtttgga aatgtggatt 3240 

^H" attt "gatatgtt ttttttttag tgggatattt tttgtaagta ttgggiatgt 3300 

f?«^Si^ a a 9 ta ^attga gtttttgtgg gggagtgaga tagggagtga ggggtgttgg 3360 

atgtggtatg ggaatttggt tagagttagt ggatttaatt ggttgttttt ttttagatgt 3420 

agtttttttt ttttttttta gggggtgtta tggaatgtag ggtttttatt ggttttgggg 3480 

attgggtgat gttagggatg agttttttgt gattggtttt attattttgt gtaagattaa 3540 

agggaagaaa ggatgggttt gataattgga gttattgtgg ttttggttga ttgagtttgt 3600 

Jtttgggttt ttaaggttag gtgagggtgt gagggattgg agtttgaggt taatttgggt 3660 

tttattggta ggaaaaaaaa aaaaaatttt tttattgttt tttatataat aataaaatat 3720 

aaagggagga tgttttgata ggaagaaatg atattttttt aagtgttttt aaattatttt 3780 

^HJ^B* t 5 ttttttt t: tttgggatat tgagttttgt ttggtttttg ggggattttt 3840 

J5SH!2H a 2£5 at 99 t 99 ttgttatttg tggtaaatag aatggtggga gtaagtggga 3900 

tgttttatta tttgggttga ggtaggagaa ttataggaat ttaggaggtg gaattttttt 3960 

gtttaggagg ggtgtggttt tggatttagt ttttgtttaa tgagagaggg tttttttgtg 4020 

?SSHE:2£ HS2 ttagtt: attt 9atttt ggtgttgtta atgagggatt tagtttgigt 4080 

tttatttttt ttttagggat ttttgtttat tttattttta agttaggatg tttggagtgq 4140 

HJ2 tgtgtt: ttgggtgat t taattgatta ttgaatiggt ttgtaggagg 4200 

HHBJi 9 ^ t1: ttgaggtt gtagttttgg aggatattat ttggatttag tttttttttt 4260 

gtgatgttat taagttggat aattttaagg tttgtttttt ttttaatgta gggtatagqa 4320 

tggtataggg aagagggtta gaaatttgat ttgagttttt tgtttagggt tgaatttttt 4380 

agtattttga ttatttttta ttgaattttg tttatat * yyy 9 4417 



<210> 437 , 
<211> 4417 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 437 

gtataagtaa ggtttaatga gaagtgatta ggatgttgga gagtttagtt ttgggtgggg 60 

agtttaagtt aggtttttag tttttttttt tgtgttattt tatattttat attgggaaag 120 

aaatagattt taaaattgtt tagtttgatg gtattgtggg gaagggatta agtttagata 180 

atgttttttg aggttgtggt tttgggggta ggatatattt tttgtgggtt tatttaataa 240 

ttagttaaat tatttgaagt atatgtattt ttggggattg ttttgggtat tttggtttga 300 

gggtagagtg ggtagaggtt tttaagggag aggtggggtt tgggttgaat tttttgttgg 360 

tggtattagg gttaagtggt taatttggta gtatagttat ggggaggttt ttttttattg 420 

ggtagaaatt aagtttgaag ttgtgttttt tttgggtgag gaggttttat tttttaggtt 480 

tttgtgattt ttttgtttta gtttgagtag tgggatattt tatttgtttt tgttattttg 540 

tttattatag gtgatgattg ttatggttga taggtaggga ggttttttga ggattgagta 600 

aggtttggta ttttaaaaaa aaaaaaaaag atatattgaa gtaatttaaa aatatttagg 660 

aagatgttat ttttttttat taaggtgttt tttttttatg ttttgttgtt atatagggaa 720 

tgataaaaaa attttttttt ttttttttat taatggggtt taggttgatt ttgaatttta 780 

gttttttata tttttgttta gttttgagag tttgagggta ggtttaattg gttggagtta 840 

taatggtttt ggttgttggg tttatttttt tttttttttg attttatgta gggtgatgga 900 

gttaattata agaggtttat ttttgatgtt atttagtttt tagggttagt gagggttttg 960 

tgttttgtgg tgttttttgg agggaggaag gggaattgta tttgagagag agtagttaat 1020 

tgggtttgtt gattttggtt aggtttttgt gttgtgttta atatttttta ttttttgttt 1080 

tattttttta tggagattta atttattttt tatgtttata tttttagtgt ttgtggaaga 1140 

tattttgtta agagagagat atgttaaagg tagggtagat ttatattttt gggtattaaa 1200 

gatggagatg ttttaggaaa gattgtaggg tttttgggta ttttttattt ttttttaggt 1260 

tattattggt atgagaaggg gtagatttgt gtgagttgtg gaaggaggtt tttttttgga 1320 

ggagtgtgat ttttagtaag ttttaggtgg gttagttttt gagggtgtgg atttgaaatg 1380 

ttggggtatt ttaggttttt tgggttgtgg ggtgggtttt gaaaggtagg tgtttgggtg 1440 

gtgggttttg aatagaagat gttgggaagg gtttttgggt ttttgtgggt ggtgtattat 1500 

gtgggatggg aaggttagga ttgggggttg tagttttaga tttgggtgaa gtagtgtttt 1560 

tgttttagag gttgttgttg ttgttgttgt tgagtatggg tgggatatgg gtatttaagg 1620 

agttgttttg gttatggtgt tgttttatta atgttatttt ggttgtggag aaggagggtt 1680 

gttttgtgtg tattattgtt aatattatta tttgtgttgg ttattgtttt attatggtga 1740 

gttgtttggg gttagggtag gtgttgttat tttagggtta gatttataga ggtagtgggg 1800 

gaggaagggt ggtttgtttt tttggttagg ggttgtggaa tggggtgtgg gagggtagga 1860 

atagagggtt ttttggattt ttgagtttga gatttgtggg ggtagttggg gagtttagtt 1920 

gaggtgttgg ttttaggtat atgtttattt ttttatttat atggttttta gatttgtgtg 1980 

ttgtaggggg ttttgttggt tttgttttag gtggtgtgta attattgtga tgtgtgtttt 2040 

gagtttattt ggttttttgg ttgtttgtgt ggtgtgaatt ttgtggtttt ttatgttgtg 2100 

gtttttagtt gttaatgtgt attttgttgt tgtagtatta ttgattgtgg gggttttaag 2160 

gattattttt tgatttgtga tgatttttgt ttttaggatt tttttttttt aaaggttttt 2220 

ttttttagtt ttttaagttt attttgtttt ttggggtttt aggatatttt gattttttta 2280 

taataaaggt tttttaattt gtattttgga ggtgtttttt tgtgggttta gggtaattat 2340 

atatataggg tgggtttagt ttttaaatta ttttatatag agttatagta tagaattttg 2400 

gtagaaaata gggtggatgg ttgggtgtgg tggtttatgt ttgtaattgt tggaggttgt 2460 

ggtgggtgga ttagttgagg tgaggggttt tagattaata tgattaatat ggagaaattt 2 520 

tatttttatt aaaattataa aatattgttg ggtgtggtgg tgtatgttta tatttttagg 2580 

tatttgggag gtttaggtag gagaattttt tgattttagg aggtggaggt tgtagtgagt 2640 

taagattttg ttattgtatt ttagtttggg taataaaaat aaagttttat tttaaaaatt 2700 

agtaataata ataataataa taataataat aatataatta aaaatgtttg tttaggtgta 2760 

gtggtttatg tttgtaattt tagtattttg ggaggttgag gtaggtgtat tatttgagtt 2820 

taggattttg agattagttt ggttaatatg gtgaaatttt gtttttatta aaaaaaaaaa 2880 

aaaaaattag ttgggtgtgg tggtgtgtgt ttgtagtttt agttatttag gaagttgagg 2940 

taggagaatt gtttgaattt gggaagtata ggtgtagtga gttgagatta tgttattgta 3000 

tttt^gtttg ggtgatajggg tgagattttg ttttaaaaaa aaataaaata aataaaataa 3060 

aaataaaaat agaaaatagt ttattattat tatttaatag taatagatgg taaaatatag 3120 

gtttagagaa attaatgatt tgtgtttggt tatatagtta ataaatggta gagatgggat 3180 

tttaattttg ggtaattgta ttttgttatt ttttttgtag aattttgagg aattgagagt 3240 

tttgt^tttt ggaggatgtt ttaattaatt tttttttttt ttttgagatg gagttttgtt 3300 

ttgttgttga gattggattg tagtggtgtg attttggttt attgtaagat ttgttttttg 3 360 

ggtttttgtt atttttttgt tttggttttt taagtagttg ggattzatagg tgtttattag 3420 

gatgtttggt taatttgttg tattttttag tagagatgga gtttttttat gttagt { tagg 3480 

atagtttiga ttttttgatt titgtgatttg tttgttttgg ttttt^aaag tgttgggatt 3540 

ataggtatga attattgtgt taggtttaat tttgtttgtt tgttttgttg ttttgggaaa 3600 
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gggtggtatg gagaagattg ggaaataatt agtggttatt agagttgtag tatttttttt 3660 

atttttgagt tttggggttg ggaattttag gttttggagg ttattgtgga ggatgtaggt 3720 

ggtttttttt tttttgatat tttaggatag ggaggggtaa ttggttaggg gaagaaaaga 3780 

ataggaaggt tatatagtta ggtggggaag gggttaaatt tttgaataat ttttttttag 3840 

agtttttttt taagtgtagg gtatttaaga gttaaggttt tgtttttttt ttagaggttt 3900 

tttttttatt taggtgatgg tttttggttg ggatgggagg tagatggatt gggggtggga 3960 

ggggggggag gaggggaagg gagaggtagt ggatgaagga gaatggaaga gatgatattt 4020 

tttttggttt atttatttta tttaggtttt taagttttta tttggattgt tagtgttaat 4080 

tttagagggt ggaagaggga gtttggttta gaatgttgag gttttgtttt ttgggttata 4140 

ttaattgtgt ggtttttttt ttttttatgg aaggttagat tatggatatt ttttgagttg 4200 

gatgttattt tttattttta tttttttttt tttttttttt tttttttttt tttttttttt 4260 

ttttgagatg gagttttgtt ttgttgtgta ggttggagtg tagtggtgtg atttttgttt 4320 

attgtaattt tttgttttta ggtttattgt atttttttgt aagtgtttgt tattaagttt 4380 

ggataatttt ttgtattttt agtagagatg gggtttt 4417 

<210> 438 
<211> 4420 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 438 

tttttggtag ttttgggggt ttttttttgt attttgaagg aaggggaggt gtgtggttgg 60 

aggtgagagg ttttgaggtt ttgtttggtt tttagttttt taaatttgtt tttttttggg 120 

ttgttgtttt tagagattgt tgaggatatg agttttttat ttatgggttg ttttaatgga 180 

gttgggttag gggtttttgg atagaggttg tttgtttttt tgggttttgg ggtttgtttt 240 

Jtttagtagt ttatttgggt ttttggggtt agggtattgg gtagaagtta gaggatggtg 300 

tgtttgggat ttgagttttg ggtagttttt ggggtttagg atttttggtt tttttgttta 360 

gttttatgtg ttttgtgttt tgaaattatg tgtggtttta tagttgtgtt tttagaagtt 420 

gttgtggttg ttgtgaggta gtagtggatg ggatgtttat ggtttttttt atgttatgtt 480 

gtttttagtt gggttttatt atttttttat aagttgaaag gaagtggttt ttttattagt 540 

tttggttggg agttttgtgg gtgaggtttt ttttaggagt tgttttttta agtgttttag 600 

gtattaataa gattttatgt gtgttttttt tatagttggt tttgtttagg tgttgttgtt 660 

ttagtagagg tgggtggggg gtagtgtgga gttgtattgt gaggttgtgg gtagtttggt 720 

gtttgagatt tagtggtggt ttgaagggta gggttttaat gatatttgtt tttagttttg 780 

ggatggtgtt tggttggatt gtgtttatat ttatgttatt tattattagt atgtggttag 840 

tattattttt attgatatgt ttgtggagga ggatatgggt atttatgagt gttgggttag 900 

taatgatttg gattgtaatt atttgatttg ggtgtttagg gttaagtggg tttgtgttta 960 

ggtagttgtg ttagttttgg aatgtgagtg gtgggtattt ttttttttgt tttttttagt 1020 

tttttttttg tgttgtttgt tttttggttt ttgtttagta gaatttagat gttttttttt 1080 

ttttttgttt tgttgtgttt tgttgggttt attgtttgga tggaggggtt tggatttgaa 1140 

gggggtgggt ttgttgtttg attattagtg ttgggtggtt tggtttgggg taggtagggt 1200 

tttgtttttt agtaggtggg tttttgtttt ttttttgatt tttgtgttat ttgttttgtt 1260 

gtttttgttt ttttgggtgt ttgtttggtt ttttttttga gtgtttatat tgtaagtttg 1320 

aggggttttt taagtttagt ttaggtttgt aggttttaga attttttgtg ttttttgtag 1380 

tattttgttt taagggattt ttaaatttta atttttagtg ggaatattgt aaattaatta 1440 

agaattaaga gatttgtttg gttggggatt ggtagaagaa ttggggtgtt tttagtattg 1500 

ttggagtgtt gttagtttgg ggttgggggg atgtggggtt atttgagtgt tttttagaga 1560 

agggtatggg agttttggtt gggggtgttg tggttatttt tttttttttt tttggagatg 1620 

gagttttgtt gtttttaggt ggagtgtggt ggtataattt tggtttattg taatttttgt 1680 

tttttgggtt taagtgattt ttttgtttta gttttttgag tagttgggat tataggtatg 1740 

agttaatatg tttagttaat tttttgtatt tttagaagag atggggtttt attgtgttag 1800 

ttaggatggt tttgattttt tgattttgag atttatttgt tttggttttt taaagtgttg 1860 

ggattatagg tgtgagtttt tgtgtttggt tattattgtg tttagttggt tatgttttta 1920 

gaaaagaatt tttagtttgt gttggggttg ttataagagt ttgatgtttt ggtttttgtt 1980 

tagggatgtg ggtgggtgtt ttgtttgttt ttatattttg gttttagttt tgtgtgtttt 2040 

tgggtaggaa tattgttagg tggggttttt gggtttttag tgttttagtt tagggtttgt 2100 

tagttgttag tgaagggtgt ttttttgaag ggggtgtttt tttgaagggg tttttttagt 2160 

aggtgttggg tttttttttt tttggaggtg tggttagaag ttttttttgg gtttttggtt 2220 

tttggggtgt aagggttatg gtgtattttt gggatgaaaa ttagtttttt ttggggagga 2280 

gttgtaggtt tttgggggtt aggtaggtag tgtgggttgt gtttttttat tttgttgatt 2340 

gtgttttgtt gggttttgta gttggtatag tttttattat tgtagaagat tttggtttta 2400 

agatattttt tatttgtttt ttgaatgata gtgttataga ggttataggg tattgttggt 2460 

tgaagggggg tgtggtgttg aaggaggatg tgttgtttgg ttagaaaatg gagtttaagt 2520 

gagtgtttga ttatgttatg ttgttatttg ttttttttta tggttttttt gttgttagtg 2 5Rp 
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gtttgtggtt tgtgagaata aaagattggt tggtaggttt gatttaggtg gaagttaggg 2640 

attagttttg tgtagatttt tagagggaaa ttttagggag gggtttgagg ggtgtttttg 2700 

tgaggtaggg ggttttagta gttttggtag gagtttttat tgtttgtttt ttattaggtt 2760 

ttgggtatga aagggtgtta tattgttagg ttttgggtat gtgtgtgggt tttttaggtt 2820 

ttgtgttttt ggtataagag ggtattatat tgaggtatgt tttgtttgga ttttagatgt 2880 

tgttatggtt ggagttttgt tataattggg gttgggatgg ttttgggagt atagagtttt 2940 

ttgtttttag gttgattggg aagatggtta tagggtgtgg ggtgtagagt agggattaga 3000 

ggtgtggggt ttattagttg gggagaaggg ggtttgttgg tttttttatt tgtggggttt 3060 

tgttagtttg gttttgtttt tggggttgta tggggatttt aggagttttt ttgaggttta 3120 

gatttgattg ggatagtttt gtttttttat tgtgttgtgg ttgggttttt ggagaatttt 3180 

gggtttttgg aggtgttttg tagagttggt atgtggttta tttgtttgtt gtggttgtag 3240 

ggtggatttt gatgattagt ggggagagta tttttgtgtt tttttttttg agtttatggg 3300 

tatggttaat atttagtttt atggtgagtt ttgtagttag gggtattggg tattattgat 3360 

tgttgggaga agttgttggt ttgaggtatt tggtatattt tagagttttg gttttttgtt 3420 

ttttggaggg gaatagtttt tttgtgggag gttggggatg ggggtgggtt tgtggtttta 3480 

ggtttttttt ttttattttt ttgttagggt tttttagagt gaaggttgtg aagttgttag 3540 

aatatattaa tgagggggag atggttatgt tggtttgtaa gttagagttt gtgttatttg 3600 

ttattgattg ggtttggtat aagattattg attttgagga taaggtgaga agttaaggag 3660 

gttgggggtt ttggatttag tttttaggat tgggtgagag gtttagattg ggggttttgg 3720 

atttagtttt taggattggg tgaggggttt agattggggg ttttggattt agtttttagg 3780 

attgggtgag gggtttagat tgggggtttt ggatttagtt ttttggattg ggtgaggggt 3840 

ttagattggg ggttttggat ttagtttttt ggattgggtg aggggtttag attgggggtt 3900 

ttggatttag ttttttggat tgtagttttt tagattgggt gaggggttta gattgggggt 3960 

tttggattta gtttttagga ttgggtgagg ggtttagatt gggggttttg gatttagttt 4020 

tttggattgg gtgaggggtt tagattgggg gttttggatt tagttttttg gattgggtga 4080 

ggggtttaga ttgggggttt tggatttagt ttttaggatt gggtgagggg tttagattgg 4140 

gggttttgga tttagttttt gtttttggtt ttttaggttt ttatgaatgg ttttgagagt 4200 

aggttttttg tgagtttttt gtagggttgg ttagagttat atattgagaa tttgaatatg 4260 

gaggttgatt ttggttagta ttggtgtaat ggtattagtt ttaagggttt tgattaggtt 4320 

attattatgt tttgtgtgtg tagttatttg gttgtttttt ggtttttttt gggtattgtg 4380 

gttgaggtgt tggtgttggt tattattatt tttatttatg 4420 

<210> 439 
<211> 4420 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 439 

tgtagatgaa gatgatggtg attagtatta gtattttagt tatgatgttt aggaagggtt 60 

agagggtggt taggtggttg tgtatgtgga gtgtgatgat ggtttggttg gagtttttgg 120 

agttggtgtt gttgtattgg tattggttgg ggttggtttt tatgtttagg tttttaatgt 180 

gtagttttga ttggttttgt gaggaattta tgaagaattt gtttttggag ttgtttatga 240 

gggtttaggg gattaaaggt aagggttgag tttaggattt ttagtttagg ttttttattt 300 

agttttgagg gttgagtttg ggatttttag tttaggtttt ttatttagtt tggagggttg 360 

agtttgggat ttttagttta ggttttttat ttagtttgga gggttgggtt tgggattttt 420 

agtttaggtt ttttatttag ttttgagggt tgggtttggg atttttagtt taggtttttt 480 

atttagtttg gagggttgta gtttagaggg ttgggtttgg gatttttagt ttaggttttt 540 

tatttagttt ggagggttgg gtttgggatt tttagtttag gttttttatt tagtttggag 600 

ggttgggttt gggattttta gtttaggttt tttatttagt tttgagggtt gggtttggga 660 

tttttagttt aggtttttta tttagttttg agggttgggt ttgggatttt tagtttaggt 720 

tttttattta gttttgaggg ttgggtttag gatttttagt ttttttggtt ttttattttg 780 

tttttagagt tagtgatttt gtattaggtt tagttagtga taggtggtat ggattttgat 840 

ttgtagatta gtatggttgt tttttttttg ttgatgtgtt ttgatgattt tatagttttt 900 

attttgggag gttttgatag gagggtgaga gagaggagtt tggaattgta ggtttgtttt 960 

tatttttggt tttttgtagg agggttgttt ttttttaggg agtagggggt tagggttttg 1020 

ggatgtgttg ggtgttttag gttaataatt ttttttgata gttggtggtg tttggtattt 1080 

ttggttgtag gatttattgt ggagttggat gttggttgtg tttatgggtt tggggaggaa 1140 

gatgtaggag tatttttttt attggttgtt ggagtttatt ttgtaattat agtaggtagg 1200 

tgagttgtgt attagttttg taggatgttt ttaaggattt agggtttttt aggggtttaa 1260 

ttatggtata gtgaaaaagt aaaattgttt tagttaagtt tgaattttaa aagaattttt 1320 

ggggtttttg tgtggtttta aggatagagt taggttggta agattttata ggtgagagaa 1380 

ttaatagatt tttttttttt tgattgatga attttatatt tttggttttt gttttgtatt 1440 

ttatattttg tgattgtttt tttagttagt ttgaaaataa aaggttttgt gtttttaggg 1500 

ttgttttagt tttagttatg ataaagtttt ggttatgata gtgtttgggg tttgagtagg 1560 
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gtatgtttta gtgtggtgtt tttttgtgtt aggagtatag agtttgggag gtttgtatat 1620 

gtgtttggga tttggtagtg tggtgttttt ttgtgtttgg gatttagtga ggagtaagta 1680 

atggggattt ttgttagggt tgttggagtt ttttgtttta taaggatgtt ttttagattt 1740 

ttttttgggg tttttttttg ggggtttgtg tggagttggt ttttaatttt tgtttggatt 1800 

aagtttgttg attggttttt tgtttttatg gattataggt tgttggtggt aagagagttg 1860 

tgagaagggg taggtggtgg tatggtgtgg ttaggtattt atttgaattt tgttttttgg 1920 

ttgggtagtg tgtttttttt tagtattatg ttttttttta gttagtggtg ttttgtgatt 1980 

tttgtggtgt tgttatttaa ggagtaggtg aggagtattt tggagttaag gttttttatg 2040 

gtagtgaaga ttgtgttggt tgtaaggttt ggtgagatgt ggttagtaga gtggaggagt 2100 

atggtttgtg ttgtttgttt gatttttagg aatttgtggt ttttttttgg gaaggattgg 2160 

tttttatttt aaaaatatat tgtggttttt atgttttgag gattggaggt ttaaggggaa 2220 

tttttggtta tgtttttggg aagggaagag tttagtgttt gttgaggagg tttttttggg 2280 

aaggtatttt ttttgggaag gtattttttg ttggtagttg gtgggttttg ggttaaggtg 2340 

ttgggaattt agaggttttg tttggtgatg tttttgttta gagatatatg aggttgggat 2400 

tagggtgtgg ggatagataa ggtatttatt tatgtttttg agtagggatt agggtattga 2460 

gtttttgtag tagttttagt ataaattgga aatttttttt taaaaatgta gttagttggg 2520 

tgtggtggtg gttgggtgtg gaggtttatg tttgtaattt tagtattttg ggaggttaag 2580 

gtgggtggat tttgaggttg ggagattgag attattttgg ttaatatggt gaaattttgt 2640 

tttttttaaa aatataaaaa attagttggg tatgttggtt tatgtttgta attttagtta 2700 

tttaggaggt tgaggtagaa gaattgtttg aatttgggag gtggaagttg tagtgagttg 2760 

agattgtgtt attatatttt atttgggaat agtaagattt tatttttagg aaaaaaaaaa 2820 

aaaatagtta tagtattttt ggttaaggtt tttgtgtttt tttttaggag gtatttaggt 2880 

ggttttatat ttttttagtt ttaggttgat ggtattttag tagtgttggg agtattttgg 2940 

tttttttgtt aatttttaat taagtaaatt ttttggtttt tggttagttt ataatatttt 3000 

tgttaggaat tgaaatttga aaatttttta gaatgagatg ttgtgaagga tatagaaggt 3060 

tttgagattt gtaagtttgg gttgagtttg gagggttttt taggtttgta gtatggatgt 3120 

ttagggaggg agttgggtag gtgtttgggg aagtagaaat gatggaatgg atggtatgag 3180 

ggttgagggg aggatgagga tttatttgtt ggagggtaga gttttgtttg ttttgagtta 3240 

ggttgtttag tgttggtggt taggtggtga gtttattttt tttaggtttg ggtttttttg 3300 

tttaggtggt gggtttaatg gggtatagtg ggatggagag gggaggaggt gtttgggttt 3360 

tattgagtag gagttgggag gtgagtggta taggagggaa attgagggag gtggggaggg 3420 

aggtgtttgt tatttatgtt ttaggattag tatgattgtt tgggtgtgga tttatttgat 3480 

tttgggtgtt tgggttaggt ggttgtgatt tgggttgttg ttggtttggt atttgtaagt 3540 

gtttgtgttt ttttttatga gtgtgttgat ggagatggtg ttggttgtgt gttggtggta 3600 

ggtggtgtgg atgtggatgt ggtttagttg ggtgttgttt tagagttggg agtaggtgtt 3660 

gttgggattt tgttttttaa attattattg gattttgggt attgggttgt ttatggtttt 3720 

gtagtgtagt tttatattgt tttttattta tttttgttgg gatagtggtg tttggatgaa 3780 

gttggttgtg gggagagtat gtgtggggtt ttgttagtgt ttggggtatt tgggagggta 3840 

gtttttgggg aaggttttgt ttataggatt tttagttagg gttggtggag aagttatttt 3900 

tttttagttt gtgggagagt gatggggttt ggttggaaat agtatggtat gggaaagatt 3960 

gtgggtgttt tgtttattgt tattttatgg tagttatggt ggtttttgag ggtataattg 4020 

tgggattata tgtggtttta aaatataaaa tatatgaaat tgggtagaag agttaggagt 4080 

tttgagtttt gagggttgtt tagagtttag gttttaagtg tattgttttt tgatttttgt 4140 

ttgatatttt gattttagag atttgggtga gttgttgaga aaaataaatt ttaaagttta 4200 

aaggggtagg tggtttttgt ttagaagttt ttagtttggt tttgttagga tagtttgtgg 4260 

gtggaaagtt tgtgttttta gtaatttttg agggtggtga tttagagaga ggtggatttg 4320 

ggaagttggg agttaggtag ggttttaggg ttttttattt ttagttatat gttttttttt 4380 

tttttgaaat gtaagggaag gtttttagag ttgttaagga 4420 

<210> 440 
<211> 4258 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 440 

tttttaaagt gttgggatta taggtttgag ttattgtgtt tggtttaaag taattgtttt 60 

tttattggtt tttttgtttt tggttttatt aaatgtttta ttttaaatta ttgtagataa 120 

ggggttttgt ttaaggagga gagtttatta gttaaaattt tttagtagtt tttaattatt 180 

ttaggataaa tgtagattta ttttttgttt tttgttgttg tttttttttt ttttagattt 240 

tatatggttt tagttatatt tttagattat ttgggttgtt gtttgttttg ttatatattt 300 

tttttatttt gtttttttgt aagttttatt taggttgtat ttatatagtt gttttagttt 360 

ttatttgagg tttttgtagt tttttttgaa agtttttatt atagttaatg atattgaaat 420 

tgttttttta tgtagggtat ggaaagattt atttaattta ttgtttaatt tttttttttg • 480 

gtgtagaaaa gatatttagt tagtgatttg tattatttgt aggtatgagt gaataataat 540 
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aatgtttaat 
aatttatgta 
agtagttaaa 
tttataaatg 
tgtgatttgt 
taggagatag 
ttgtttgggg 
ttttagatta 
gtggtggagg 
ggatttgggt 
tggaggtttt 
ggtatttatt 
tggggatttt 
gtttgttggt 
tgttttttga 
gggtttgagg 
aatagaagtt 
gtgggagtgg 
ggtgtgggtt 
agttgttttt 
gagaagagtt 
ttttagtgga 
ggtgttggtg 
tgtatttgtg 
ttgggagtgt 
ggagaaggtg 
gttgggggtt 
attatgatgt 
ttttttattt 
trttttttta 
attgtttatt 
atattgtatg 
aagtggggag 
tataggattt 
gttgaagtgg 

gggggtggtt 

tgtagtatga 
ttgggttaga 
ttgtgatttg 
gttgttggat 
gaagtatata 
atgggatttt 
agatgttgat 
aggataaggt 
gattgggttg 
gggtagtttt 
ttttttttrt 



gattattgag 
gatatttggg 
gtagggtttt 
ggttgttttg 
ttgtgataaa 
ttttagttgt 
tgtttgtgtg 
gtttggtttt 
gttatgaggt 
tatttattgg 
ataatgtttt 
ttgttttgag 
tttgatttag 
aaattatttt 
tgtaggttta 



ttttatatag 
atttatataa 
taatttgttt 
aattaagaat 
gtgttttata 
aggttgggag 
tttgggagta 
aattataatt 
gtgtaggtgg 
trtggttttg 
gtggtgtatg 

atatgggggg 
gtgttgtatt 
tgtgttttta 
gaagggtatt 
ggtttttttt 
tttgggaggt 
tggggagagg 
ttgtggattt 
tgtgggttga 
uttatttatt 
tttttagttt 
ttgaggagtg 
utttgggttt 
tttaagattg 
ngtaaggggg 
ttttgtttaa 
tttatgttgg 
-tttttgtgga 
t:tttttrtta 
gtattaaggg 
tggtaaggta 
tttggggttt 
aattttaggt 
gggtttgatt 
gttttttgtt 
gtnatatatt 
gtagggtgag 
tagattaagg 
gattataagg 
gaggttgggg 
gagattttat 
tttgaggttt 
ggggttttgg 
gggatttggg 
gaagttgtta 
ggttagtttg 
aagtgggtgg 
gtagggaagg 
ttaattatgg 
tgtatgttag 
taaaaatgtt 
aaattattgg 
agtataagta 
tttttatttt 

ggggtggttt 
ttttttttrt 
agatgtggtt 
tgggagttga 
gattttgagt 
ttgaattttg 
ggtattttgt 



tgttaatttt 
ttttaaaatt 
tagatattaa 
tggaaggagg 
tttaagattt 
ggatatggag 
agggtgtggt 
tttaggattt 
ttgggtgttt 
gttttgtggt 
tttgtagttt 
tatttgggtt 
tttttaagtt 
gggtttgggt 
tgggattttg 
ttgatagttt 
taggtttttt 
ttgtttgggt 
ggatttttat 
gtttgttagt 
tatatttttt 
tagtggttat 
gttttggggg 
gtttgatgtt 
ttattttttt 
tagttagttt 
ggtttttttt 
ttgtttrtag 
ttttggagrt 
gaaaagtgtt 
tttttgttgt 
gagaggattt 
agagtggtag 
ttttagaagt 
atgtggttga 
tagtgtttat 
tgtgtttttt 
gggttgagag 
attaggtgga 
atgtggtgat 
tagtaaagga 
tttttatagg 
ggaatttgta 
gatttgagat 
ttaagggttt 
gtattttggg 
attttgttgg 
attttgttag 
tttgggtttg 
tattattgat 
gaagtttagt 
tttgtatatt 
tggaggggtt 
tggtaatgtg 
tatgttattg 
gatgtgttgg 
ttgtatagag 
attattattg 
gttgaggtgg 
ttttttttgt 
agatggttat 
ttttgttagt 



seq_prot.txt 
atgtttggta ttgttttagg 
gatgatattt ttttatttta 
ggggttgagt tggggtttga 
ataagagttt tgagaaggag 
tttagttttt agggagtttg 
agttttgaga gttgtgtgga 
ttggatgtgt gggtgtttgg 
agtgggtgtt gttgtttatg 
ggtgagtgga ttgtttgagt 
aagtggggtg attttagttt 
ttattatagt gtgggtgtgt 
gaattaagtg atttgtgtgg 
ggttttaggg gtttagggtt 
ttgaaggtgt tgggtaggta 
gggtgttggg gtgaggtttt 
ttttattgtt agtagagttt 
gggtgtggtt tgtgtgtatt 
ttggggagat tgatgtatag 
aatttttttt tttttgtttt 
atttttgatg tattttggtt 
tgttttattt tgggttgttt 
tgggtttgaa aggagtaaga 
tgggtttttg ttttggtttt 
ggttattgat atatattatg 
ttataattag ttggttattg 
agggtttggg atgtaggtgg 
tttttttagt ttttagtttg 
gatggtagtt attttrtggt 
tttttagata tttaggtttt 
tggtatttgt agtaagaatt 
ttttttttta ttattggttg 
taggttagtg ggttattttg 
atgattgttt gtttaaaggt 
taaaggtgtg tatttatgga 
ttagttgggt ggtgattttt 
gtggttttgt gaatttttat 
agaagttgat agattttttt 
gttgggtttg gattattttt 
tgaggtttag tattgttagt 
ttttttggta gagggtttat 
gaggttgggt ttgttggggg 
atgaaaagtt tgtttgttat 
tgttggttat gtattatttg 
ttatttggga atattaagtg 
gggggttgag gttgtggggt 
gtggggtrag gggtgtgggt 
tattatttgt atttgttttt 
gtgaggtaag aatggtttag 
agtttttgtt tggggtatga 
atttttagtt agataatttt 
agtatttttg gtgttagtag 
gttatatgtt atttaggggg 
tttgattgaa tttttgtttt 
ttttgtgttt gtattaatgg 
gattatattt tgttggagtt 
tttgggggtg gataggaatt 
ttataatgga gagttattta 
ttaataatga tgttgatttg 
atgggatggg ggtttaggta 
tttattttgg gatttggttt 
gagttgttta ttaatggatt 
agttgaggag tttgaaattg 



tatttatata 600 

ttaagtattg 660 

agtttggtta 720 

ttaggtaggg 780 

ttttatagag 840 

ggaagtggtg 900 

gatggtatgt 960 

tgatgttatg 1020 

ttttttgttg 1080 

atttagtttg 1140 

agatggggtt 1200 

ggggttttgt 1260 

ggtgttgtat 1320 

ggggtagttt 1380 

tgggttggaa 1440 

tgggttgggg 1500 

tggggagatg 1560 

gtggagagat 1620 

ggtagatggg 1680 

tttgaagttg 1740 

gggtttttag 1800 

tgatgatttt 1860 

ttttgggatt 1920 

tggagatggt 1980 

atgtggtagt 2040 

gagggagagt 2100 

tttaatgttt 2160 

aagatttatg 2220 

agttttgttt 2280 

tttagtgagg 2340 

taattttatt 2400 

ttttgggggt 2460 

gttagggtta 2520 

gtttggaagg 2580 

atgggtaggt 2640 

attttttttt 2700 

ttggtgagtg 2760 

ttttatgatt 2820 

tggtgtgata 2880: 

gtgagagttg 2940 

tgggaagggt 3000 

tttttggata 3060 

gtgttgtatg 3120 

agatagagga 3180 

tggtgttttg 3240 

agttttgatt 3300 

tattaaagaa 3360 

ggggtgggta 3420 

tttgtgggga 3480 

ttgttatagg 3540 

tattgtttag 3600 

gtagaattgt 3660 

tatttgtaga 3720 

ttatgtggtt 3780 

gtttaagaat 3840 

ggggtgtttg 3900 

gatatttttt 3960 

attattaggt 4020 

ttgtttttga 4080 

ttgattagat 4140 

tggggttagt 4200 

agaaatag 4258 



<210> 441 
<211> 4258 
<212> DNA 

<213> Artificial Sequence 
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<400> 441 



H a HH tta attttaa 9" ttttagttgt tgatagaggt aggatattta ggtttqtaot 60 

tggttttaga tttattaata agtagtttat ggttatttta ggatttagaq aStaqtttSt 120 

J22222S2 a " aagtttt agaalggggt algaaggggt ttaaggS?? JSttSJS 180 

??«?«2J 9 ta «ttattt attttagttt aatttttatt tagggtaiat 240 

222222 9 a ttgatatt attgttggtg atggtgatga ttatatttgg gatattgtag 300 

13*1322 ?2?222 22? t99 2 ttat 9tagag gggaaagal? agtagg?a?I 360 

a ?^ a n^JfS H2 tg I ttg «tttaagtt agtatattaa gttattttat tttatggtat 420 

S2S5K? 2222222 ^gtagttta gtggtatagg gatgaagggg aattgggtag 480 

£2232 9tt att 9 a1 :gtgg atatggggtg tattgttata tttgtgttta tataqqtqtt 540 

HJ???HS2 ag i gagggtt ta 9ttagggg tttttttatt agtggtttgt aattggaaat 600 

2222222 tata 599taa tgtgtagaga tltttttgtt tgttltaatt 660 

?SSHS tatt 9 t j99 t 9tta gggatgttgt tgaatttttt aatgtgtatg gggtagtttt 720 

ZfZZfZat^ a 2222 t99 "Sgaaatgt taatagtgtt atggttgagl aatiftSSi 780 

2H2222 atgttttggg taagggttta gatttaagtt ttttttgttt tagatgtttq 840 

"* a "i ttt: 9 a 9ttatttt tgttttattt ggtaaagttt atttattttt taataatttt 900 

2222J222 aga ! gagtat a 9 a tgatgtt gataaagtta ggttggttgg agaaaggagg 960 

2?SSS 22?22J 222222* taa 9 at 9ttg gtaattttgg alttgttitl 1020 

2 a 225522 t £" atagttt tagtttttag gtttttgatt tggattttta gtttaqtttt 1080 

22222! t1: * aa 39ttt ttaggtgggt tttaggtttt agagttttat tttgttttta 1140 

222 tgtt 2 99tgatgtgt ggttaatatg tggattttaa gttttgaagt tagtgttttq 1200 

£" aagaag r a 9tggatgag ttttttattt tgtaaagagt gaggttttag aattttgij?" 1260 

t HiH tatt ttta 9taggt ttggtttttt ttttgttgtt ttaattttta tqtqttttta 1320 

2222 t9 3 a 99ttttttg ttaagagggt tattatattt ttgtggttat ttagSg"! 1380 

ttgtattagt tggtagtatt gggttttgtt tgtttggttt ttgatttgta ggttatlqaq 1440 

2222222 a 999tggttta agtttaattt tttagttttt tattttgttt tggttSgta 1500 

tttattggag ggaagtttgt tagtttttgg aaggtatgga tatatagttt atqttqtaao 1560 

gaagaggtgt gggaatttat aaagttgtat gggtattgag tagagaltag JtattttSt 1620 

22222? ggatta ! tat ttagttggtt gattatgtga ttaaattttt attttgatS 1680 

ttttaggttt tgtgaatata tatttttggt ttttgaaggt ttqqqqttoa attttatora I74n 

g "" gatat ttttaggtaa gtaattgttt gttattttgg gttttagat? ?ttS?ttJf 1800 

2222222 222 tg 2 t 2 g ttgatttaag gtttttttta ttttattatg tgtagtatgg I860 

??I??222 ?222^?2«2 aaa 99 aa 99t: agtggaagtt tttgatgtgg tgagtaattt 1920 

ttattggaag tttttgttgt aggtattgag tatttttttg ggagaaaatg aqaaqqqaao 1980 

2J2222222 agtttgagt 9 tttgggagag ttttaggatt tatgaggaaa atSIgggS 2040 

* gaa ! tttat tagaaggtgg ttgttgtttt gagagtggtt agtgtagagt attatgqtqq 2100 

S8SS8S 22222222? ^ agagaggga g a 9 a ggtttt gag?agagla tttttStK 2l60 

22222 gtttatattt tggattttgg gttggttgtt tttttgtgta ttttttttgt 2220 

§2?222 2222222 gg " gtaaaa 9gaggtgatg gttttggagt gtttttgagi 2280 

2222J 9 tggtgtgtgt tggtggttga tgttgagtgg gtttggagtg tgagtgtggg 2340 

Ja«a??a?? ??222??2 2 aag 2I g 2 t "999 a ttg ttttttagta ttgatgttag 2400 

£22222 2222^ ta 9 at "99t ggttgttagg attgaggatt tgttaggatt 2460 

attSSSS 25222222 a *999gtaag ggggtgtggg tgggtagggg tttttttttg 2520 

2222222 tta 999tgtg ttgaggatgt tgataggttt agtttgtgga gagtagtttt 2580 

2222?? 22n?22?2 a 2 agga f gttgt aa 99 a tttga atttatagaa tttgtlttat 2640 

22222 2222 9gt H2 ttaflat tt: 9ggtg a tt tttttttatt atttttattg 2700 

222222 n?22222 "9*9 tttaa aggatttagt tttttggggg tttttatttt 2760 

SS2Sai 2222JSS ^i^^Qggagg gttgttgggg gaggagtttt ttagattttt 2820 

^i ag *" gaa aa "5i gtt T tagtgttttg gggttttggg tatttttttt agggggtggg 2880 

§21222 292S& 22 tttaaat «9S9«ttg ggagtgtggt tagtgaatgt 2940 

SSmS2?? 222222S 22 99 2H" t1:t 9g9 a 9g9 tgtggtatgg agtttttagt 3000 

ttt a 222 S2J?S22? J^ggtttgg tttggatgtt ttttatatgg tagatgttig 3060 

2222Ji^ g 3 gtttgtgt tgtgatggag gttgtggatg tgtgttgtgg ggtttttgtg 3120 

gattgagtag gttggggttg ttttatttat tgtggggttg gagttaggqt ttqqqtttta 3180 

gtggaagggt ttgaatagtt tatttgttag gtgtttagtt gtttgtgttt tttottqtto 3240 

SKS2SS tgggtggtag tgtttgttgg gttttgggag tiSIJtttJ g2tglggat 3300 

RS2S22 gggtgttt g t 9tatttaggt tatgtttttg tttttggatt ttaaltagta 3360 

2I222„ no22?2« «9 a fgtttt ttgtgttttt tttagttttt attttttgS 3420 

H a H aaa J§ ggttttttgg gagttggaag attttagatg taaagtgtat aqattatqtt 3480 

2 a 2 tg . a H ttt:t "tgga gtttttgttt tttttttagt ttttggttta tttatgggta 3540 

o2 a ??22 a n22S a9t " gatttt « aatatttgga gtaagttatt SSttytS 3600 

?a?2i?22 ag ^f a f ga 9 a i attat 5 a 9 «ttggggtt gtgtggatta tatgaattta 3660 

i a J g f g Ji" H? aa HS tg ttaggtgtgg aattagtatt atataagggt taggtattat 3720 

^tatttat_ttatgtttg.t^ga^atgta-ag«gt^^^^ USo~ 
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seq_prot.txt 

aaaaaqqaga ttgggtagtg agttgaatag atttttttgt gttttgtata agaaaatagt 3840 

tttlqtatta ttqgttgtga tgaggatttt tagaggaggt tgtgggagtt ttagatagaa 3900 

attolqatag ttgtgtaggt gtagtttgag taggatttgt aagggagtgg ggtgggaagg 3960 

atStotggta ggatagatia tagtrtagat ggtttgagga tatggttgga gttatatggg 4020 

StSSSSS glSgiggtg gtagtagaga gtgaaggata agtttgtatt tgttttaggg 4080 

taqttaggaa ttattgaagg gttttaattg gtgggttttt ttttttagat agaatttttt +iw 

gtttgtagtg atttagagtg gggtgtttgg tgagattaaa ggtagagaaa ttaataagaa 4200 

gataattltt ttaggttlgg tgtagtggtt taggtttgta attttagtat tttgagag 4258 



<210> 442 
<211> 4435 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> chemically treated genoimc DNA (Homo sapiens; 
<400> 442 

-i-mraaatatt taaootttat aggtttggag tggtttttgt ttttgttgga ttatttaggg 60 

Jtottttgtt gglggggttt ttttgttttt gigttttagg ttatgtttgg aaggtttggg 120 

JtttSgtgt ttggttltgt tatttgtttg gtggttttag ggatttagtg ttggggttag 180 

tgtaggtttt gtttagtgtt tttgtttttt ttagttggtt ggtttttggg attgtgtttt 240 

tatttqtgtt ttttatgttt gtatttgtat gggtagggtg gtttggggtt ttttagtttt 300 
gagttgtggt tttttgiagt gtgttatttg gatgtgtgtt gtttgtagat tttggatgag 360 

ttqagtttgg gtgttgtgtg gtggaattga ttttgttttg gattgtaggt agtagttgat 420 

Stgtgttat tggtttgtgg gtglgatttt aggtgtgaga tgggttgtgt gtgtgtgtgt 480 
gtgtgtgaga gagagagaga gagagagaga gagagagaga gagagagaga gtgtgtgtgt 540 
otqtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgttggggg aggggtttga ggatttgttt 600 
?itagttglg tgttaggatg gggggtggat ttttaggagt tgtgtggtgt ttgtagagga 660 
gagglagtgt tggagatttt tttgttttta tttggttttt ttgttttgtt tttttttttg 720 
gggttttaga gaggggagtt ttaggtgtgg ggtgtttggt ttatgtggtg tggtgttggg 780 
toqqgtaggt gggggtgtgg tgagagttgt ggagtttttt ttatgggtgg gggggtgggg 840 
atggttgggt tttgtttttg ggggattggg gtgtgttttg ttaatgggta gttgggtgtg 900 
craqtSggtt gtggggttgt tgtttaaaga aatttgttgt gggttttgta gtgggatttg 960 

ag«t?ggtg Sggtggtgg tggtggtalg ggtgagggtt ggggttattg tgtggtgatt 1020 

ttgggttttg gagtgattgt agggtagttt tgggtgttgg ttttggtgtg tgttttttgt 1080 

gtgtgttttt ttgtgtgtgg ttttgatgtt ggatatgagt gaggtttgtt tttagttgtt 1140 

ttqtaqtttg tttggtattg ttagttttat gttgtatgtg gaggatttgg atttggatgt 1200 

qttqttgttt tttgttggtt ttgigggttt gggttgtgtg ggggttgtgg tggggggtgt 1260 

ttqgggtgat ttggtggagg tggtggitga gtgttttttg gtttgtattt gtgatgttgt 1320 

gitgtaggtg tttlagggtt atgattggag tttggtgttt atgttggtgt gtggtggtgg 1380 

tggtggtgtg tttaaigtta agttgtatgt gaagtggttt atgaatgtat ttatggtgtg 1440 

ggtgtaggtg gtgtgttgta agttggttga ttagtatttg tatttgtata atgttgagtt 1500 

taotaaqatq ttgggtaagt tgtggtggtg agtgttggtg ttttgggggt ggggtttggg 1560 

a?tt?gg?tg tttttggttt tggattgtgl gtgggggttt ggagggtgta gagtttgtgg 1620 

aoqtqqtqgt tatatgtagg ttttgggttt tgtattttgg gtgggtgtta gggtgggttt 1680 

Sgiggaaia atattttttg gttagttggg gtttggtgtt agttatggga gttgggattt 1740 

qtogagggta gtgtttgggg tgtgtatttt tgtggtgggt gtgttttgga gttagttttt 1800 

ttggtgggga Itattttgtg gagggttatg gttggtattt tgatgtgttt tttagttgtt 1860 

ataaataatt ttaattttqq gaggtatgtt tggtatgata gtaagttgta tgtggttttt 1920 

?gSlgtg« SSttSS aalgggtlta gStttttag tttttggatt atttatgggg 1980 

agggagtgtg gtaggtggat gggtgggtag taggtttgtt tgtgagtatt ttttttgtag 2040 

tttttttttt ttttggatat ttttggtttg aagtagggag ggtaggagtg gagatgatgg 2100 

aatatottot atotatqttt tggtgtggtg gggttggtta gttttttaga aattaggatt 2160 

gtagaKgla qaat??gttg tttaggatag gttgttttta ttgagtgttt tttggttggg 2220 

ttttliatoa tttttttqqt ttggggtttt ggtttttgta ttttgttttt tatttggaag 2280 

aagggagtt? ggag«St5 €att?gtatt ??ttgttltt tgtttggatt tagggtgtag 2340 

ttitggggtg ttgtggatgg taggtttttg tttttttaag ttttgttgta taggtggttg 2400 

gaggtglgtt ttgSglaglg ttttaaggag tgtgttttgt ggtttagtga ggatgggtgt 2460 

ttgggttttg ttagggtttt tgggtgtttg ggaatggagg gatagtagtt tagttgtttg 25Z0 

Sgllggttg gtattgtttt tgggaagttt tgtttttgtt tattgttggt tgggttttgt 2580 

ttgttttttt tgtttttttt agaaagtttt agagggaggg aatagtaggt «ggaatagg 2640 

gggaaataga aggaggaaga tgggaggtgt ggggtttgtt tttttttggt tgttgtaagt zzou 

itittaattt ttttilatag gtgttggttg gtttttttag aggagtgtat tgtttggtgt 2760 

tatgggaggt tttggagtgt atggtaggtg ggtttttttg gttagagttt tttggttttg za^u 

gtttttgttg ttggtttttt ttgtggtgtg tgtttgtgtt gagggtatag tgggtgtttt 2880 

ggttattttt gtttttgttt tgtgttgtag tttgttgagt gagagtgaga agtggttttt 2940 
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tgtggaggag 

ttagttatgg 

gttrgggattt 

gtattattat 

tatgggaggt 

gtgtttttga 

tgtttttgtt 

agttttgttt 

gttttttttt 

tagattgata 

ttagagtgtt 

aattttagtt 

ggggttttgg 

gtttaggttt 
tttttggtag 
tagaaggggt 
tttattttgt 
aagttggagg 
gatattttgg 
tttgattagt 

ggtgtttatt 

tttgtgttgt 
tttggttatt 
tttagtgtta 
gatttgtagg 



gtagagtgtt 
tgtaggaaga 
tattttggtg 
ggtgattata 
agttgtgggg 
gagaattggg 
tttggtgtat 
ggtagggatt 
ttatattgag 
gggtggagtt 
tggttggttt 
ttggttaggt 
gtttttttat 
gattttgttg 
ttagtgtggg 
atttattttt 
ttattatttt 
gatgttggtt 
agtttagtag 
atttgttttt 
tttatgttgg 
ttattgagat 
atggtgatta 
ttttggttgt 
tttttagtta 



tttgtgtgta 
gtgttaaagt 
gtggtgttgt 
taggtgggtt 
tttttggttg 
ttttttttgg 
gagggatttg 
ttgagaggtg 
ttgttgttta 
ttgttggggt 
ggggaggaat 
tgggtttgtt 
agtatagtgt 
gaatttttgg 
tgtttttttt 
gaagtttgtt 
taagatggag 
ggtggatagt 
tgaggttatg 

gggtggtttt 

ggtgtttttt 
gggtttttta 
gttttgaggt 
ttttgttggt 
ttatggtgtt 



seq_prot. 
gtataagaag 
tggttatagt 
gtataaggtt 
ttaggttttt 
agaagggtgt 
gttttttgaa 
tttagttatt 
gtgtttttag 
tatggtattt 
tttggttagg 
agggtttgtg 
ggtttttttt 
tttatggaat 
gaaatgtaaa 
tttttagttt 
tttttgtttt 
ttgtagtagg 
gggtgttaga 
ggtattatgg 
gttttatttg 
gtgtgggttt 
tggttgtata 
ttgtttgatt 
ttttttgttg 
tattttggtt 



txt 

gattattttg 

gattttgatt 

gaagtagggt 

tgtatatttg 

atgatggtgg 

aaagttttta 

ttgagtgttg 

taagatttag 

agtagtttta 

ttggttgatg 

tggtttttta 

ttttattttg 

ttttttggtt 

gaaataaaaa 

ggttggtttt 

tagggtagat 

tgggtgttaa 

atattgattt 

atgtttttga 

agttgggtta 
ataagagtgt 
ttaagatgga 
atggttttitg 
gtttataggg 
atgtatttgg 



attataagta 

tgggtgtgga 
ttggagatrgg 

agggtttttg 

^ggggggtag 

aggttttggt 

aggggttgat 

tttaatgtgg 

gttttagggt 

tttgttagtt 

gtgttttgta 

gtttttggga 

gaggagtgag 

gttgttgatt 

atttgtttta 

ttatgggttg 

gttggagttg 

tagtaatgtg 

tgtttatgag 

ggtttatggg 

tttgttggtt 

gtagttgagt 

tagtggttag 

tgattatggt 
ttttt 



<210> 443 
<211> 4435 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 443 



agaggttggg 
agttgttttg 
tgttgtagga 
gttgttttgt 

atggggtatt 
aggtttggtt 
ggatgttgaa 
tgttgaagtt 
ttggtttggt 
tgtgggtttg 
t:gggtggggt 
atggtttttt 
tttttggttg 
gagattaagg 
ggtgttgggg 

gtgggtgtta 
gaagttgggg 
ttggattgag 
ttttttagta 
ggttttggga 
tttttttatt 
atttttagat 
tttaagtttt 
gtttgagttg 
gtagttgggg 
ttgttttttg 
gtttattgtg 
ttaggaggtt 

aggtagtata 
tgatagttaa 
tttgggtttg 
atttagttgg 



tgtgtagtta 
tgagttggtg 
attgtagttg 
tttgatgtgt 
tttgtgggtt 
tggtttgggt 
ggtgtttatg 
gatgttttgg 
gtttgtttgt 
ttttggggat 
tggttaggtt 
gtttttttat 
agaaaatttt 
tggggggaag 
gattatatag 
gttagtttga 
ttgttgagtg 
ttttgttgaa 
tttaaggtgg 
gtttttttag 
attatgtatt 
atgtgggggg 

gttttagttt 
gagttgttgt 
tggttttttt 
tttttgttta 
tttttggtgt 
ttgattaggg 
tttttttggg 
agaagaataa 
ttgttttttt 
tggtggataa 



gggtaggtat 
aaggggttgg 
ggtgagtttt 
ggttgtgggg 
tatatggggg 

ggggtggggt 

gtgtttatga 
tgtttgttgt 
tgtagttttg 
aggagagtag 
ggaaggggag 
attttttaga 
atgaagtgtt 
aagttgatag 
attttgtttt 
ttgaggtttt 
ttgtgtgggt 
aatattattt 
ttaagtaggt 
ggaatttggg 
ttttttagtt 
tttggagttt 
tgtatatggt 
ggttggtttt 
tgtgttgtat 
gtaagttgta 
aggtatatat 
aaatttgttt 
agaattggtt 
gttttatgtt 
tttttggggt 
aagtgagg_t.t_ 



tatagtagtt ggaggtttgt 
tgggggtggt tggggtggtg 
ggggttggtt gttgtagtgg 
gatttgtttt ggtgggtgat 
atgttttggt gtggaagtag 
tgtttagggg taggtattgg 
ttttgttgtt gagttttgag 
ttattggttg gtgttttttt 
ttttgggggt ggtgggtggg 
gttttaggga tgaatgtttt 
ggatgtttgt gttaattgtt 
aattttagta aagttagatt 
gtgttatgag aagatttgag 
gtttagtttg gttagagttg 
tttttgggtt agttaagtat 
ggtaggattt tattttgtta 
agtagtttgg tgtgggaaga 
tttggggttt ttgttaagta 
tttttgtgta ttaaaaatag 
gaaggtttgg tttttttgag 
agaaatttta tagttgtttt 
atttgtgtgg ttgttatggt 
attgttgtta gggtggggtt 
ggtgtttttt ttgtgttgtg 
gtgaaggtgt tttgtttttt 
gtatagggta gaggtaggga 
tataggggga attggtagta 
gttgtgtgtt ttggagtttt 
ggtgtttgtt tgggaaggtt 
ttttattttt ttttttttat 
tttttaaggg aggtggggga 
ttttgggaat-ggtgt-tggtt 
Sei te 433 



aggttgttat 
ttggattggt 

"ggggtttg 
gtggaggttg 

gtgtttttat 
ttgaatttgt 
atgtttatat 
agttttagtt 
gtgggtggtt 
ttttgtgggg 
agggggatta 
tagattttat 
gtttttttta 
gggtttatag 
tttgaggttg 
gtttggtttt 
ggagtttgta 
aagttgttag 
gggtggttgg 
ggtgtttgtt 
ttatataggg 

ggtgtttatt 

ttagttttgt 
gttggtattt 
ttatgaaggg 
tggttagggt 
gaggttgggg 
ttgtggtatt 
ggtatgttta 
ttttttttat 
ggtgggtagg 
tttggtgggtr- 



3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4435 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 



seq_prot.txt 

aottoaqttq ttgttttttt atttttagat atttagaagt tttggtaggg tttaggtgtt 1980 

totttttatt qagttatagg atatattttt tgggattttt tttgaggttt atttttaatt 2040 

O&tratgg ?lggatttgg gggagtaggg gtttgttgtt tatagtgttt tgtggttgtg 2100 

ttttagattt agatagaagg taggaggtgt aaatatgtaa gttttaagtt tttttttttt 2160 
aantaaaaaa tagaatatqq gaattagagt tttaaattaa gagaattatt tgaaatttaa 

??iqlqqltl t?tggtgggg gtggtttgtt ttgagtaata agttttttat tttgtagttt 2280 

?aatttt?gg aaaattgltt ggttttatta tgttaaggtg tgtatatggt gtattttgtt 2340 

a««ia?tt ?tgttttttt tgttttggat tgagggtgtt taggagaggg agaaattgtg 2400 

gggagggtat ttgtaggtaa atttgttgtt tatttatttg tttgttgtat ttttttttta ^4ou 

tqqqtqqttt aggagttaga ggagttgagt tttttttaag gagggagtat ttttagagag 2520 

ttotatqtaa tttgttgttg tgttgggtgt gttttttggg gttggggtta tttgtaatgg 2580 

ttootqqqta tqttggggtg ttagttatgg ttttttgtag ggtgtttttt gttaggagaa 2640 

SH?tt?ag a^g?I?t€g ttatgaggg? gtatatttta ggtgttgttt tttgtggatt 2700 

ttoittttta tagttggtgt tggattttgg ttggttagag gatgtttttt tattgggatt 2760 

tottttqata tttgtttggg gtgtagagtt tggagtttgt gtgtggttat tatttttgtg 2820 

agSSgtg? tttt^aggtt tttgtttgta gtttgagatt aggggtggtt aaggttttga 2880 

attttgtttt tggggtgttg gtatttattg ttatagtttg tttagtgttt tgttgagttt 2940 

aatatx.ax.at. agqtqtqqqt attggttggt tagtttgtgg tgtgttgttt gtgtttatat ^uuu 

tKgaatgtg ?ttatgggtt gttttatatg tggtttggtt ttgagtgtgt tgttgttgtt 3060 

Stgtgtatt ggtatgggta ttagatttta gttgtagttt ttgagtattt gtgatatggt 3120 

5ttqtggatg tiggttggga agtgtttgtt tgttgttttt gttgggttgt tttgggtgtt 3180 

ttttattgtg atttttgtgt ggtttaggtt tttggagttg gtgggagatg gtggtgtgtt 3240 

Sagtttgag ttttttatgt gtgatatgga gttggtggtg ttggatgggt tgtagggtgg 3300 

ttgggagtgg gttttgttta tgtttagtat tggggttgtg tgtggagggg tgtgtatagg 3360 

aoatatotat tggggttggt gtttggggtt gttttgtggt tgttttggga tttgaggttg 3420 

ttqtqtqgtg qttttgattt ttgtttttgt tgttgttgtt gttgttgttg aggtttgggt 3480 

tttg«gtgg gatttgtaat aagttttttt aagtggtggt tttgtggttt tgttttgtgt 3540 

ttqqttattt attggtggag tgtgttttga ttttttggag gtggagtttg gttgtttttg 3600 

ttttSSS Sitaagaal gttttgtggt ttttgttgtg "tttgtttg "ttgtttgg 3660 

tgttgtatta tgtgggttag gtgttttatg tttggagttt ttttttttgg agttttggga 3720 

aaqqaagtgg ggtagggagg ttgggtggga gtaggggggt ttttggtgtt tttttttttt |7»u 

tgtgggtgtt gtgtggtttt tgagagtttg ttttttgttt tgatgtttag ttaggaagtg 3840 

qqtttttggg tttttttttt gatatatata tatatatata tatatatata tatatatata 3900 

titatttttt tttttttttt tttttttttt tttttttttt ttttttttta tatatatata 3960 

SSStata qtttgttttg tgtttgaggt tttgtttgta ggttggtggt gtggggttag 4020 

Sqttqtttg tggtttagga tagggttggt tttgttgtgt agtgtttagg tttagtttgt 4080 

ttagagtttg tggatagtgt atgtttaggt aatgtgtttt aaggggttgt ggtttggagt 4140 

tqqgggattt tgggttgttt tgtttatatg agtgtgagtg tggagagtgt gggtgggggt 4200 

qtqgttttgg gggttggtta gttggagggg gtgggagtgt tgggtggggt ttgtgttgat 4260 

tttqqtattq Iqtttttggg gttgttgggt ggatggtgtg gttggatgtg ggggttttgg 4320 

gS&t&M SSSttag ggtttggggg ?ggggaggtt tttttggtgg ggtggttttg 4380 
ggtggtttgg tagiggtgga ggttgttttg agtttgtgag ttttgggtgt ttgtg 



4435 



<210> 444 
<211> 4784 
<212> DNA 

<213> Artificial Sequence 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 444 

tttoottota ttgqqattta tagagttata ttttggaggg tttttttttt gatattgata 60 

agggtStgS Satatttta gtgagtgtta tatggttggg ggttaatggg agttatattt 120 

tttigatttt tagtgtgttt tttattgagg tttattttga agattatagt gagttgtagt 180 

tggtgaggat agttttttta gattttggtg aggtggtatt atttgtttgt f«gtggatg 240 

altttgtgat tgttttgatg gtgattttgg atgttgattt tattaagatg attttaaagt 300 

aaaggattga ttttttgtat aggatgtgga gttttttaga agtagagttt tataatatga 360 

aattaqtqtt ggtggtgaat aatagattat ttgatatgtt ggtttttatg gttggtttgg 4^0 

SaSgilaa llaggtggtg gagaatgggg tttttttttt ttggaagttg 9|ttgttttt 480 

toaattaqaa taqtqtgttt gatatttatg gtgtagaggt ttttgttagg gagggtgtaa i40 

tgtSSttS Jtttggttat tttgtggtgg gttggtatat tgttaataag aagttttttt 600 

tttttaaatg tgtttggagg tagatttatg ttatatttat atttgttatt |«attgggt 660 

ttttaattat ggttatttag gagtttttat ttaggattgt gttaattttt atatttttag 7|0 

ttattgtttt tttaatagag attatggttt ttttagt^ag gga«ttgtt tttgggaaat 780 

ttataattat tatttqqatt tgaggtgtta ttatttaaat tttaattpta ggttttattt Hgu 

■.JJg||gJJ ggigtllgaa gttggtltta tagtttttgg ttagatttgt ttaatgatga 900 
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ttatttttgg 
agttatgagt 
tttgtaggtt 
tttttattat 
gagtgttttg 
tagatgtttg 
atgaggatat 
tgggtgagaa 
atagtagtta 
tggttgtgga 
taaggtttaa 
agattgtttt 
ttttgtagaa 
tggagttttg 
gaaaattttg 
ttgtgatggg 
gagtgttttt 
aggatgatgt 
ttgttggtat 
aggattaggt 
attttaagtt 
tatttttttt 
ttatgggaga 
tattgttttt 
attggttatt 
agggtagggg 
tgtttattgg 
gttttttttg 
tttatttttt 
tggttggttt 
atttgtaggt 
tattgttttt 
aagattaaat 
ttatttaaat 
agtttttagg 
tattattttt 
ttttttggtt 
ggggttggaa 
attagtggtt 
ggtggttgtg 
gggaggttaa 
tttttttttt 
tttggtgaag 
gttttgtttt 
ttagttttag 
agtaatagtt 
atatagtttt 
tgtatgagtt 
tttttttatt 
ttagtagaat 
tgttttgatt 
agtgttattt 
gattgagggt 
tatattgatt 
agggggggag 
ttatagggtt 
tgtttttgtg 
tttttaagtt 
gtgagattat 
gttttgaaag 
tatgtagaga 
tagtgaatag 
gggttggtga 
attaattatt 
ttgtaaagaa 



ttatgtggag 
atttatatta 
aattaagaaa 
tagattggaa 
tggtggagaa 
ggttggtatt 
tattattgat 
gttttgggta 
tgtgggtaaa 
tgtttttgag 
ggttaagttt 
ggtaaagaaa 
tattatttgg 
ttttaaggag 
gtttgttttt 
ttttggtagt 
agaggtgttg 
ttatttgtat 
tattgttatg 
tatttttatt 
tttatttttt 
tgagtatttt 
gtatatgttt 
tatagtattt 
ttttttttat 
tttggagatg 
gagttgatat 
gtttttttaa 
gtttaatagt 
tgaaagtatt 
agtgttgggt 
tttatttatg 
ttaaaggatt 
ttgtatgtat 
gtgttttaaa 
ttatattatg 
ttggtttatg 
gttggagggg 
tatgattaag 
gtttttaatt 
aggtttttat 
tttttttaaa 
atgtaatatt 
aatagtttat 
gttttgggtt 
tataataaaa 
aaattgtttt 
tttaagtgtt 
ttttttaatg 
atttgtttgt 
ttttttttgt 
tgggtatttt 
gggtgggtgg 
gttttgtaga 
aggtttttgt 
tagttttttt 

ggggagtttg 

agagtttttt 
ttttgatgat 
gttttttaaa 
tgtggttggt 
aataataata 
tggttgttgg 
gatgtgtttt 
atatattaaa 



tttattgtag 
aaattagtaa 
ttatggatat 
attgttttat 
tttaattagt 
tattttgagg 
aagttgaagt 
tagtttaata 
tatgagtatt 
atttatgttt 
gtgggtgatt 
ttggtttttg 
ggttttattg 
tagattgttg 
tttaatgttt 
tgttggtatt 
tttatagaag 
atagttattt 
atttgttatt 
aagaaggggg 

tttagtatgt 
aattagagtg 
ttgtgggatg 
atggagggta 
gttttatttt 
atatggtgtt 
ttgatttagt 
attttaaagt 
ttttggtttg 
atgtggagtg 
ggttttttgg 
tgtgtttagt 
tttagaagtt 
aatttttggg 
tttagagaag 
ttaatttggt 
ttaagttttt 
ttttttgggt 

gggtttttat 

ttggtgtttt 
tatattaatt 
aggaatatta 
tttatgttat 
agttttgttt 
tgaattattg 
atttgttagt 
attaaaaaaa 
gattttatgg 
gtttaaattt 
ttatttgtat 
ggaagaaatt 
ttagggagga 
atttaggttt 
ggatttgtgt 
tttattgttt 
tttttgtttt 
taaatgtggg 
tggtgtttta 
tttattaaaa 
tagttttgtt 
ttagaatggt 
gtaataatgt 
attttgtgag 
tatttttttt 
ttaaataaaa 



<210> 445 
<211> 4784 



seq_prot . 
ttgttatttt 
tgtttttaat 
tttggttagt 
tgtttatttg 
gtttagagtt 
tgaagatttt 
tgattttgaa 
gtaatagtta 
ttatgtatgt 
ataggtgttt 
tggtattggt 
tttttggaga 
tggtggaatg 
ggttgagttg 
tagagtttga 
tatagtttat 
tgtttgatag 
tggttgtggt 
gtaagaagtg 
tgtttattat 
tatttatttt 
tgtttgagat 
aggattttaa 
agggtttttg 
aatttgtaag 
gtttgtggag 
atatattgat 
agttggagag 
tttatagaga 
gaggtggagg 
ttttttgtgt 
tgtaggatgt 
aaggttaagt 
tgggtatggg 
attaagggat 
tgttttgata 
ggtgttgggt 
tatggatatt 
ttttatgaaa 
agggtgggtt 
tattttgttt 
tggttttttg 
gtgatgtttt 
tgatttattt 
gaaaaggttt 
ttttaaagtt 
agatttaaaa 
aattgatggt 
tggaattata 
ttttgtttta 
attttgagta 
atgttttttg 
tttttgtata 
gtgttgtttt 
tttagagggt 
agtttttagg 
tttagtggat 
gagataggag 
aataaataat 
gtttttagta 
ttgttgttat 
ttttgtaggt 
atggagagtt 
tatatgattt 
gtagtgtttt 



txt 

ttttataatt 
tgattttatt 
gttttgggtt 
tatttgtatt 
taagaattat 
gttagatatt 
attgtgggag 
gtttatgtat 
tatagataag 
ttaaggggat 
gttgaatgat 
ttgaaattgt 
gattaataat 
ttggattgtt 
ttttaaggtt 
ttttgtggta 
ggattttgag 
ggttgtagtt 
gaagggtaag 
ttttgtagat 
gtaggaggag 
tatttttttg 
tgtgtttttt 
ttttaagaat 
tgtttgggtg 
attggtggtt 
ataggggttt 
attttgggga 
attttttgtt 
gagtgaggaa 
tttgttttta 
aatatggaaa 
ttttatgttt 
gaattgtttt 
agtatttttt 
ttttagagtt 
ttgttttttt 
tttattttta 
aagggatttt 
agttgtttgt 
tatttttttt 
aaatatttag 
ttttttattt 
tatttttttt 
ggtggttggg 
aattttttat 
tggtgatgtt 
tttgtttgtt 
ttggggtttt 
tgatttaggg 
tgattttttt 
taataatgta 
tagagtagtt 
aggaggggag 
attttttttt 
gggtattttg 
tattggtgat 
tataagtggg 
ttttaatgtt 
attttattat 
agtaattgga 
agtttgtttt 
aattttatat 
taagatgtgt 
tatt ■* 1 



attattaaga 
attattatga 
attattaaag 
attattagtg 
attgataggg 
ttttatgatt 
tagtagttgg 
ggtttttttg 

gggggtttgt 

agggtttttg 
atttataaga 
agtattatta 
atattgtttt 
gaggatgatg 
ataagtatta 
ttatttagga 
aagagtagtg 
attttgttta 
tttatttttg 
gaattggatg 
aaggtttttt 
aattaggata 
tattagtttt 
atgattttat 
gaggtagggt 
tgtagattat 
ggataagttt 
tttttttatt 
ttatttttga 
ttatgaatga 
atattaattg 
atagtaatta 
aatttgttgt 
gttaaaaata 
attaaaggaa 
tgattggggg 

gttgttgtta 
gtttatgtat 
aagaggtagt 

gggggtattt 

ttgtgtattg 

tgggggatat 

gattttggtt 
ttttggtatt 
gaggagtgtt 
taaagttttt 
tatagtagtt 
ttgatttttt 
tttgtttttt 
gtgtttattt 
tgatgtttga 
tttatttttt 
atttttaagt 
ggttggtagg 
gtgttttttt 
gagtgagtag 
tgggtttatg 
atttgatttg 
ttaggtgagg 

tgggtattgt 
ggtgatgggg 

tttgagtgtt 
ttaagtgttt 
tttttgtatt 



960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4784 
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seq_prot.txt 

<212> dna 

<213> Artificial Sequence 

<220> j A 

<223> chemically treated genomic DNA (Homo sapiens} 
<400> 445 



aataaagata ttgtttttat ttagtttgat atgttttttt atagaatgta gaaaatatat 60 

tttaaaatta tatagaagga aataaaaata tattagtggt tggtgaatat ttgaatgtga 120 

gattggtttt ttattttata gagtttaatg attattatta gtttagtgtt taggggagta 180 

ggttgtttgt aaaggtattg ttgttgttgt tattttgttt attgttttat tgtttttagt 240 

tgttatggta ataggttatt ttgggttagt tatgtttttg tatggtagtg tttaatggtg 300 

gagttgttag gggtgatgga gttgtttgga aggtttttta aagtttttat ttggaatatt 360 

gggaattgtt tattttttga tgaggttatt agaaataatt ttattaggtt agattttatt 420 

tatgtttttg tttttggggt attagggaaa ttttgatttg gaggtatgag tttagttatt 480 

agtggtttat tgagtttgta tttataggtt tttttatagg ggtattgttt attttagagt 540 

attttttggg gattggggta ggggagaggg gttgggtttt gtgggagaag gtgtaagggg 600 

aaatgttttt tggagagtag taggatagag attttttttt tttttttatt agtttttttt 660 

ttttgaggta gtatatataa attttttgta aaatagttga tatggtttag aagtagttgt 720 

tttgtgtgta aagggagttt gggtttattt atttattttt aattagggaa tggatatatt 780 
attgtgaaag gtattttttt ttaaaaagta tttaaaataa tgttttagat attaagaaaa 
gttatgttta aaatggtttt ttttatagga ggagaattga agtagagtgg gtgtttttga 
gttatgggat agagatgtag atggatggat ggatgttttg ttaaaaaagg taaaagaatt 

ttagtgtaat tttagaattt aaattattag gaaaagtagg gggaaaagaa ttaaaataag 1020 

taaagttgtt agttttatgg aattaatatt taagagtttg tataaattgt tgtaagtatt 1080 

attattttaa attttttttt taataaaata atttgaggtt gtataaaaat tttagtaaaa 1140 

aattagtttt gagagttaat agatttttat tatgaattat tgttggtatt tttttttagt 1200 

tgttagattt tttttagtag tttaggttta aggtttggga ttggagtgtt agagaaaagg 1260 

gatggggtgg gttggggtag gattgtggat tgttaggata aagtggttaa ggttaagtga 1320 

ggaiigagta ttatatgati taaaaatatt gtatttttat taaaatgttt tttattgagt 1380 

gttttaaaaa attgtgatat tttttttagg aggaaaaaaa aaaataatgt atagaaaaag 1440 

ggtaaaataa aataggttga tgtggtggag atttttgatt ttttaggtgt ttttataagt 1500 

agttggttta ttttggagta ttaaagttgg gggttatagt tattattgtt ttttggagtt 1560 

ttttttttat ggaaatgaag attttttggt tgtgggttat tagtgtatat gggttggaag 1620 

tggggatgtt tatggtttaa gagatttttt tagtttttag tttttggtaa tagtgggaga 1680 

gtaaatttag tattagggat ttggtgtgag ttagggttgg gaggttttta attaggtttt 1740 

ggggtattag agtaattaag ttgatgtagt gtgaaaaaat agtatttttt tgataaaaaa 1800 

tattgttttt tggttttttt taagtttgaa atattttggg agtttatttt tagtaaagta 1860 

attttttata tttatttaaa aattatatat ataagtttag gtaaatagta gattaaatgt 1920 

aaaaatttaa ttttaatttt tgaaagtttt ttgaatttaa tttttagtta ttgtttttta 1980 

tqttatattt tgtagttaga tatatgtgaa tagaaaaaat agtatagtta gtgttaaagg 2040 

taaaatgtag agagttaggg ggttgtttgg tattgtttgt gagtttattt atggtttttt 2100 

gtttttttta tttttatttt atatggtgtt tttagagtta gttattaaaa atgaagtgaa 2160 

gaatttttta tgaataaatt aaaagttgtt aggtaaaaaa taaaaataaa aaagttttta 2220 

aagttttttt agttgttttg gggtttggga ggattagaga gggtgggttt gtttaggttt 2280 

ttgtgttagt gtgtgttagg ttaggtgttg gtttttggtg ggtaatggtt tgtaggttat 2340 

tggtttttat agataatatt atgttgtttt taggtttttg ttttattttg tttttattta 2400 

ggtgtttgtg ggttaaggtg ggatataggg aggaggtgat tggtatgggg ttatgttttt 2460 

gggatgggag tttttgtttt ttatgggtgt tgtgaagggt ggtgggggtt ggtagggagg 2520 

tgtattggga tttttatttt gtaggggtgt gtattttttt atggtgtttt ggtttagagg 2580 

agtggttttg ggtatatttt ggttggggta tttaggaggg ggtaggggag tttttttttt 2640 

ttgtagaatg agtggtatgt tggaggaggg tgggggtttg gagttgttta gtttgtttgt 2700 

aaagatgata ggtatttttt ttttgatgaa ggtggtttgg tttttaaggg taagtttgtt 2760 

tttttgtttt ttgtggtagt agattatggt aatgatgtta gtaatgagta ggatggttgt 2820 

gattattatg gttggaatga ttgtgtgtag gtagatatta tttttattgt tttttttagg 2880 

gtttttgtta ggtatttttg tgggtggtgt ttttgagggt atttttttgg gtggtattat 2940 

Igggataaat tgtaggtgtt gataattgtt agagtttgtt atagtgatgt ttgtggtttt 3000 

aaagttaggt tttagggtgt tggagaaggt aggttgaggt tttttattat ttttagtgat 3060 

ttggtggttt agtttagtga tttgtttttt ggggtagggt tttaagggta gtgtgttgtt 3120 

ggtttatttt attatgatgg agttttgggt gatattttgt agggtgatgg tgttatagtt 3180 

ttgqttttta aaggtgaagg ttagtttttt tattaaggta atttttttgt ggatgttatt 3240 

taatattagt gttgggttat ttataaattt ggttttgaat tttgtaggag ttttattttt 3300 

ttgggggtgt ttgtggatgt ggattttgaa ggtatttata gttgataggt tttttttgtt 3360 

tgtggtatgt atgaaatatt tgtgtttgtt tatgtggttg ttgttgggaa ggttatatat 3420 

gagttggttg ttgttgttga attgtattta ggattttttg tttattagtt gttgtttttg 3480 

tagttttagg gttagtttta gtttgttagt ggtggtgttt ttatggttat agaaagtgtt 3540 
tgatgggatt tttattttaa agtaggtgtt aatttaggta tttattttgt taatatggtt 

seite 436 



840 
900 
960 



3600 



tttgagtttt 
aatatgagta 

gggtattggt 

gttagttgaa 
gggaggggta 
ttggttagga 

tggggtttga 
atttttgatt 
tggtatgatt 
aggtgtgggt 
ggtgatgtgt 

aggggttttt 
ttaggagaga 
ggttgatatg 
tatttttgag 
ggtgaggttg 
agatgatatt 
gtttttaggg 
ggtttttagt 
gaggtttttt 



gggtgttggt 
ggtggtgagg 

tggggtgttt 

ggtgttgttg 

gtaattgtag 

attgtggtgt 

ataatggtgt 

ggaggagtta 

ttggatgggg 

gtagtatgga 

taatttatta 

atattatgaa 

agggttttat 

ttaaatagtt 

aagttttgta 

gtatttaaaa 

attttattag 

tagattttga 

tgtgtagtgt 

agggtgtggt 



tgggtttttt 

tagtttttaa 

gtggtttttt 

gttttggtgt 

taggttttat 

tagtttttga 

tttgagtttg 

tggtttttgt 

gtttttggat 

tttgtttttg 

tagggtagtt 

tgttaggtat 

tttttattat 

tgttatttat 

ttttgtgtag 

ttattgttaa 

ggtttgggga 

tggagaatat 

ttattgaaat 

tttgtgagtt 



seq_prot. 
gttatggggt 
tttggtgatg 
ggttggtttg 
ggatatttgt 
atagttagga 
tatttgagta 
gatggtgatt 
tggaggagta 
agttgtggtt 
gatgtgtttg 
aagttgagta 
attgttttgg 
tttttttgta 
tattggtatt 
gaggttaatt 
aatagttata 
ggttgttttt 
attggaggtt 
gtaatgtata 
ttagtgtagt 



txt 
attttattgg 
gaaattttgg 
tgagttgtgg 
ggttttttgg 
atggttattg 
ggttggatgg 
gtgggttttt 
atggttggag 

gggggtttaa 

ggaagagggg 
gatattgtgt 
tttagggagt 
ttttttgggt 
aattttatgt 
ttttgttttg 
ggtttatttg 
attgattgta 
tgggggatgt 
tttttattag 
taag 



tggtggtgtg 
tggtgatttg 
tggtggtgga 
tggtggttgt 

ttgggtgaat 
ggtttagggt 
taggaatagg 
atgtgggggt 
tggtagtgat 
gttttttatt 
tttttttggt 
agtttagttt 
tagttatgaa 
tgtgaagttt 
gggttatttt 

tagtataggt 
gtttattgtg 
ggtttttgtt 
tgttaagggg 



<210> 446 
<211> 4337 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 446 

tagtatgagg 
tttttttgat 
gaggaaaaga 
taattgtagt 
ttttattttt 
tatatttagg 
tagatagttt 
agaatttgtt 
tatttgtgga 
tagttttagg 
ttagtagtat 
gggttttgtg 
gagtttgtag 
ttttaagttg 
taagggggtt 
attattattt 
aatgttatta 
gagggtttga 
"ggtggttg 
tgttttttgt 
tttatggttg 
agttgaggtt 
tttgttaggg 
gtttttgggg 
agttatttga 
tgatgggttt 
tgttgttgtt 
gtttttggat 
aggaaggttg 
gatttgttgg 
ggagggagag 
gtagggatgg 
ttggtggggt 
ttgtttttga 
gggttttggt 
tttgattttt 
atggagagat 



aaatttagtt 
agtatttttg 
ggtgttttga 
aaatttatgg 
ttggattttt 
aaaagttgaa 
ttatgttttg 
aatggtttag 
gatgttagga 

g^tttttttt 
attaaatttt 
tggatagaat 
tgaatgttta 
ttagttttag 
gttaggagta 
gttttttaat 
gtagagggtt 
gtagtggggg 

ttggggntat 
gttgttagtg 
attttttttt 
gtggggttgg 
tttatttgat 
tgatttgggg 
gtttgagttg 
gaggagttgg 
tttgttgttg 
tagtttgttt 
ttggggatgn 
aggtatggga 
gagaggggag 
ggtttagttt 
ttgtttgttt 
gggtggtttg 
atttttagta 
tttttttgtt 
ttttaggttt 



gtaagaaagg 
ttgtgggagt 
ggaggttgat 
agtgatggtt 
aattatagga 
gtagtttttt 
tttgtttttt 
gttgtagttg 
gttaggagga 
ttttgttggg 
attattttat 
attaagtgga 
gttatgattt 
tgttagtatg 
agagtggagg 
tttttaggtg 
ttagaggagt 
tggggagttg 

tgattttggg 
tttgggaagg 
ttagttttgt 
tgttggtggg 
tttgtttttt 
gtggattgtg 
tgggatattg 
tttgttgttt 
ttgtttttgt 
ttgggtgagt 
gttttgtgtg 
tttattagga 
ttgtggggtt 
gttttgttgt 
atttatttgt 
gggtagaggg 
agttgtagta 
tttttatttt 
tgggagtgat 



agatttggga ttatgtgggt 
agatgtggta gattttgaaa 
tttggttggg ggtgaagagg 
tttattgtta tggaaataat 
ggaattatta tatttttttg 
taaaattatt tattttttga 
aatgttgttg ggatttatta 
gaaatttttg gttttagtag 
ttaggggtgg gtgtttttgg 
ttatgtgtag gttgtgaagg 
ttaagttaat tttgtttttt 
taagtttgta gggtgtttag 
gtagtggatt gatttgtgta 
agtttggttg agtttatttg 
gggtgtttaa agggaggaga 
tratagtgggg atgtagtgtg 
tggaaggggt tagggtttgg 
ggggttggtt tttgtttgtg 
tttggttttt gtggtagtgt 
aggaaggggg aagggggtgg 
tttttgttgt ttgtgggtga 
aggattgtgg tgttttgtgg 
attttttgtt ttttaggttt 
gggtggagtt gttgtttgtg 
ttgtttttgt tttttgaatg 
tatggggtgt gttgttgttt 
tgtagttgtt gtttttgatt 
gttggggatt tggggatgtg 
gaggtggttg tgataaagga 
tttttttttt ttttaaggtg 
gttggttttt ttttttagtt 
tttttttagt ttttttagga 
ttttgatgtt gattagagtt 
aggttattgt ttgtagtttt 
gggttgggag ttgtggtggg 
tggtttgtta gtgtagtttt 
tagattgtgg ^atgggg^tag 
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gttgtgttta 
gttgggagga 
agtttttttg 
tagtagaggt 
taattttgtg 
gattattgtt 
aattgatttt 
agttgaattt 
gtttttattt 
ttttattttt 
ttttatttga 
attaagtgtt 
aaaattggat 
gttgtaggaa 
ttaaggttta 
tttttaaatt 
gtggtggtgg 
ttttagtgtt 
tttgagttgt 
gttggttggg 
gttttgttta 
aggggttgag 
ggaggtgggg 
agtttggttg 
ttgtgtattg 
tggttattgt 

tgggtgttta 

ggttggggga 
ttaaggaggg 
ttgtgtggag 
ttttttttgt 
ggatgtattt 
gattgtggtg 
ataaatattg 
tgggtaggtg 
aagttttagt 
_tittggg±gi_ 



3660 
3720 
3780 
3840 
3900 
3960 

• w m^m w 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4784 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220— 



<210> 447 
<211> 4337 

<212> DNA 

<213> Artificial sequence 

^2 ^» 

<223> chemically treated genomic DNA (Homo sapiens) 
<400> 447 



seq_prot.txt 

tgatattttt aagttttggg agtgaataga ttgtggatgg gggtagtttt gggtgttgat 2280 

atttttaaqt tttgggtgtg attttttttt agagtgattt gggagtagta ttttggttgg 2340 

ttaaattttt tttttggttt ttatatttgt ttttgtggtt attttttgtt ttagtttttt 2400 

Stttttttt tttttttttt tttttatttg gttgtttttt tagttttttt tggggtattt 2460 

qqattttaqa tttagagggt tttggatttt aagtttattt tttttattgg gtaagtaagt 2520 

attttttgtt tttagtgttt tatttttttt atttgtaaaa taggtagttg taggtgtttt 2580 

gtaaggtgtg ttatttgggt atgaagttta taagtgttta atagatgtat ttgtaagttt 2640 

tttaatttgt tggttgttat attgtatgat ttggaataaa gtatgtatga attagaggtt 2700 

ataglttgaa attgttttag atttttattt ttttatagta tttttatttt taaaagtatt 2760 

ttgtttataa agggaaatag tagaattagt atttaaatta aggtagggtt ttagtggttt 2820 

SlgialS? iatttgaatt tttgtttgaa atattgtaga tgatttgggt ttaaatgtta 2880 

ttttaqttat ttagtaatta tgtgattttg agtaggttat ttaaagtttt tgggttttga 294U 

tttttttait tgtiaaatag gtataataat agaatttatt ttaaggtagt gaggattaaa 3000 

Sagtttg€g ittgaaaagt tgttaggata gggtttggta tgggaattgt tgaatgagtg 3060 

ttatttagtt tagagtttat ttttgtatat tttgtttaga tttggggtga ttttgtttgg 3120 

qgttaggata gggttggggt gttagtttga ttgtagtttt ttgggtgttt tagtttgtga 3180 

tgttaggatg aataggtgta Ittagtttta ggtttgttgt tttgttattt ttgttgtggt 3240 

tataattttt tttgttttta gattttttta tttttgggag aaggtagggt gagatggggg 3300 

tSSgttS aatggatttt atttatggtt ttgggaaagt tgggttagtt tatgggatta 3360 

tottaaaaga tggtiggttt tatgtttaat tttggaattg gatgggataa ttatagggta 3420 

gtSgttggt alaattggag agtagtgagg ggaattgagg ggatttgttt tttgaggagt 3480 

Igttagaata tttggttatt attattaatg tagttgtagg atttggttta tttttataag 3540 

tttatqattq tttttaggtt tttgttatgt attagtattt atttgttagg gttgttgtag 3600 

gtgggtttgt attttgtgta ttgtattaag gtgtttagtt gagggttgag aggatttgaa 3660 

atqtaqttta tatttagttt tttaatttaa gagttttgga gtaaaagatt ttttttgtta 3720 

tatttttaaa attaggtttg tttggagtgg tttatttttt tagttttatt aaagttttgg 3780 

aSaSgtag tgg?atttag ttatttgalt ttgtgagtat gtatttatag ttgattgagt 3840 

tgttittata ttttggagtg tttttttttt gggttttttt ttggttttta tatttatttt 3900 

Sat?tttat ttatgtgtag gttttagggt tggttgggag tttgagaggg gaatgttttg 3960 

gtaattgaat aggtttattt ttgagtgtgg tttaaggtgg ttttttgttt tattttaggg 4020 

ggtttattag atgtttttgg tagtgtgttt attttgggta ggtttgtggt gattatggga 4080 

tqttgggttt agggttttag tgtttttttt tgtatgtatt ttaggagaaa gagttatgga 4140 

attggltatt atltattttt tttttttttt gttatttatt tttttattga atttttattt 4200 

atgttttgta aggatttaga tgtttaggga tagtggggtt atgtttgtgg tttaaaataa 4260 

aattttttga agtggagtgt ttagtttgtt ttgtatttgg ggttttttat agggaatttt 4320 
aattattgtt gatgaat 



gtttattaat aataattagg gttttttatg aaggatttta gatgtagggt ^agttggatg 60 

ttttatttta agggattttg ttttaaatta tagatatagt tttattgttt ttaggtattt 120 

qoqtttttat ggggtatggg tgggaattta gtagaggagt gagtggtagg aggaaagagg 180 

ggtSgtaat Itttagtttt atgatttttt tttttaagat gtatgtaagg gagggtatta 240 

qqqttttggg tttggtgttt tatggttatt atgagtttgt ttaggatggg tatattgtta 300 

ilaaSitta aigagttttt tgggatgagg taggaaattg ttttaggtta tatttaaggg 360 

tqagtttgtt tagttattag ggtatttttt ttttggattt ttagttagtt ttaggatttg 420 

titgtaggta agagtaggag gtgggtatga gggttaggag gaagtttaga gagggggtat 480 

tttigggtat agtggtaatt tagttgatta tagatatgtg "tataagtt taggtagtta 540 

ggtgttitta tittatttta gggttttggt gggattggaa aaatgggtta "ttgggtag 600 

atttagtttt aagaatatag tagaaggggt tttttgtttt aggattttta ggttggagag 660 

ttgagtgtgg attgtatttt aggttttttt aatttttagt tgggtatttt ggtataatgt 720 

ataaaatgtl agtttattta tggtagtttt gataggtgaa tgttggtgtg tgatagaagt 780 

ttgggaatag ttatgggttt gtagaagtgg attaggtttt gtagttgtat tggtaataat 840 
aattiagtat tttagttgtt ttttgggaag taggtttttt tgattttttt tgttgttttt 
tagttttgtt ttttagttat tttgtgattg ttttatttag ««agagtt ggatatgaaa 

tttattottt tttgatatgg ttttatgagt tggtttagtt tttttaaggt tataagtgga 1020 

g£?Ktgga gS?ttgttl ttattttatt t!gttttttt ttaaaaatga gagggtttga 1080 

gggtaaaggg agttgtggtt ataatagaag tagtaagata g^ggtttgg |f«ggttat 1140 

Itttgtttat tttgatttta taagttgaag tatttaaggg g«atagtta aattaatatt 1200 

ttaattttgt tttaatttta ggtagaatta ttttagattt gggtaagatg tatagaagta 1260 
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900 
960 



aattttgagt 
ttttaaatat 
ttttatagat 
ttattaagtg 
ttaggtaggg 
tttttttttg 
ggatagtttt 
ataattaata 
tgagtaatat 
tattgggagt 
ttttgaattt 
aaaggagagg 
ttaggaaagg 
gtttaggatt 
aggatttggg 
tttggaggtt 
ggagagggag 
tgggaatatt 
gttgtttttg 
taggtggatt 
ttagatttta 
tttttttttt 
tgtgtttttg 
tttttggtag 
taggttgatt 
tagtaggagt 
gttttttagg 
tttgggttta 
tgagttgttt 
aagtgggttt 
gttttgtagt 
aggaagttgg 
tggtagtgtg 
gttttggtgg 
ttattgttta 
tttttgttag 
agggggtgga 
ttttggtgat 
gggttagtag 
gtatttatta 
ttgtttatgt 
atttaatgta 
gggaattttt 
taatattttt 
gattattagt 
aatatagaaa 
aatttttttt 
agtttaaaag 
taaatttgtt 
gagtattttt 
ggatgttgtt 
taagtttttt 



tgagtaatat 
gagttgattt 
gaagaaatta 
gttgaggtag 
ttatttgaag 
taaataggat 
agtttgtaat 
agttaaaggg 
attttgtaag 
agaaggtatt 
aaagtttaag 
gaagggtgga 
aatttagtta 
tgggaatgtt 
aatgttagtg 
tttttatgtt 
gattgggtgt 
agaattttag 

ggggtggtat 

ttgttgaaag 
tttttatgta 
tttttttttt 
ataggttttt 
tttttttttt 

tgggggttga 

agtagtagtg 
tttattgtgg 
ggtggtttgg 

tgggggtttt 

tggtaaattt 
tttggtttgg 
ttgtgagttt 
aggggtggta 
ttgttggagt 
gattttttta 
tggtgttggt 
tggtgattag 
ttttttgttt 
tttgagagtt 
taagtttagt 
gaagtttttg 
ttgttgagga 
aagattggag 
atgggtggag 
agattttgaa 
ttatttaagt 
aagtgtgtat 
agtgaaagtt 
gtaattgtag 
tttttttttt 
aagagggtgg 
tatgttg 



ttatttagtg 
aatttttatt 
agatttagaa 
tatttgaatt 
ttgttagggt 
gtttttagag 
ttttaattta 
tttgtggata 
gtatttatga 
tatttgttta 
tgttttaaga 
gggttgaaat 
attaggatgt 
agtgtttaga 
tttagagttg 
ggggtttggg 
ttgtttgttt 
tgtttatagg 
tgtggttggt 
tgtatttttt 
gaaagaaaat 
tgtgtagtgt 
tttttggttt 
tttggtttgt 
gtgtttaggt 
gtggttgagg 
tgtgtagtat 
ttggatttat 
tgtttttgga 
agtttttttg 
gtagagttta 
ggttggtttg 
gtttggggtg 
gttgggatgt 
ttgttgttta 
ttggggatat 
gttttggttt 
ttgtgattaa 
taatttttgt 
atttagttta 
ggtgggggaa 
ggtaaaattt 
tggaagttta 
tttagttttg 
attaatttag 
aataatttta 
aggattatag 
tttgttagtt 
gaaagttttt 
ttttgatttt 
gtatagtatt 



seq_prot. 
gtttttgtat 
attttgaggt 
attttaagta 
taggttattt 
tttgttttaa 
gtggggatgt 
tatatgtttt 
tatttgttgg 
ttgtttattt 
gtggaagaaa 
agagttggag 
aggaggtggt 
tatttttagg 
gttgttttta 
tttttatttg 
gttgtattgg 
attatagttt 
attgtaagta 
tttggttagt 
taagggaatt 
tgaagaaagg 
tttgaaaaaa 
tttgttatgg 
gtttttgggt 
tggaggtggt 
tggtagtgtg 
ttggagagta 
gggtggtagt 
tttggggggt 
tggggtattg 
tttgtaagtg 
tttttttttt 
ttgttgtagg 
ggatgaaagt 
ggttttggtt 
gttgtgtttt 
ttttttttta 
atgaatttgg 
ataggttaat 
ggtattttgt 
gggtaagatt 
ttataatttg 
gaggtattta 
ttgagattag 
tggattttaa 
aaggataagt 
aaaaatgtag 
atttttatgg 
ttttattttt 
tagggtttgt 
tatatggttt 



txt 

taggttttgt 
aggttttgtt 
gtttgtttaa 
gtagtatttt 
tttaagtgtt 
tgtgaggaga 
attttaagtt 
gtatttataa 
tgtagatgag 

tgggtttggg 

aagtaattgg 
tatagaagta 
ttattttgag 
tttgtagttt 
tagtttggtt 
taagttaggg 
ttggttttgt 
gtggtttttt 
attaaagata 
aggggggatg 
ggttagtggt 
gaaggggttt 
ttatttttgt 
ttttggtatt 
ggttgtagta 
ttttatgggg 

ggagtgatgg 
tttgttttgt 
gggaggtggg 
tagttttttt 
gtggagggtg 
tttttttttt 
ggttaagttt 
tgatttttag 
ttttttggtt 
tgttatgtgt 

ggtgtttttt 

ttgagtttgt 
ttattgtaag 
gggtttgttt 
agtttaagtg 
tatgtgattt 
tttttaattt 
gggtttttaa 

tagtattaag 
aattttggga 
tagttttttt 
tagtagagat 
agttaaaatt 
tatatttgtt 
taggtttttt 



tttaatgatt 
attatatttg 
ggttatatgg 
agataggggt 
gattttatta 
tggaagtttg 
atataatatg 

attttatatt 
aaaggtgggg 

atttaggatt 
atggaaggga 
gatgtgagga 
gaggggttgt 
gtttgttttt 
gtttttaggg 
atggaggggt 
tgtgatttgt 
tttattttga 
agtagatggg 
atagagtggg 
tttatagttt 
taatggattt 
atgaggtatg 
tatttagagg 
ggagtagtag 
tggtgagtta 
tgttttgtgg 
ggtttgtttt 
aaatagggtt 
gttggtgttg 
gggttgaagg 
ttttgggtgt 
gggattagta 
ttttttgttt 
tttttgggat 
ttagggagtt 
ttgtttttgt 
gttgatattg 
ttatggttgg 
gtttaatatt 
ggataatagg 
ggtaggggaa 
ttttggtttt 
ttgtagtttg 
aaataaataa 
aagttatttt 
tgtagttaag 
tattgtttta 
agttttttta 
tttataatag 
tttttatagt 



<210> 448 
<211> 4388 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 448 

agttatgagt ggattagatg 
aaatatatta gttttgattt 
gtagttgggt gtttttattt 
tgttagtatt agtaagttta 
ttggttgatt ttattggtta 



1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4337 



ttaaaatttt 
tgagttggga 
gttgaaagga 
attagttaat 
tgtttggtag 



tggttgagag aataatttta ttttggaqqa 
attttggtga tgttgtagat ttaatgtatt 
attgttgaat ttttaaattt attaattgtt 
attaagtaaa tttgtattaa atatataatt 

&£K^agtag_tagggagaaa^gtMt^t-^a- 
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60 
120 
180 
240 
300 



360 
420 
480 
540 
600 
660 
720 
780 
840 



seq_prot.txt 

gtttttgttg gaattgtgtg aaaagttgga gaggttgttt tttttttttt tttttttttt 
tttttttttt ttttttgata tatatatata tatatatata tatataattg taataataat 
aatatrtttgg ttttttatta agttaattta aattgtagaa gtttattttg ttaagtaagt 
ttqqttttag ttatatgttg ttatttagag aatttaatat tagattttat ttgattgtaa 
atgtgaatta ttaggttgta taaggaatat agtggtagat ttagggggta tttaattttt 
ataqtatttt aaatgaaaaa aaaaaaaagt aatatttaaa taagtaattt agattatgtt 
gtaggttttg aatagttttg atgttgtgtt tttatggtaa gtttttaatt tgagttgaat 
ttttttttat taaaaatgta aattttttta agattttttt ttggttttgt tatttatgat 
atatattttt tttaaaagaa attttaaatt agataatagt tgtttttttt tttatttttg 
ttattagtag tgtggtgggg tagtagagtt gtggttagtg gaggagagta gaggaggaga 900 
gttgggaaag gigaatgtta tttgtttata tttttttttg tttatttttt gttgtttatt 960 

tttttttttg tttttttgaa tgtgaattga gttttgggta ttgttttagg tttagtaggg 1020 

gataggataa agtttgtttt tttaggaatt tgtattgagg gtgtgagtgt gtgtatgtgt 1080 

qtqtttqgag gtgggagaat aaatataaat aaataaaaag gagaatttta ggtagtgata 1140 

agagtgttga gaaaaataga atggtgtgaa agaggaaggt tgagtttgta gaggtttgag 1200 

gttgttgtta ttgggtagtg gtaggttttt ttgaggaggt ggtatttgaa gattggagga 1260 

aggtttattt tagtaagtag gaatagtaag tgtaggtttt ttaagttttg ggggagttta 1320 

gtttrtttaa gggtagtata aaaattagtg tggttttgga gagtatatta ggggagagag 1380 

gtaggaagag tttggagata tggatggaag ttggattagt tgggttttgt tgaatatgga 1440 

aaggtattta gattgtattt tgagttaaat gggaagtgat gtgagagatt taataatgga 1500 

qtattttgaa ttgttttatt tatttaaaat atttattttt gtttggttga atattttatg 1560 

ttqttttttt agaagttttg gtgattttat ttgaatgtat ttaggtttta ttggagggga 1620 

atiggatttt atttgaggtt atggaggttt atggaagtta tttgtatagt aaatattttg 1680 

aaagtggttg tagggagagt gtgagggtgg gattgttttg gtaggaggtg gaaaatgaaa 1740 

aatatatggt tatgagtttt agattagggt ttttgaaagt ttttagtttt tttagttttt 1800 

attttaaagt gggtttttaa ataggaagaa agaaagattg ttaagtgttt ttggagtttt 1860 

tttttttttt ttttttaggg attttagtat ttttggggtt tgggttggtt ttaaagtagt 1920 

tttttttgtg tttttttatt tatagtaata aaggtatgga gtatttgtat agtatgaagt 1980 

gtaagaatgt ggtgtttttt tatgltttgt tgttggagat gttggatgtt tattgtttat 2040 

itgtgtttat tagttgtgga ggggtatttg tggaggagat ggattaaagt ^atttggtta 2100 

ttqtgggttt tattttattg tattttttgt aaaagtatta tattatgggg gaggtagagg 2160 

qtttttttgt tatggtttga gagttttttg gtttttatat ggtttagata atttttgttg 2220 

tattttattt ttattatgta ttlttttagt taaattttgt tttttgtata tattttggta 2280 

tgtatttaat attaatggtt ttttagatga gtggttattt atttgtttgt ttagttttta 2340 

gtggtatatt ttttgttttt tgttgggaat agttaaaggg attttaaggt taaatttttg 2400 

taatagtttt tttttttttt tgttatgtta ttaagtgtga ggatttttgt agttttttat 2460 

agttgaattt agtttatggg ttggggttta gataattttg tgtatttaag «atttgtag 2520 

agatttaggt ttggagagta gatattttgt ttttgataag tattttttaa atggttttaa 2580 

gaataagtta tagtaaagaa tttaaagtgg tttttttaat tggtgatttg gagaaagtta 2640 

ggttaagggt ttattatagt atttttttgt atttttatgg taatgtattt ttttatgaaa 2700 

gtggtatatt ttaaagtttt tatatgattg tagtagagta tttggtgatt gttaatttat 2760 

tttttttata ggaatataag gggtatatag ggaaggtaga ttttttagtt ggtaagatta 2820 

ttttaatttg atatattgta gatttagatg tgttgaaagt tttgtttttg gttttttggt 2880 

tatqqqtttt agttaattta tgttttttat ggatttatgg agagtagtaa gttgatttta 2940 

gttaigtttt tttatatgag ggataagttt ttgatttttg tttttatttt tgtgttataa 3000 

iagaaagttt tttttttttg aatttgtagt aaggttagtt ttaggatttg ttttagtggg 3060 

tattgtiittt ggattttttt ggtgtgtgtg tgttttatat aggggtgaat tgtttattgt 3120 

ggtgatgtat gatgagggta aatggtagtt gaaaggagta ggggttttgg tgttgtattt 3180 

igttttgggg tatgglgttg aatagtattt gtgtaggatt gttgtggtta ttagagaata 3240 

agagggaaag tagggtagaa attggatata gttttgaggt atagttagat ttgtttaggg 3300 

tqqttttgtt ataggttgta gttatttagg aatatttttt gtagattttg tattgttttt 3360 

tqqqqqtqtt ttgggatttt tggggtagtt tagttttttt ttatttttta gtgtggtttt 3420 

ggttggaaga agtagttgtt atagttgttg tagatagttg tgtttttata attggtttag 3480 

tlttttgggg tatgggagaa gggtggggat tgttgttgtt attatttagg ttgattgggg 3540 

tttggttaga ttatgtatgt ttttggtggt ttagagataa tttaaaatta gggtttggtt 3600 

tggggaagaa aatttttttt tttttttttt tgttttgttt tttattgttt ttatttttgt 3660 

tagtttattt tttttaattt tttttgattt ataggttaaa aaagaaaggt ttattttagt 3720 
tatagggtag ttttttttgg gtttttgttt ttttagtata attatgggtt attttttttt 3780 
ttttlataaa aaagaatgtt tgattttttt tgggtgattt tattgtttgt aattgaaatt 3840 
ttattgagag gtgatgtttg tgttagttaa tgatttaggt gagttgtttg ggtttttttt 3900 
ggtatgtttt gtttggaaaa gtggatttta tttatttttg attgtttagt taagtgatta 3960 
ttaaaqqatt qaqaatttgg gagggtaaaa aaaaaaaaaa agtttttatg tgtatttaaa 4020 
tttggagata attttatgta tttgtgttaa ggatatgttt aagaatataa tttttttgtt 4080 
gttgtttgtt taagaagtat tttagtttgt ttaagaagta ttttatatag tataatatat 4140 
atttttttga aattatattg tttgtttatt agataattga atgtagtaat tttgttttgg 4200 
atttaatttg attgggttaa tatgtaaaaa ttaaggaaaa atatttagtt tttttttttt 4260 
tttttgtata ttttttaagt tattttgtta tgtatatagt tatttatgtt taaagtttgg 4320 
tgattattta tttaaatgaa gattatattt tatattaatt tttgtattta tagtagataa 4380 
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aatagtat seq_prot.txt 

<210> 449 
<211> 4388 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> chemically treated genomic dna (Homo sapiens) 
<400> 449 



4388 



gtgttatttt 
aataattatt 
tataaaaaaa 
aattaaattt 



aaaaaaatat 
taaatagtaa 
tttttaaatt 
ttttttggtg 
gatatattaa 
tttaataggg 
tgttaagaaa 
gttttgtggt 
atgagttggt 
tttttttaaa 
tgattaggtt 
ttagggtgtt 
ttttaattag 
tatttttaaa 
tagggttatt 
tttttttttg 
ttagggttaa 
gtattattat 
gtatagtgtt 
gttttttttt 
gatttaatta 
aatttatgat 
agttaaaata 
tagggggaat 
tgtattattt 
ttttgattta 
gtttattttt 
ttgggttttt 
gtttagttgt 
agttgttata 
tgtgttatta 
tggatgtatg 
gtaaaatgta 
agggaaattt 
gtttgtagtg 

gggtgtatgt 

gtttttgtat 
tagaaaggat 
aaggaagagg 
tttaggatgg 
tgtgtgtttt 
gttattttta 
gattttattt 
aaagataata 
taagatgttt 
aatgtttttt 
ttttttgttt 
aaaggaatta 
atgaattttt 
gtagtagttt 
agtattttta 



gtttattgtg 
aggttttagg 
aaaaaaaaaa 
agaatagaat 
atattatatt 
taaaagaatt 
taagtgtata 
attatttaat 
gagaagtttg 
ttttaattat 
aaggaagtaa 
tggaatgagt 
aggagtggag 
ttaaattttg 
ttagttagtt 
gggttaattg 
ggttatgttg 
gggtaatgtg 
ttgagtaagt 
ttttttagta 
atgtaatatt 
agtgaatagt 
tattggaata 
gtaatataaa 
agattaattt 
tggaaagtta 
gttttgttaa 
gaattgataa 
ttataaaagg 
gtttttttta 
agagttattt 
ataagtagtt 
gaagagttat 
aagatttagt 
agaattgagt 
ttggagtgta 
gtagggatta 
tttgtttttt 
gttaagtggt 
aggtggtggg 
tttatgttgt 
tattttagag 
aattttaaag 
gagttgggaa 
ttatttttta 

gggtatttgt 
ttttttaata 
tgagatattt 
tattgttaaa 
tatgtttaat 
ttttttttta 
agttttttta 
ttttggtttt 
taagtttttg 
ttattgtttg 
atgtgtatat 



gatataaaag 
tataaatgat 
ttaaatattt 
tattatattt 
atataaggtg 
atgtttttaa 
taaaaatttt 
tggataatta 
agtagtttat 
agataataag 
tttataattg 
tttttttttt 
gtggtaggga 
attttggatt 
tgagtagtga 
taggaatata 
ggaaatgaag 
gggtttgtaa 
ttggttgtgt 
gttataataa 
agggtttttg 
ttatttttgt 
ggttttgaag 



gttgtttttt 
gaggtagagt 
ttaggggatt 
ttattagata 
atgtattgtt 
agttattaat 
aaaaagtgtt 
taaatgtata 
gggaattttt 
tttggaattt 
aagtaaatga 
tgtaggagat 
tttgaattgt 
ttgtgatgta 
tttggtttgt 
tgtttagtat 
atagatgttt 
ttaatttgag 
atatttagta 
agttgagggt 
ttttttatta 
tatgtaggtg 
ggatttaaat 
agttaagtag 
ttttttatgt 
aaggtttaat 
atgtgttttt 
agatttaggg 
taaatgttgt 
taggtttagt 



atttatattt 



ttgatatgaa atgtgatttt tatttaaatg 
tgtatatatg ataaggtagt ttgaaaagta 
ttttttggtt tttgtatgtt aatttagtta 
aattgtttga taaataagta atgtaatttt 
ttttttaaat aaattaaggt gttttttaaa 
atatattttt aatatagata tataaaattq 
tttttttttt ttgttttttt agatttttag 
gaaatgaatg aaatttattt ttttaaataa 
ttgggttatt ggttaatata gatattattt 
gttatttaga ggaaattaaa tatttttttt 
tgttagagaa gtaaaggttt agggaaggtt 
tagtttatag gttaaaggaa attgaaggaa 
atggggtggg ggaggaaggg ggaggatttt 
atttttaaat tattaagggt atatgtaatt 
tagtaatggt ttttattttt tttttgtgtt 
gttgtttata gtagttgtga tagttgtttt 
aagagttgga ttattttagg gattttaggg 
ggaatgtttt taggtagttg tagtttgtgg 
tttagaattg tatttagttt ttgttttatt 
ttttgtataa gtattgttta gttttatgtt 
ttttttttaa ttattattta tttttattat 
gtaaggtata tatatgttgg gaagatttaa 
ttgattttat tgtaagttta gggagggagg 
aaaattagga atttattttt tatataggga 
ataggtttat gggaggtatg aattaattgg 
ttttagtata tttgaatttg tagtgtatta 
tgtttttttt gtgtgttttt tgtattttta 
ttttgttata gttatataaa agttttaagg 
ataggaatat aagagggtgt tataataaat 
taaaggagtt attttaaatt ttttgttgtg 
tattagaggt aaaatgttta ttttttaggt 
gagttatttg agttttaatt tatagattga 
atgtttagta atatagtaag ggggaaagag 
ttttggttgt ttttaataga agatagaaga 
atggttattt atttagaaag ttattggtgt 
agaatttggt taaagtggtg tatgatgagg 
gtgggagtta gggagttttt agattgtggt 
atatttttgt aaggaatgtg atgaagtaga 
tttttttatg gatgtttttt tatggttagt 
ttttagtagt aggttataga ggggtattat 
tatgtttttg ttattgtagg tgggaagata 
ttttaggagt gttgaaattt ttagaagggg 
attttttttt ttttttgttt aaagatttat 
ttttagaagt tttaatttgg aatttatgat 
gggtggtttt atttttatat tttttttgta 
atttttatgg atttttgtgg ttttaaatga 
gtatttaaat aggattgtta aagtttttaa 
gagtgggtat tttaaatgag taaagtagtt 
tattttttat ttaatttaga atatgattta 
tggtttagtt tttatttatg tttttaagtt 
tggagttata ttgatttttg tgttgttttt 
gatttatatt tgttgttttt atttgttggg 
ttttttggaa aggtttattg ttatttagtg 
tttttttttt tatattgttt tgtttttttt 
ttttatttat ttgtgtttat ttttttgttt 

tt^tstgaa Jt_t_tttagaga_agt:aggtttt 
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60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 

3300 

3360— 



attttgtttt 
aggggggaaa 
ttttttattt 
attggtggtg 
aaatatgtat 
aggggaaaat 
gggtttatag 
aatgttataa 
atttatgttt 
ggggttaagt 
aaaatattat 
agagagaaag 
atagaaattt 
ttagttaggg 
tgttgataag 
ttaattgtaa 
atatgttttt 
ttatggtt 



ttgttaggtt 
tgggtagtgg 
tttttttttg 

ggggtgggga 

tatgagtagt 
ttagtttaag 
tatgatttaa 
aagttaaatg 
atagttagat 
ttatttaata 
tattattatia 
agagagagag 
tatagagttt 
ttgtatatrt 
tagttaatgg 
tgtattgaat 
ttttagggta 



taaaatagtg 
gaaatgggta 
ttttttttta 
ggagaataat 
aggattaaga 
ttggagattt 
attatttgtt 
ttttttggat 
gaaatttgat 
aaatagattt 
attgtgtgtg 
agaaagagaa 
tttttttgtt 
agtgtaaatt 
atttgaaaat 
ttataatatt 
aggttatrttt 



seq_prot. 
tttagagttt 
gagggaaatg 
ttagttataa 
tattatttag 
gaaggtttta 
gttatgaaaa 
taagtgttat 
ttgttattgt 
attaaatttt 
ttgtgattta 
tgtrgtatgtg 
gtaatttttt 
gttgtttttt 
tatttagtat 
ttagtaattt 
attgaaattt 
tttagttagg 



txt 
agtttatgtt 
taggtaaatg 
ttttgttgtt 
tttgaaattt 
gaaaaarttg 
tataatatta 
tttttttttt 
gttrtttgtg 
ttgagtagta 
gattggttta 
tgtgtgtgtg 
tagtttttta 
gttagatgtg 
taattaatta 
tttttaataa 
ttagtttaga 
gattttggta 



taggaaaata 
gtattttttt 
ttattatatt 
tttttaaaag 
tatttttagt 
aagttattta 
ttttatttaa 
taatttgatg 
atatatggtt 
gtgaggaatt 
ttagagagag 
tatgatttta 
gttagtaaaa 
ggtttgttaa 
ataaaaatat 
gttaaagtta 
tttaatttat 



<210> 450 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for EGR4 
<400> 450 

agggggattg agtgttaagt 



<210> 451 
<211> 20 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Detection primer for EGR4 
<400> 451 

cccaaacata aacacaaaat 

<210> 452 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for APC 
<400> 452 

txaactacca tcaacrtcct ta 

<210> 453 
<211> 25 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection pnmer for APC 
<400> 453 

aatttatttt tagtgttgta gtggg 



3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4388 



20 



20 



22 



25 



<210> 454 
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<211> 20 seq.prot.txt 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for CDKN2A 
<400> 454 

ggggttggtt ggttattaga 

<210> 455 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for CDKN2A 
<400> 455 

aaccctctac ccacctaaat 

<210> 456 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for CSPG2 
<400> 456 

ggataggagt tgggattaag at 

<210> 457 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for CSPG2 
<400> 457 

aaatcttttt caacaccaaa at 

<210> 458 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for ERBB2 
<400> 458 

ggagggggta gagttattag tt 

<210> 459 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for ERBB2 

<4oo> 459 , . ; . . ,'; ; .; 
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seq_prot.txt 

tatacttcct caaacaaccc tc 

<210> 460 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for STMN1 
<400> 460 

gagtttgtat ttaagttgag tggtt 

<210> 461 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for STMNl 
<400> 461 

aacaaaacaa taccccttct aa 

<210> 462 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for STMNl 
<400> 462 

gaaaggtagg gaaggatttt t 

<210> 463 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for STMNl 
<400> 463 

cctcttacta acctcaacca ac 

<210> 464 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for STKll 
<400> 464 

naaaagaagg atttttgatt gg 

<210> 465 
<213> 21 
<212> DNA 

<213> Artificial Sequence 
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<220> seq_prot.txt 
<223> Detection primer for STKll 

<400> 465 

catcttattt acctccctcc c 

<210> 466 
<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for CA9 
<400> 466 

gggaagtagg ttagggttag tt 

<210> 467 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for CA9 
<400> 467 

aaatcctcct ctccaaataa at 

<210> 468 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for PAX6 
<400> 468 

ggaggggaga gggttatg 

<210> 469 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for PAX6 
<400> 469 

tactatacac accccaaaac aa 

<210> 470 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for SFN 
<400> 470 

gaagagagga gagggaggta 

<210> 471 
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seq_prot.txt 

<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection pnmer for SFN 
<400> 471 

ctatccaaca aacccaaca 

<210> 472 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> . _ _ 

<223> Detection primer for S100A2 

<400> 472 

gtttttaagt tggagaagag ga 

<210> 473 
<2U> 22 
<212> DNA 

<213> Artificial sequence 

<220> r ^ 

<223> Detection pnmer for S100A2 

<400> 473 

acctataaat cacaacccac tc 

<210> 474 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<223> Detection primer for TFFl 
<400> 474 

ttggtgatgt tgattagagt tt 

<210> 475 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for TFFl 
<400> 475 

taaaacacct tacattttcc ct 

<210> 476 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for TGFBR2 
<400> 476 
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seq_prot.txt 

gtaatttgaa gaaagttgag gg 

<210> 477 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for TGFBR2 
<400> 477 

ccaacaacta aacaaaacct ct 

<210> 478 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for TP53 
<400> 478 

ttgatgagaa gaaaggattt agt 

<210> 479 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for TP53 
<400> 479 

tcaaattcaa tcaaaaactt acc 

<210> 480 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for TP73 
<400> 480 

agtaaatagt gggtgagtta tgaa 

<210> 481 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for TP73 
<400> 481 

gaaaaacctc taaaaactac tctcc 

<210> 482 
<211> 22 
<212> DNA 

<213> Artificial sequence 
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tatatttttt 
tttttgtttt 
tattttatgg 
ttattatttt 
tatttatggt 
aattagaaat 
tattttttat 
aaataggagg 
tttaattatt 
ttagtagatt 
gtgtttatat 
tttttggatt 
ttttatttat 
gtgtagtggt 
tgttttagtt 
tgtattttta 
tttaggtgat 
gtttgattta 
taatattttt 
tttatttgag 
tgggatgagt 
tgattgtgta 
tataggattt 
gtgggagtga 
ggtgttgtgg 
tggggttttt 
tggggttttg 
ttgttagaat 
tagggttatg 
ttgaggttta 
tgtttttggg 
gaatttttgt 
aggtatgagg 
tgtttaaaat 
atgttttttt 
aattaaaatt 
aagttgtatg 
gttttttttt 
atatataaat 
gtttagatat 
attaatgttt 
ttatttaatt 
tttttttaag 
ttttgattta 
atagatgtag 
tttatttttt 
ttatttatta 
ttatttattt 
ttttatataa 
ggtgtagtat 
ttattagagg 
ggtttttttt 
gttaattttt 
gtatatatat 
tgttgttttg 
tagtttattt 
ttagtaagtt 
agggtatttt 
tttgtagatg 
agttgttata 
aaagtaatta 
gtgtttaggg 
gtggggttta 
ataggaaata 
tttagtatgt 
tagatgatta 
tttatttaaa 



gtttttttat 



attgttttga 
ttatttttag 
gtataatgta 
aaatatatat 
gaatagtaaa 
tttatagtta 
ttagatatga 
ggtagtttaa 
gtgagttaga 
atttttttgg 
agttaggatt 
aatttttttt 
ttatttattt 
ttgattttgg 
ttttgaatag 
gtaaagataa 
ttatttgttt 
gatttgtttt 
tgagattgtt 
ttttgatttt 
tttttaggta 
gattattttt 
aaaatttatg 
ggtattggat 
gttgaggttt 
ttggggtttt 

gggattttta 
tggatttttg 
agttggagtt 
gagagtttga 
ttttgattgt 
tggattggtt 
taaaggtaaa 
ttttgattta 
taaaattttt 
ttttatgaat 
ggatttgtat 
gggaaaagta 
tttttttttg 
ttggttgaat 
aatttagtag 
agttgaagat 
attgtaattt 
aggttataat 
gatttgtagt 
ttttgttata 
tttattattt 
atttatttat 
ttttgattaa 
atattttttt 
atgggattgg 
aagggttttt 
ttattaatat 
ataataaaaa 
tttttttggg 
gtgtttgttt 
ttagtattat 
atagtagtat 
aatattttta 
aaagttttaa 
tgttttgaat 
taatagatag 
tgttttatat 
tttttttgta 
ttgttttatt 
atgatgggaa 
ttttagttta 
ggtaattgtg 



gattttttta 
atttatattt 



ttgaatgaga 
tttttgttgt 
gtgttttttt 
gaagtaaatt 
gttatagaga 
tgaatttgtt 
tttagttatt 
ttagtttatt 
aatttttggg 
ttttaaatgt 
attttttgag 
tttattataa 
ttgggattat 
ggttttatta 
ttgtttttta 
ttaattttat 
ttggttaatg 
agtgagtagt 
ggggtatagt 
ggttaatatg 
aggggatgtg 
ttagtttatt 
tggtttttaa 
ggtgggattg 
tttaatattt 
gggtgatatt 
gttggggttg 
gattttgagg 
tttgggtttt 
gtaattttgt 
gatttaattg 
tttatgtaag 
ttttttttgt 
atttaagttg 
gagttatgga 
taaaatgtta 
tgatggtgaa 
agtagtttaa 
attttgagta 
tttaagaata 
agaaattttg 
atttattttt 
ttttataatt 
tttatagatt 
atttatatta 
ttattattta 
taaatttatt 
tgttataggt 
tttatgattg 
aaatataaat 
tatttattta 
atgaaagtta 
gtagggtgtt 
ttataatatg 
tttagggagt 
tttttaaatt 
aaaaatataa 
aagtagattt 
agaattgttt 
tattgattag 
tatttgtatt 
ttttagtttt 
aattgtaatt 
gttgaagaag 
gagggagatt 
aaattagttt 



seq_prot.txt 
ttaataatat ttagagaaag 
ttaaaatata agatgttttt 
ttgagaaata tgtatttttt 
ttagaattgt ttttagtaag 
ttgatatgaa gattagtatg 
atgatatgta taataataag 
ttatatgaat tagggaataa 
tatattttaa gattgattta 
taggatattg ggtggtggtt 
tttttgttta gttttgtata 
gattttagat gttttgattt 
tttttatggg ttttttgatt 
gtggagtttt attttgttat 
tttttgtttt tttggtttaa 
aggtgtgtgt tattatattt 
tgttggttag gttggttttg 
aagtgttggg ttgatttttt 
taaatagtta ggtgtagatt 
tgttttatgt tgtaattggt 
attgttttga gggttatttt 
tattttttat ttttatgtaa 
atataaaagg aggaggttat 
gtggaggagg gttgttgttt 
tggttttgaa gtttttgttt 
tggtgggatt ggtgtttttg 
gtgtttttag gatgagattt 
gtgattattt tatgagagga 
tgggagttat tggagtgatt 
aaatttgggg tttttgaagt 
gttgatattt ggagagatgg 
ttttattttt atttttggga 
tttggagtgg gtttataggg 
atataggtta tgatagaggt 
ttttgtttta atttgttttt 
aatttatata aatagtttat 
tttagttttt aggagaggtt 
aagatgtgat atttaatttt 
aattgtaaag attttgttgt 
ttttatatgt taatttattt 
atgttgttgt gaaggtgttt 
aattggattt ttttttataa 
tagattgagg ttttttaaag 
tttgagtttt tagtttgttg 
atttgaattt ttattttgtt 
atgtgagtta attttttaat 
tatttgttat atttatttat 
tttattattt ttatttattt 
tttaatatat ttattttatt 
aaattttaag aagttttttt 
gagaagtgtt aaggttgtaa 
tatttttgtt ttttgattat 
ttattagatt tatttttgtg 
tatattaatt atatatgtgt 
tatttgtgtt atatttttgg 
tgggtttgtt aaatattgag 
ttgttttata gtaatttttg 
tgttaatgtt tagaagaatt 
atttgtgttg aaggattttt 
aatatgtatt atgatattgt 
ttattttttt attatttttt 
taagtgttta atatagagtg 
ggaagattgt gtaggaggat 
tttaggtttg ttatgaatga 
ttttatttaa gtatatttat 
attttataat ttaggttatt 
ataggttatg ggtttttagt 
ttttttttta ttttgtttaa 
atatatttaa ttattttata 



gtttttattg 
ttatattttt 
tattataatt 
tagttttttg 
ataggagaat 
tttatgttat 
aagaaaatgt 
taatggtaag 
gattttatag 
atttttttaa 
aatgttttta 
ttttttttat 
ttaggttgga 
gtaatttttt 
agttaatttt 
aatttttgat 
tatttggttt 
ttttttttgg 
ttttaaatat 
gttttatgaa 
tattttatga 
attttttaaa 
atttgggggt 
ttggaatttg 
agatgaaatt 
agggtttttt 

gtggttttgg 
tttgaagatt 
ggtatggaga 
ggtitgagggt 
tgttttgtta 
tttgaaggtt 
gttgattttg 
tggagtatta 
ttttttaagt 
tagtaaatta 
gtattgaaat 
aataaagtgt 
gagttatggt 
tttagttggg 
tgtttgtgtt 
aatatgaaat 
ttttgtggat 
agtaaatttt 
gtaaattttt 
gtattattta 
atttatttat 
ggttttgttt 
atttttttta 
tttgttgttt 
gtatgtatag 
tttttgtatt 
atgtatatat 
aaatgatagt 
tatgtggtaa 
tgtttttatt 
ttattttttt 
ttatttttag 
agaaatgtta 
tattgattag 
gttgaggatg 
gtatattttg 
agtttttaaa 
tttattttta 
ttgatttttt 
ttatttaaag 
atatattatg 
tttgattatt 



1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 



seite 410 



tatttttatg 
aggttgagat 
tttgatatag 
attgttaaaa 
gtttttaaag 
ttttttgaga 
tttattgtaa 
aagttttttt 
gaaatgattt 
taaaggtttt 
aataataatg 
taagtgagaa 



tattatgaat 

tggaaatttt 

ttttatatat 

agtatagaaa 

aaaaaatgtg 

tagagtttta 

tttttgtttt 

tataggaaaa 

ttgtaattat 

tatttgttga 

tgttaaaata 

tatgtagta 



agagaaggtg 
atatttaata 
gtgtataagg 
tattgaggtt 

agggttaagt 
tttttgttgt 
ttgggtttaa 
aaggaatttg 
aaataaatta 
tgtaattaaa 
tatttatgtt 



seq_prot . 
gaattaaatt 
ttgttgtaat 
agttaatgtt 
taaaatttta 
tttatttttt 
ttaggttgga 
gagatttttt 
attgtttttt 
ttttttaatt 
tggttaaata 
tatgtgtatt 



txt 

tttattttat 
tttttttttg 
atttttgata 
ttttatttta 
attttatttt 
gtgtaatggt 
tgtttaagtt 
tatattaaat 
gtgttagata 
ttatatagaa 
taatgattag 



aataagaagg 
gatatattag 
taatttttat 
atagtgtaat 
atttatttat 
gtaattttgg 
ttttggaatt 
tagtgtaaat 
tattaatttt 
tttaagtggg 
ttgttagtta 



<210> 426 
<211> 4441 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> chemically treated genomic DNA (Homo sapiens) 



<400> 426 

ttagtataag 
agtattttgg 
tataatttag 
aataaaatat 
atattttatg 
aatagtttag 
ttgatttttg 
ttgtggttag 
taaatttgtt 
tagttttata 
aaattttatg 
atgtgtgttt 
gaaatttttt 
tgtgtgtgtg 
tttttgtggg 
tatttaggag 
tataatgtag 
aaggaagaga 
aattaaattt 

ggggtttggt 

tgttttatga 
attggaaatg 
ttaaataagt 
aattttagag 
aagagggggt 
aaatgtttgg 
ttgttttaaa 
tgttagtttt 
gtgttggtat 
tatattattt 
gtttaagtat 
gggtttattt 
ttaaaattga 
ttggtgagta 
attagttgtt 
ataatatttt 
attgttattt 
ttggtaagag 
tttaggagga 
taatgtttga 
gattatttat 
gtgggggagg 
ttattttttt 
gggatatagt 



gtttttgtta agtttatggt 
tttttggata gtatagaaaa 
gaggtttgtt agtttgattg 
atttgttggt ttttatgttg 
ttagaatgag aagtgaatta 
tgattttttg gaatttaaat 
gtttgttttt tttttgtttg 
ggttttatag agaattttag 
gtttaaatat gtgaagttta 
tgttttattt tttttataaa 
atttgatgtt gttgtggtta 
atttttagga tttatagttt 
gtaaagggtt tgaagtatgt 
tgtgtgtgta tgtgtgtgtg 
gttttttttt tttatttgtt 
tttttttata gttgaagtgt 
ataagtttta aggtatgttt 
ggtataaatt gtgttggtta 
attaaagtat gaagattgtt 
ttgttgtagg agtaaatgtt 
gataatgtgt agtatttgat 
tttattgttt ttgttttatt 
aaaaggaaag agaattttgt 
ttataaggaa ttagaaaaag 
tttgtaaaaa tatatatatt 
ttgatgtgga ttagtattga 
atgtgtttta ataaatataa 
gtttttgttt ttatattgtt 
tagtattaat ttgatttaaa 
gtttgtgttt tggttttttt 
ttggtaagta ttttaaaaat 
atattttaag aaaggatttt 
tgaatggtat gttgttggga 
agtgtgtatt gttttgatat 
atgatgttaa gttttaagtt 
gattttttaa ttatggttta 
ttttttgagg taagtataat 
tagtttatat agtataatga 
aattgaagag atatttttat 
tttttatttt agtagtagtt 
tttgtattgt tatggttttt 
ggagggtggg gtggggagaa 
tgtaattttt ttatagtttt 
ttaataatgt aatttttaat 
£t£gataagt- aagagtgggt— 



ttgaaaattt tgtttttttt 
attaaatagt tagaaattga 
agttttatta attttttaat 
tggtttgttt tagagtagtt 
tttggggata aatgaggtgt 
attaagtgaa aggatagggt 
tttatggttt tgattatgtt 
ggttatggtt tgttgtttag 
tagtttgggg gttgggagtt 
atttaaagta gattaaataa 
tattttttaa gttgtaattt 
gtttagtttt tggggggaaa 
agtggtttgt gattaaagtt 
tgtgtattta tgtgaatgtg 
gtttttatgt aaaagtatta 
tttaaatatt aatgaatgga 
tattgaatgg tgtattgtag 
gttatttaga taaaatgaag 
ataagttgtt ttaagttggt 
tatttttttt tttagatttt 
aaaaattatt tttttattaa 
atttatgttt tgaaaggtat 
ggtttaagtt ggtttgtaga 
gaaaagggaa ggaggagtta 
agattttttt tgtttggtta 
gattgagtag ttaatgttga 
gggggaggaa atggatttgg 
tttgtatagt ggtgtagggg 
ttttggtttt atatattttg 
agatgtaaaa taggaatatt 
attagttatt attattagta 
tttaatttta ttttttttgt 
aaaatttatt tatttttatt 
tttttggata gtaaataatt 
aatagtggtt aatgattgtt 
agaagttttg ggaatgaggg 
gtgttgtgga aataggttat 
ttgagtgtta tggttgtgtt 
aagagtttgt tgaagaggat 
ggattttttg aagggagaag 
ttatttttat ttggggtggg 
attatataat tttaaaaagg 
aggtttagtg aaaaattgtt 
tattgttttt ttttttttta 
gggtgagaaa -aatrg^attg 
Seite 4il 



tggatttgaa 
ataattatag 
tgtaagaagg 
atttaagttt 
gtatttgttt 
tatagggtag 
atattttttt 
ttaagaggta 
aaaggggagg 
aattggtgtg 
atttatttgt 
gagaatattt 
taggatagag 
tgagtgttta 
tatatggtag 
agtttggtta 
tgatttggga 
ttatgtggaa 
tatttttaaa 
ttttatatgt 
tgatgttttt 
tgtatatttg 
aattttaata 
gatttatatg 
gtttattatt 
atttgggtta 
gaaggtttgt 
ttatatatag 
taattttggg 
aatagaaggt 
tttggagggt 
gtgttggttt 
ttttttttaa 
gaatttgttg 
ttgggaaaaa 
tttgttagtt 
ttttgtattg 
ttttgtgtgt 
tagggggtgt 
atattgtagt 
gtggggtggg 
attatattaa 
gtaaatatag 
atttattaat 



5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 

6420 

6480 

6499 



ggtttagtta 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2_640 _ 
"2700 



attattgtat 

ggaagaatgt 

agtgaatttt 

ataattttgt 

gttgatgtat 

gggaagtaag 

ggagtaataa 

tattttttat 

tttaaggaaa 

aataatgtat 

tgttatagtt 

tgtgtttttt 

gaaagatttg 

ttgatataag 

taaattgtat 

gttataatga 

aaaaaattta 

agttaaagaa 

ttaattaaaa 

gaaaattttt 

tatattatat 

tgaatttttt 

gaatgaattt 

gtagttaatt 

ttatatattt 

tttgaagttt 

tgtgttttta 

tataaatgtg 

agtttttatg 

9 



tgtatgtaaa 

ggtgtgtaat 

ttgtttagaa 

agatatttat 

ttttttattt 

taaataaata 

atttatttat 

agagttaaaa 

ttagataagt 

gatgaattta 

atatattaat 

tttttttttt 

aatattattt 

atggttgatt 

ttttagttag 

atttatagtt 

ttgggtaggt 

gtaatttttt 

atttttaaaa 

taattaagtt 

aagatttgat 

aaggagatgg 

gttatttgta 

ttagttaaaa 

ttgtaaaata 

tgtaggtgag 

agattagttg 

atttttgtaa 

tttttttttt 



taagaaatgt 

attgtgtata 

tattaaagat 

ttgatgattt 

agttaataaa 

ttgttaattt 

agtatttgtt 

tgttatttag 

aaaattattt 

tttaattatg 

ttaaaaagta 

tttttttttt 

attttaaatt 

aaggtgtttt 

agatgtaata 

tagtatttat 

gattttaatt 

tattgtgttt 

tatttgtttt 

gtttttaggt 

tgtaatattt 

attaattgat 

gtatttattt 

ttatttttgt 

agtaaataat 

atgtaataag 

aatataagaa 

tatgttttaa 

ttagtgagtg 



seq_prot. 
gttatatttg 
ttttaaatat 
aaggattaga 
ttaatttttt 



ttagaaaagg 
ttttatttat 
tttaggataa 
taataaataa 
tttttgtaat 
aggtgggagg 
aaaaaaaaaa 
tttttttttt 
aagtatatgt 
tttttggttt 
tatttttatt 
tatatgttgt 
atttgtagtt 
tagtatgatt 
agtttttttg 
ttttaaggat 
aaatattatt 
ttgtaaagat 
gtttttttaa 
aaaagatatt 
aaataaaata 
ttttgttttt 
aatgtttgat 
ttagatttat 
tgttgatttt 



txt 
tgatgttggg 
tattttaatt 
tattattttt 
atttttaaat 
ttatgtttat 
ggatattata 
tattttattt 
tatttgttag 
ttaatgaaaa 
agtgaaattt 
aggggggggt 
tattgtgtat 
gttattttaa 
gagtttatta 
atttaatatt 
tatatataag 
ttttttgtat 
atgtattttt 
ttagattttt 
aatttttttt 
ttttaagttt 
ttatttttag 
tgtttgaaat 
tgatagaaag 
aaagttaatt 
gtataatgta 
aaatattttt 
tttaaatgtt 
ttaatatggt 



tatttatata 

tattttttgt 

ttttttttgt 

gagatgaaat 

tttttaaaaa 

tatattagta 

ttaggaaatt 

ttttagagta 

ggtataatag 

aaattttttt 

aatttttttt 

tagtttttat 

gtaatatgtt 

tttttttatt 

atttttgaat 

taatgtaaga 

atttaattat 

ttatgttttt 

atattaattt 

aattatatta 

gtattttaaa 

attttaaaag 

agtttaaatt 

gaatatgttt 

tttaaagaaa 

attaaaaata 

atgtatttta 

ttttatgtag 

attattaatt 



<210> 427 
<211> 4441 
<212> DNA 

<213> Artificial sequence 

<220> . , . 

<223> chemically treated genomic DNA (Homo sapiens) 



<400> 427 

tagttgataa 

tttatataag 

gtaaaatata 

atatttttga 

gtttttttga 

aaaatgtgtt 

tagtttgaat 

ttttttgaag 

atttgaggta 

gtagtgtgat 

taagttaatg 

aaaaaatata 

tgtaattaag 

tttttgtatt 

tatttagagg 

gaatgaagag 

aaatgtatta 

tatggaaatt 

agagagagat 

aaaagaaatt 

tttgttgtat 

atgttttgag 

agattttttg 

ttgttgatat 

ttttttgaaa 

tattttgttt 

tatgaaaaag 



tattatgtta 

aagtgtttaa 

tgaaaatatt 

ttgtattatg 

aggttggttt 

tttttttatt 

tattttaaat 

tttggaggta 

tagatttgga 

tgaggaaaat 

tagaaattta 

gaaaaatata 

tgtataaaaa 

gtttgtgtat 

tagtattgag 

atggtgaatt 

tttgaagtaa 

gatatataat 

tgtttttttt 

tagattttgt 

ttttttatta 

gttgataagt 

aaaatgaagt 

gtgtgatatt 

aatgaatata 

tatttagaaa 

agaaagtagt 



aaaagttagt 

aataaatttg 

tgttaagtat 

taaaagtagg 

ttattttatt 

aaatgttttt 

attgaaaaaa 

ggtttttgta 

gggtaatatt 

tatttttaag 

gtagagaagt 

tagttatgtt 

gaattgtaga 

agtagtatgt 

tagtggggat 

taagttgaag 

tatatatgtt 

gaaaagagag 

tttttttttt 

tttttttatt 

aattataaag 

gttatttatt 

gttgttttga 

tataaataga 

atttttttta 

taaaaagtta 

atttagtttt 



atatttatta 

gttgagatat 

ttttttatat 

gtttgttgta 

tattatttat 

tataaaaata 

taaatgaatg 

agttaattag 

taaatattat 

aatttaggag 

tgaaataagt 

gaaatatagt 

tgattagaat 

aataagtatt 

atattgtatt 

aaaagtattt 

taatttgagg 

agggagagga 

ttgtttttta 

ttataattag 

aggataattt 

gttaaatggt 

aaataagtgt 

agagttggta 

gtttgttagt 

aaaattatta 

tatttttggt 



ggagagaaag 

attataaaaa 

ttaattaatt 

ttttatttat 

ttattttgta 

gttttgatta 

ttgtaagtaa 

tttatttttt 

agttaaattt 

tgatttggtt 

attttaaaaa 

aaggagattg 

tatttattta 

agattgtaaa 

tttggttaaa 

tggttaatta 

taagtggtat 

agagagagag 

aattgatgta 

gtgagtttat 

tatttgttta 

attttggttt 

tgtgagtaaa 

atatttgttt 

tgaatgaaag 

agtaagtgtt 

gttttaaata 



gtataaaaat 

ttatatttgt 

ttaaaaatat 

aaagttttaa 

aaagtatgta 

gggttagttg 

taagtttatt 

taaagaattt 

tgtgtgatgt 

aaaaaatttt 

tttttaatta 

tttttttaat 

gtaagttttt 

tttgttgtaa 

agtgtagttt 

ttttatgtta 

ttaggttttt 

agaaagatat 

taattatagt 

tatgtattat 

gtttttttaa 

tgtaggaagt 

tttattattt 

gtttattttt 

gatatattag 

tataagatta 

gaggatttat 



2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4441 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 



Seite 412 



<220> seq_prot.txt 
<223> Detection primer for plau 

<400> 482 

gagagagata gttggggagt tt 

<210> 483 
<211> 25 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for plau 
<400> 483 

caaacaaact tcatctacca aatac 

<210> 484 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for TMEFF2 
<400> 484 

tgttggttgt tgttgttgtt 

<210> 485 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for TMEFF2 
<400> 485 

ctttctaccc atcccaaaa 

<210> 486 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for ESR1 
<400> 486 

ttgttggata gaggttgagt tt 

<210> 487 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for ESRl 
<400> 487 

ctatcaattc ccccaactac t • ... 
<210> 488 '■!';',■:-. 

Seite 448 



seq_prot.txt 

<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection pnmer for SYK 
<400> 488 

22 

gtgggttttg ggtagttata ga 

<210> 489 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for SYK 



20 



<400> 489 

taacctcctc tccttaccaa 

<210> 490 
<211> 23 
<212> DNA 

<213> Artificial sequence 

<223> Detection primer for HSPBl 
<400> 490 

23 

aagagggttt agtttttatt tgg 

<210> 491 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for HSPBl 
<400> 491 

cctacctcta ccacttctca at 

♦ 

<210> 492 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> - 
<223> Detection primer for RASSF1 

<400> 492 

22 

agtgggtagg ttaagtgtgt tg 

<210> 493 

<211> 19 

<212> DNA 

<213> Artificial Sequence 

<220> „ 
<223> Detection primer for RASSF1 

<400> 493 



22 



seq_prot.txt 

ccccaaaatc caaactaaa 

<210> 494 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for TES 
<400> 494 

aggttgggga ttttagtttt t 

<210> 495 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for TES 
<400> 495 

accttcttca ctttattttc ca 

<210> 496 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for PITX2 
<400> 496 

9taggggagg gaagtagatg t 

<210> 497 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for PITX2 
<400> 497 

tcctcaactc tacaaaccta aaa 

<210> 498 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for GRIN2D 
<400> 498 

atagtttgtg gtttggattt tt 
<210> 499 

<211> 22 , 
<212^ DNA 

<213> Artificial Sequence 



Seite 450 



seq_prot.txt 

<220> 

<223> Detection pnmer for GRIN2D 
<400> 499 

aaaacctttc cctaacttca at 

<210> 500 
<211> 21 
<212> dna 

<213> Artificial sequence 
<220> 

<223> Detection primer for PSATl 
<400> 500 

gtaggtggtt aattttgggt t 

<210> 501 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for PSATl 
<400> 501 

ctcattcaca ctatatccat tea 

<210> 502 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for PSATl 
<400> 502 

taagagagag gagttgaggt tt 

<210> 503 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for PSATl 
<400> 503 

ccaaaattaa ccacctacct aa 

<210> 504 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for CGA 
<400> 504 

tagtggtata agtttggaaa tgtt 
<210> 505 

Seite 451 



<211> 22 seq_prot.txt 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for CGA 

<400> 505 

tccacctaca tctaaaccct aa 

<210> 506 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for CYP2D6 
<400> 506 

ggggttaagg tttttatggt a 

<210> 507 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for CYP2D6 
<400> 507 

cctcctaaac taaatccaac aa 

<210> 508 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for COX7A2L 
<400> 508 

ggaggtgtaa ggagaataga ga 

<210> 509 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for COX7a2l 
<400> 509 

aatcctaaaa accctaactt ttaat 

<210> 510 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for ESR2 
<400> 510 
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seq_prot.txt 



tagaggggag tagtgtttga gt 

<210> 511 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for ESR2 
<400> 511 

aaaccttccc aataacctct ta 

<210> 512 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection pnmer for PLAU 
<400> 512 

gtgatatttg gggattgtta tt 

<210> 513 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for PLAU 
<400> 513 

actccctccc ctatcttaca 

<210> 514 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for vtn 
<400> 514 

gttatttggg ttaatgtagg ga 

<210> 515 
<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for VTN 
<400> 515 

tctatcccct caaacttaaa aa 

<210> 516 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
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<220> seq^prot . txt 

<223> Detection primer for SULTlAl 
<400> 516 

gaatttaggg aaggagttag ttg 

<2I0> 517 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for SULTlAl 

• 

<400> 517 

atactaccaa acccactcaa ac 

<210> 518 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for pcaf 
<400> 518 

ggataaatga ttgagaggtt gt 

<210> 519 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for pcaf 
<400> 519 

cctcccttaa ttctcctacc 

<210> 520 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for prkcd 
<400> 520 

gatagaagga ttttagtttt tattgtt 

<210> 521 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for prkcd \ 
<400> 521 

cttaacccat cccaatca . 
<210> 522 



seq_prot.txt 

<211> 21 
<212> DNA 

<213> Artificial sequence 

<220> . _ 

<223> Detection pnmer for ONECUTZ 

<400> 522 

tttgttggga tttgttagga t 

<210> 523 
<211> 22 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for ONECUT2 
<400> 523 

aaacatttta cccctctaaa cc 

<210> 524 
<211> 24 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for BCL6 
<400> 524 

gggtaagaaa gaaggaatta gttt 

<210> 525 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

<220> m . _ c 

<223> Detection pnmer for BCL6 

<400> 525 

catcaccact tctaaaaacc c 

<210> 526 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> - 

<223> Detection primer for WBP11 

<400> 526 

aagaggtgag gaagagtagt aaat 

<210> 527 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

< 220> - x: llD „n 

<223> Detection pnmer for WBPll 
<400> 527, 
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ctcccaacaa ctaaatcaaa at 

<2X0> 528 
<211> 19 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for CMX1) 
<400> 528 

tgtaggagag gttgggaag 

<210> 529 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for (MXl) 
<400> 529 

ccaaacataa catccactaa aa 

<210> 530 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for n.n. 
<400> 530 

taggtttaag aggagaggga at 

<210> 531 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for n.n. 
<400> 531 

aaacaactac ccaaatccaa c 

<210> 532 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for app 
<400> 532 

gagtaaggaa gggggatg 

<210> 533 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Detectiion primer for app 
<400> 533 

aacccaaatc tttaatacaa aaa 

<210> 534 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for NETOl 
<400> 534 

ggagttttta gaagaggaag att 

<210> 535 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
< 220> 

<223> Detection primer for NETOl 
<400> 535 

acttcacaat aaataccctc cc 

<210> 536 
<211> 22 
<212> DNA 

<213> Artificial sequence 

<220> . 

<223> Detection primer for TBC1D3 

<400> 536 

ggtagaggaa gtagttggtt tg 

<210> 537 
<211> 23 
<212> DNA 

<213> Artificial Sequence 

<220> _ _ . 

<223> Detection primer for TBC1D3 

<400> 537 

cttttatatt tctcccaatc tec 

<210> 538 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for GRB7 
<400> 538 

ttaggaagtt ttaggaatga gg 
<210> 539 
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<212> DNA ^ 

<213> Artificial Sequence 
<220> 

<223> Detection primer for GRB7 
<400> 539 

aaaatccata accaccaaaa ta 

<210> 540 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for CYP2D6 
<400> 540 

atttgtagtt tggggtgatt t 

<210> 541 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for CYP2D6 
<400> 541 

aatttcctaa cccactatcc tc 

<210> 542 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for CDK6 
<400> 542 

gtgtaatgat tttggattga ga 

<210> 543 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for CDK6 
<400> 543 

accttaaaca ccttcccata a 

<210> 544 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for Cchr. Ipl3.2) 
<400> S44 

S^te~458 
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aaggaaggta gagggttgag t 

<210> 545 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
2 2 

<223> Detection primer for (Chr. Ipl3.2) 
<400> 545 

aaaatccaaa attaacacca tt 

<210> 546 

<211> 22 

<212> DNA 

<213> Artificial sequence 

<220> r __ ^ 

<223> Detection primer for (Chr. 17qZ5.i) 

<400> 546 

agtagatgaa gttggggatt ag 

<210> 547 
<211> 22 
<212> DNA 

<213> Artificial sequence 

<220> _ 1N 

<223> Detection primer for (Chr. 17q25.Jj 

<400> 547 

tcctactatc ccttctcaaa aa 

<210> 548 
<211> 24 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for ABCA8 
<400> 548 

tgattgtgta gattattttt ggtt 

<210> 549 
<211> 23 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for ABCA8 
<400> 549 

caaactctct aaacctcaat etc 

<210> 550 
<211> 22 
<212> DNA 

<213> Artificial sequence 
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<223> Detection primer for Cchr. 12ql4.3) 
<400> 550 

gatgaaagtg gaaagattat gg 

<210> 551 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for Cchr. 12ql4.3) 
<400> 551 

accctaacat tctctaaaca aca 

<210> 552 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for Cchr. 8ql2.1) 
<400> 552 

atttgaaggt tgtgtttgta ga 

<210> 553 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for Cchr. 8ql2.1) 
<400> 553 

ctccaactct cctcacctc 

<210> 554 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for MARK2 
<400> 554 

taaagtagga aggtttggtt tg 

<210> 555 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for mark2 
<400> 555 

tcaccactat cctcaataat ca 
<210> 556 
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<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for ELK1 
<400> 556 

ttagaagtga aagtagaagg gttt 

<210> 557 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for ELK1 
<400> 557 

cctctaattc ctatcaatca cc 

<210> 558 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> . 

<223> Detection primer for Q8WUT3 

<400> 558 

ggttagaagt tagaggggta gg 

<210> 559 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for Q8WUT3 
<400> 559 

ccatcccatt acctataaaa at 

<210> 560 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for CGB 
<400> 560 

tttgttttag gtggtgtgta at 

<210> 561 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for CGB 
<400> 561 



tccaccctat tttctaccaa 

<210> 562 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for BSG 
<400> 562 

ggagtaggtg aggagtattt tg 

<210> 563 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for BSG 
<400> 563 

ttatctatcc ccacacccta at 

<210> 564 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for bckdk 
<400> 564 

tttgggagag ttttaggatt ta 

<210> 565 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for bckdk 
<400> 565 

tcacctcctt ttacaaccaa t 

<210> 566 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for SOX8 
<400> 566 

gggtgggtag taggtttgtt 

<210> 567 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Detection primer for SOX8 
<400> 567 

acacactcct taaaactctt cc 

<210> 568 
<211> 23 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection primer for DAGl 
<400> 568 

tttggttatg tggagtttat tgt 

<210> 569 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for DAGl 
<400> 569 

aataccaacc caaacatcta cc 

<210> 570 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for ORC4L 
<400> 570 

ggtaatggtg ggggtaaat 

<210> 571 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for ORC4L 
<400> 571 

cactcaaaac ttccctacct ac 

<210> 572 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for SEMA4B 
<400> 572 

gggtagaggg aggttattgt t 
<210> 573 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for SEMA4B 
<400> 573 

accaaaatac tactcccaaa tc 

<210> 574 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for ESR1 Cexon8) 
<400> 574 

tatgatttgt tgttggagat gt 

<210> 575 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection primer for ESR1 Cexon8) 
<400> 575 

cttaaaatcc ctttaactat tccc 

<210> 576 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CMXl) 
<400> 576 

aacgcgcgaa agtaaa 

<210> 577 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for (MXl) 
<400> 577 

ttgggaatgt gtgaaa 

<210> 578 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for (MXl) 
<4po> 578 
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ttcgagttgg gtcgaga 

<210> 579 
<211> 17 
<212> DNA 

<213> Artificial Sequence 

<223> Detection oligonucleotide for (MXl) 
<400> 579 

tttgagttgg gttgaga 

<210> 580 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<223> Detection oligonucleotide for (MXl) 

<400> 580 
tatgcgcggg aagatt 

<210> 581 

<211> 16 

<212> DNA . 

<213> Artificial sequence 

"^L ^3 

<223> Detection oligonucleotide for (MXl) 

<400> 581 
gtatgtgtgg gaagat 

<210> 582 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
2 2 

<223> Detection oligonucleotide for (MXl) 

<400> 582 
atttacggtt gcgcgg 

<210> 583 
<211> 17 
<212> DNA 

<213> Artificial sequence 

<223> Detection oligonucleotide for (MXl) 

<400> 583 
tatggttgtg tgggtta 

<210> 584 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
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<223> Detection oligonucleotide for n.n. 
<400> 584 

aggcgtttat agtcggt 

<210> 585 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for n.n. 
<400> 585 

aggtgtttat agttggt 

<210> 586 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for n.n. 
<400> 586 

tttcgagttc ggagta 

<210> 587 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for n.n. 
<400> 587 

ttttgagttt ggagtag 

<210> 588 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for n.n. 
<400> 588 

ttgtcggtcg tagcgg 

<210> 589 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for n.n. 
<400> 589 

tttgttggtt gtagtgg 
<210> 590 
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<211> 16 
<212> DNA 

<213> Artificial Sequence 

<220> _ 

<223> Detection oligonucleotide for N.N. 

<400> 590 

ttcgttacgg cggtag 

<210> 591 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<220> - 

<223> Detection oligonucleotide for N.N. 

<400> 591 

agtttgttat ggtggt 

<210> 592 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<220> . 

<223> Detection oligonucleotide for APP 

<400> 592 

tgaaacgagg cggaga 

<210> 593 

<211> 16 

<212> DNA 

<213> Artificial Sequence 

<220> , _ 

<223> Detection oligonucleotide for APP 

<400> 593 
tgaaatgagg tggaga 

<210> 594 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for APP 
<400> 594 
gacgttgcgt tttcgg 

<210> 595 
<211> 17 
<212> DNA 

<213> Artificial Sequence 

<220> _ 
<223> Detection oligonucleotide for APP 

<400> 595 
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ggatgttgtg tttttgg 

<210> 596 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for app 
<400> 596 
tttttagcgg gtcgga 

<210> 597 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for app 
<400> 597 

tttttagtgg gttgga 

<210> 598 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for APP 
<400> 598 

ggacgttcgt aagcgg 

<210> 599 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for APP 
<400> 599 

ggatgtttgt aagtgg 

<210> 600 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ORC4L 
<400> 600 

ttatacgcgt tgtttat 

<210> 601 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
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2 2 C3 

<223> Detection oligonucleotide for 0RC4L 
<400> 601 

tgtattatat gtgttgttt 

<210> 602 
<211> 16 
<212> DNA 

<213> Artificial sequence 

<223> Detection oligonucleotide for ORC4L 

<400> 602 
agcgtgacgg ttcgag 

<210> 603 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

■^2 2 0^^ 

<223> Detection oligonucleotide for ORC4L 

<400> 603 
agtgtgatgg tttgag 

<210> 604 
<211> 16 
<212> DNA 

<213> Artificial sequence 
■^220?* 

<223> Detection oligonucleotide for ORC4L 

<400> 604 
attaggcgag tttcgt 

<210> 605 
<211> 17 
<212> DNA 

<213> Artificial sequence 

<223> Detection oligonucleotide for ORC4L 
<400> 605 

ttaggtgagt tttgttt 

<210> 606 
<211> 17 
<212> DNA 

<213> Artificial sequence 

<223> Detection oligonucleotide for NETOl 
<400> 606 

tacgttcggt tttacga 
<210> 607 
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<211> 19 seq_prot.txt 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for NET01 
<400> 607 

ttatgtttgg ttttatgat 

<210> 608 
<211> 16 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for NETOl 
<400> 608 

ttacgtcggt ttcgat 

<210> 609 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for NETOl 
<400> 609 

tttatgttgg ttttgatt 

<210> 610 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for NETOl 
<400> 610 

ttcggtttcg ggaaag 

<210> 611 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for NETOl 
<400> 611 

tttggttttg ggaaagg 

<210> 612 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for NETOl 
<400> 612 
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tgtcgtacgt gtttat 

<210> 613 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
^^220^^ 

<223> Detection oligonucleotide for NETOl 
<400> 613 

aatttttgtt gtatgtgt 

<210> 614 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
2 2 

<223> Detection oligonucleotide for (Chr. 17q25.1) 

<400> 614 
ttaggtcggg aggaaa 

<210> 615 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<223> Detection oligonucleotide for (Chr. 17q25.1) 

<400> 615 
ttaggttggg aggaaa 

<210> 616 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<223> Detection oligonucleotide for (Chr. 17q25.1) 

<400> 616 

ttagacgtgg ggcgat 

<210> 617 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
^c220^^ 

<223> Detection oligonucleotide for (Chr. 17q25.1) 

<400> 617 
ttagatgtgg ggtgat 

<210> 618 
<211> 16 
<212> DNA 

<213> Artificial sequence 
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<223> Detection oligonucleotide for (chr. 17q25.1) 
<400> 618 

taaggtacga gcgtgt 

<210> 619 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for (chr. 17q25.1) 
<400> 619 

aaggtatgag tgtgtg 

<210> 620 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for (Chr. 17q25.1) 
<400> 620 

gtagagtacg agagatt 

<210> 621 
<2U> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for (chr. 17q25.1) 
<400> 621 

ggtagagtat gagagat 

<210> 622 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for (chr. 12ql4.3) 
<400> 622 

atgacgatga ttggcga 

<210> 623 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for (Chr. 12ql4.3) 
<400> 623 

gatgatgatt ggtgagt 

<210> 624 
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<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for (Chr. 12ql4.3) 
<400> 624 
ttatgacgtt taatcgt 

<210> 625 

<211> 19 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for (Chr. 12ql4.3) 

<400> 625 

agttatgatg tttaattgt 

<210> 626 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for (Chr. 12ql4.3) 
<400> 626 
aatcgaacgt tggcgt 

<210> 627 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for (Chr. 12ql4.3) 

<400> 627 

aaattgaatg ttggtgt 

<210> 628 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for MARK 2 

<400> 628 

atatttcggg ggaagt 

<2X0> 629 
<211> 17 
<212> DNA 

<2J3> Artificial sequence 

<220> ^ . , r 

Detection oligonucleotide for MARK2 



<400> 629 
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tatattttgg gggaagt 

<210> 630 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for MARK2 
<400> 630 

tttcgtattt gtcgga 

<210> 631 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for MARK2 
<400> 631 

tttgtatttg ttggagt 

<210> 632 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for MARK2 
<400> 632 

ggttatatcg tagggta 

<210> 633 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for MARK2 
<400> 633 

gggttatatt gtagggt 

<210> 634 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for MARK2 
<400> 634 

agggggacga attagg 

<210> 635 
<211> 16 
<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Detection oligonucleotide for MARK2 

<400> 635 

gagggggatg aattag 

<210> 636 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for Q8WUT3 

<400> 636 

tagaacggcg tgggat 

<210> 637 
<2U> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for Q8WUT3 

<400> 637 

tagaatggtg tgggat 

<210> 638 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for Q8WUT3 
<400> 638 

gtcgcgatgt agttacgt 

<210> 639 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for Q8WUT3 
<400> 639 

gttgtgatgt agttatgt 

<210> 640 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for Q8WUT3 
<400> 640 
ttagtttcgg gatcgg 
<210> 641 
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<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for Q8WUT3 
. <400> 641 

tttagttttg ggattgg 

<210> 642 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for Q8WUT3 
<400> 642 
ttcgtttttc gggata 

<210> 643 
<211> 18 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for Q8WUT3 
<400> 643 

tttgtttttt gggataaa 

<210> 644 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for BSG 
<400> 644 
tacggttcgc gttgtt 

<210> 645 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for BSG 
<400> 645 

* * 

ggagtatggt ttgtgt 

<210> 646 

<211> 16 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for BSG 

<400> 646 
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gtaaggttcg gcgaga 

<210> 647 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
^c22{)^^ 

<223> Detection oligonucleotide for BSG 

<400> 647 
gtaaggtttg gtgaga 

<210> 648 
<211> 16 
<212> DNA 

<213> Artificial sequence 

<223> Detection oligonucleotide for BSG 

<400> 648 
ttacgttttc gggaag 

<210> 649 
<211> 17 
<212> DNA 

<213> Artificial Sequence 

<223> Detection oligonucleotide for BSG 
<400> 649 

ttatgttttt gggaagg 

<210> 650 
<211> 17 
<212> DNA 

<213> Artificial sequence 

2 

<223> Detection oligonucleotide for BSG 
<400> 650 

tacgtttcga ggatcgg 

<210> 651 
<211> 17 
<212> DNA 

<213> Artificial sequence 

<223> Detection oligonucleotide for BSG 

<400> 651 
tatgttttga ggattgg 

<210> 652 
<211> 16 
<212> DNA 

<213> Artificial sequence 
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<2 20> seq_prot.txt 
<223> Detection oligonucleotide for BCKDK 
<400> 652 

gggcgttagg cggatt 

<210> 653 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for BCKDK 
<400> 653 

tgggtgttag gtggat 

<210> 654 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for bckdk 
<400> 654 

agagcggtta gcgtag 

<210> 655 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for bckdk 
<400> 655 

tgagagtggt tagtgt 

<210> 656 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for BCKDK 
<400> 656 

atagagggcg tgaatt 

<210> 657 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for BCKDK 
<400> 657 

agagggtgtg aatttt 

<210 > 658 
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<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for bckdk 
<400> 658 

taggatttac gaggaaa 

<210> 659 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> . ( 

<223> Detection oligonucleotide for BCKDK 

<400> 659 

aggatttatg aggaaaat 

<210> 660 
<211> 16 
<212> DNA 

<213> Artificial sequence 

<220> , „ 

<223> Detection oligonucleotide for S0X8 

<400> 660 
ttttcggttc gaagta 

<210> 661 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for SOX8 
<400> 661 

ttttggtttg aagtagg 

<210> 662 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for SOX8 

<400> 662 

aggtcgtttt tatcga 

<210> 663 
<211> 18 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for SOX8 
<400> 663 
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aggttgtttt tattgagt 

<210> 664 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for S0X8 
<400> 664 
gtagttacgg ggcgtt 

<210> 665 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for SOX8 
<400> 665 

gtagttatgg ggtgtt 

<210> 666 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for SOX8 
<400> 666 

tgtcgtatag gcggtt 

<210> 667 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for SOX8 
<400> 667 

ttgttgtata ggtggtt 

<210> 668 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for SEMA4B 
<400> 668 

agttttgggc gcgattt 

<210> 669 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
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2 2 ^) 

<223> Detection oligonucleotide for SEMA4B 

<400> 669 
agttttgggt gtgattt 

<210> 670 
<211> 18 
<212> DNA 

<213> Artificial sequence 
^c2 2 

<223> Detection oligonucleotide for SEMA4B 
<400> 670 

agcgaataga ttgcggat 

<210> 671 
<211> 18 
<212> DNA 

<213> Artificial sequence 
^220^ 

<223> Detection oligonucleotide for SEMA4B 
<400> 671 

agtgaataga ttgtggat 

<210> 672 
<211> 18 
<212> DNA 

<213> Artificial sequence 

<220> „ . , An 

<223> Detection oligonucleotide for SEMA4B 

<400> 672 

agcgattaga ttgcggat 

<210> 673 
<211> 18 
<212> DNA 

<213> Artificial sequence 
^c22^3^^ 

<223> Detection oligonucleotide for SEMA4B 
<400> 673 

agtgattaga ttgtggat 

<210> 674 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for SEMA4B 
<400> 674 
taggcgttcg attttt 
<210> 675 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for SEMA4B 
<400> 675 

gggtaggtgt ttgatt 

<210> 676 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CDKN2A 
<400> 676 

ggcgttgttt aacgtat 

<210> 677 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CDKN2A 
<400> 677 

gggtgttgtt taatgta 

<210> 678 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CDKN2A 
<400> 678 

aacgtatcga atagttacgg 

<210> 679 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CDKN2A 
<400> 679 

aatgtattga atagttatgg 

<210> 680 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for cdkn2a 
<400> 680 
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tacggtcgga ggtcga 

<210> 681 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CDKN2A 

<400> 681 

tatggttgga ggttga 

<210> 682 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CA9 
<400> 682 

atggtttcga taattttt 

<210> 683 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CA9 
<400> 683 

atggttttga taatttttt 

<210> 684 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CA9 

<400> 684 

tgtacgtata gttcgta 

<210> 685 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CA9 
<400> 685 

ttaatgtatg tatagtttgt 

<210> 686 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
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„^ seq_prot.txt 
<220> 

<223> Detection oligonucleotide for CA9 
<400> 686 

atatatcgtg tgttggg 

<210> 687 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for ca9 
<400> 687 

atatattgtg tgttggg 

<210> 688 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for CA9 
<400> 688 

atagttagtc gtatggt 

<210> 689 
<211> 18 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for ca9 
<400> 689 

atagttagtt gtatggtt 

<210> 690 
<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for PAX6 
<400> 690 

tattgtttcg gttgttag 

<210> 691 
<211> 18 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for PAX 6 
<400> 691 

tattgttttg gttgttag 
<210> 692 
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<211> 16 
<212> DNA 

<213> Artificial Sequence 
^c220^^ 

<223> Detection oligonucleotide for PAX6 
<400> 692 
ggcgacgcgg ttagtt 

<210> 693 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
2 2 

<223> Detection oligonucleotide for PAX6 
<400> 693 
ggtgatgtgg ttagtt 

<210> 694 

<211> 16 

<212> DNA 

<213> Artificial sequence 

<220> _ 

<223> Detection oligonucleotide for PAX 6 

<400> 694 
taggtcgcgt agattt 

<210> 695 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PAX 6 
<400> 695 

agtttaggtt gtgtaga 

<210> 696 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<220> . , ^ 

<223> Detection oligonucleotide for PAX6 

<400> 696 

tagcgtattt ttcggt 

<210> 697 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> „ 

<223> Detection oligonucleotide for PAX6 

<400> 697 
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tagtgtattt tttggttg 

<210> 698 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for sfn 
<400> 698 

agtaggtcga acgtta 

<210> 699 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for SFN 
<400> 699 

agagtaggtt gaatgtt 

<210> 700 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for sfn 
<400> 700 

ttgcgaagag cgaaat 

<210> 701 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for SFN 
<400> 701 

tgtgaagagt gaaattt 

<210> 702 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for sfn 
<400> 702 

ttcgaggtgc gtgagt 

<210> 703 
<211> 17 
<212> DNA 

<213> Artificial Sequence 



Seite 486 



seq_prot.txt 

<220> . J ^ 

<223> Detection oligonucleotide for SFN 

<400> 703 

-tttgaggtgt gtgagta 

<210> 704 
<211> 16 
<212> ON A 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for SFN 

<400> 704 

tgtgcgatat cgtgtt 

<210> 705 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for SFN 
<400> 705 
tgtgatattg tgttggg 

<210> 706 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for TP73 

<400> 706 

ttcgttcgcg aagtta 

<210> 707 
<211> 18 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for TP73 
<400> 707 

ggtttgtttg tgaagtta 

<210> 708 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PLAU 
<400> 708 
aagaggtcgt cgggat 
<210> 709 
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<211> 16 seq.prot.txt 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for plau 
<400> 709 

aagaggttgt tgggat 

<210> 710 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for plau 
<400> 710 

ttatcgcggg tatttt 

<210> 711 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for plau 
<400> 711 

ttggttattg tgggtat 

<210> 712 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for plau 
<400> 712 

ttcgatttcg ttattatg 

<210> 713 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for plau 
<400> 713 

tttgattttg ttattatgag 

<210> 714 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for plau 

<400 > 714 
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gtcgtgagcg atttta 

<210> 715 

<211> 16 

<212> DNA 

<213> Artificial sequence 
2 2 ^)«^ 

<223> Detection oligonucleotide for PLAU 

<400> 715 
ttggttgtga gtgatt 

<210> 716 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for GRIN2D 
<400> 716 

atttcgattt ggaggcgg 

<210> 717 
<211> 18 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for GRIN2D 
<400> 717 

attttgattt ggaggtgg 

<210> 718 
<211> 16 
<212> DNA 

<213> Artificial sequence 

<220> . , ^ 

<223> Detection oligonucleotide for PSATl 

<400> 718 

ttcgtcggtg ttacgt 

<210> 719 
<2U> 17 
<212> DNA 

<213> Artificial sequence 

< 220 > ^ * 

<223> Detection oligonucleotide for PSATl 

<400> 719 
ttttgttggt gttatgt 

<210> 720 
<211> 16 
<212> DNA 

<213> Artificial sequence 

* 
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<220> seq_prot.txt 
<223> Detection oligonucleotide for PSAT1 
<400> 720 

ggcgagttcg ggtagt 

<210> 721 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PSATl 
<400> 721 

ggtgagtttg ggtagt 

<210> 722 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PSATl 
<400> 722 

aagttttcgc gagcgg 

<210> 723 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PSATl 
<400> 723 

aagtttttgt gagtgg 

<210> 724 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for PSATl 
<400> 724 

aggaagttcg gcgagg 

<210> 725 
<211> 16 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for PSATl 
<400> 725 

aggaagtttg gtgagg 

<210> 726 
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<211> 16 
<212> DNA 

<213> Artificial sequence 

<223> Detection oligonucleotide for CYP2D6 

<400> 726 
tacgacgatt ttcgtt 

<210> 727 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<223> Detection oligonucleotide for CYP2D6 
<400> 727 

gagtatgatg atttttgt 

<210> 728 
<211> 17 
<212> DNA 

<213> Artificial sequence 

<223> Detection oligonucleotide for CYP2D6 

<400> 728 
ttcgtcgatt aagtcgg 

<210> 729 
<211> 18 
<212> DNA 

<213> Artificial sequence 

<223> Detection oligonucleotide for CYP2D6 
<400> 729 

tttgttgatt aagttggt 

<210> 730 
<211> 16 
<212> DNA 

<213> Artificial sequence 
2 2 

<223> Detection oligonucleotide for CYP2D6 

<400> 730 
gtggcgcgag tagagg 

<210> 731 
<211> 16 
<212> DNA 

<213> Artificial sequence 

t 

<223> Detection oligonucleotide for CYP2D6 
<400> 731 
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seq_prot.txt 

gtggtgtgag tagagg 

<210> 732 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CYP2D6 
<400> 732 

aacgtttacg tgttcgt 

<210> 733 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CYP2D6 
<400> 733 

gtaatgttta tgtgtttgt 

<210> 734 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for SULTlAl 

<400> 734 

ttcgagtcgt tttgat 

<210> 735 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for SULTlAl 
<400> 735 

tttgagttgt tttgatg 

<210> 736 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for SULTlAl 

<400> 736 

ttcgtcgtgt acggtt 

<210> 737 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
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2 2 ^) 

<223> Detection oligonucleotide for SULTlAl 

<400> 737 
tttgttgtgt atggttt 

<210> 738 
<211> 16 
<212> DNA 

<213> Artificial sequence 

<223> Detection oligonucleotide for SULTlAl 

<400> 738 
aggatttcgt tttcgg 

<210> 739 
<211> 17 
<212> DNA 

<213> Artificial sequence 

<223> Detection oligonucleotide for SULTlAl 

<400> 739 
aggattttgt ttttggg 

<210> 740 
<211> 17 
<212> DNA 

<213> Artificial sequence 

<223> Detection oligonucleotide for SULTlAl 

<400> 740 

ttttcggttg aagtcgg 

<210> 741 
<211> 17 
<212> DNA 

<213> Artificial sequence 

<223> Detection oligonucleotide for SULTlAl 

<400> 741 
tttttggttg aagttgg 

<210> 742 
<211> 16 
<212> DNA 

<213> Artificial sequence 

<223* Detection oligonucleotide for PRKCD 
<400> 742 
atttcgcgtt cggatt 
<210> 743 
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<211> 17 seq_prot.txt 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for prkcd 

<400> 743 

gattttgtgt ttggatt 

<210> 744 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for EGR4 
<400> 744 

aagcgtattt atcgga 

<210> 745 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for EGR4 
<400> 745 

ggaagtgtat ttattgga 

<210> 746 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for EGR4 
<400> 746 

tatcggacgg tcggtt 

<210> 747 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for EGR4 
<400> 747 

atttattgga tggttgg 

<210> 748 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for EGR4 

<400> 748_ 
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aggcgtagcg ttttag 

<210> 749 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
^c22^)^* 

<223> Detection oligonucleotide for EGR4 

<400> 749 
tgaggtgtag tgtttt 

<210> 750 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
2 2 0 

<223> Detection oligonucleotide for EGR4 

<400> 750 
aacgttatag ttcgagt 

<210> 751 

<211> 19 

<212> DNA 

<213> Artificial Sequence 

<220> _ . . _ ^ 

<223> Detection oligonucleotide for EGR4 

<400> 751 

aatgttatag tttgagttt 

<210> 752 
<211> 16 
<212> DNA 

<213> Artificial sequence 
^c2 2 

<223> Detection oligonucleotide for TP73 
<400> 752 
gtgcgagtta gtcgga 

<210> 753 
<211> 16 
<212> DNA 

<213> Artificial sequence 
2 2 0^ 

<223> Detection oligonucleotide for TP73 

<400> 753 
gtgtgagtta gttgga 

<2l6> 754 
<211> 16 
<2l2> Dl'JA 

<213> Artificial Sequence 
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<220> seq_prot.txt 
<223> Detection oligonucleotide for TP73 
<400> 754 

tatcggttcg gagtta 

<210> 755 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for tp73 
<400> 755 

aggatattgg tttggag 

<210> 756 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TP73 
<400> 756 

agagtcgttc ggaatt 

<210> 757 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for TP73 
<400> 757 

tgagagttgt ttggaat 

<210> 758 
<211> 16 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for tes 
<400> 758 

tagaagtcgg ttcgtg 

<210> 759 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TES 
<400> 759 

agaagttggt ttgtgg 
<210> 760 



Seite 495 



seq_prot.txt 

<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TES 
<400> 760 
gattgggcgg cggaag 

<210> 761 

<211> 16 

<212> DNA 

<213> Artificial Sequence 

<220> . , _ 

<223> Detection oligonucleotide for TES 

<400> 761 

attgggtggt ggaagt 

<210> 762 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<220> . , ^ 

<223> Detection oligonucleotide for TES 

<400> 762 

tagcggagtc ggaggt 

<210> 763 
<211> 16 
<212> DNA 

<213> Artificial sequence 

<220> . , e 

<223> Detection oligonucleotide for TES 

<400> 763 

tagtggagtt ggaggt 

<210> 764 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TES 

<400> 764 

aattcggtcg tgggat 

<210> 765 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<220> . . , . 

<223> Detection oligonucleotide for TES 

<400> 765 , 
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aatttggttg tgggat 

<210> 766 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for GRIN2D 
<400> 766 
gagagtcggg atgatt 

<210> 767 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for GRIN2D 
<400> 767 

ggagagttgg gatgat 

<210> 768 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for GRIN2D 
<400> 768 

tagggtcgag atttgg 

<210> 769 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for GRIN2D 
<400> 769 

ttagggttga gatttgg 

<210> 770 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for GRIN2D 
<400> 770 

agtgtggcga atattg 

<210> 771 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
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<223> Detection oligonucleotide for GRIN2D 

<400> 771 
gtgtggtgaa tattgaa 

<210> 772 
<211> 19 
<212> DNA 

<213> Artificial sequence 
^» 2» 2 (3 

<223> Detection oligonucleotide for PRKCD 
<400> 772 

atttattttt cgttgtagg 

<210> 773 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
2 2 

<223> Detection oligonucleotide for PRKCD 
<400> 773 

tatttatttt ttgttgtagg 

<210> 774 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
2 2 

<223> Detection oligonucleotide for PRKCD 

<400> 774 
tttcggaaac gggaat 

<210> 775 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for PRKCD 
<400> 775 

tagttttgga aatggga 

<210> 776 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<220> _ . . r- ^ 

<223> Detection oligonucleotide for PRKCD 
<400> 776 
ggacggagtt atcggt 
<2l0> 777 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for prkcd 
<400> 777 

ggatggagtt attggta 

<210> 778 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for prkcd 
<400> 778 

gtttagcgga gggata 

<210> 779 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for prkcd 
<400> 779 

tgtttagtgg agggat 

<210> 780 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ESR1 (exon8) 
<400> 780 

ttgttacggt ttgagag 

<210> 781 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ESR1 (exonS) 
<400> 781 

ttgttatggt ttgagagt 

<210> 782 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ESR1 (exon8) 

<400> 782 
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tttgttatag tttgagagt 

<210> 783 
<211> 16 
<212> DNA 

<213> Artificial sequence 

<223> Detection oligonucleotide for ESRl (exon8) 

<400> 783 
tttgttacgg tttgag 

<210> 784 
<211> 17 
<212> DNA 

<213> Artificial sequence 

<223> Detection oligonucleotide for ESRl (exon8) 

<400> 784 
tttgttatgg tttgaga 

<210> 785 
<211> 18 
<212> DNA 

<213> Artificial sequence 

<223> Detection oligonucleotide for ESRl (exon8) 
<400> 785 

tttgttatag tttgagag 

<210> 786 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for S100A2 
<400> 786 

tttaattgcg gttgtgtg 

<210> 787 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for S100A2 
<400> 787 

tttaattgtg gttgtgtg 

<210> 788 
<211> 18 
<212> DNA 

<2i3> Artificial Sequence 
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<223> Detection oligonucleotide for S100A2 
<400> 788 

tatataggcg tatgtatg 

<210> 789 
<211> 18 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for S100A2 
<400> 789 

tatataggtg tatgtatg 

<210> 790 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for S100A2 
<400> 790 

tgtatacgag tattgga 

<210> 791 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for S100A2 
<400> 791 

tatgtatatg agtattgga 

<210> 792 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for S100A2 
<400> 792 

agttttagcg tgtgttta 

<210> 793 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for S100A2 
<400> 793 

agttttagtg tgtgttta 
<210> 794 



seq_prot.txt 

<211> 20 
<212> DNA 

<213> Artificial Sequence 

^2 ^^^^ 

<223> Detection oligonucleotide for TFFl 
<400> 794 

agaatttatc gtataaaaag 

<210> 795 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> , . . 

<223> Detection oligonucleotide for TFFl 

<400> 795 

aatttattgt ataaaaaggt 

<210> 796 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
^c22^)^^ 

<223> Detection oligonucleotide for TFFl 
<400> 796 
ggacgtcgat ggtatt 

<210> 797 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<220> . 

<223> Detection oligonucleotide for TFFl 

<400> 797 
agggatgttg atggta 

<210> 798 

<211> 16 

<212> DNA 

<213> Artificial Sequence 

<223> Detection oligonucleotide for TFFl 

<400> 798 
aacggtgtcg tcgaaa 

<210> 799 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TFFl 
<400> 799 
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aatggtgttg ttgaaat 

<210> 800 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ONECUT2 
<400> 800 

tacgtagttg cgcgtt 

<210> 801 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ONECUT2 
<400> 801 

gtatgtagtt gtgtgtt 

<210> 802 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ONECUT2 
<400> 802 

ttttgtgcgt acggat 

<210> 803 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for 0NECUT2 
<400> 803 

tttttgtgtg tatggat 

<210> 804 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ONECUT2 
<400> 804 

ttaagcgggc gttgat 

<210> 805 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
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<223> Detection oligonucleotide for ONECUT2 

<400> 805 
ttaagtgggt gttgat 

<210> 806 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<223> Detection oligonucleotide for ONECUT2 

<400> 806 
tagaggcgcg ggttat 

<210> 807 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

^^^^ 

<223> Detection oligonucleotide for ONECUT2 

<400> 807 
tagaggtgtg ggttat 

<210> 808 
<211> 17 
<212> DNA 

<213> Artificial Sequence 

<223> Detection oligonucleotide for PSATl 

<400> 808 
tttcgattcg gtttaga 

<210> 809 
<211> 18 
<212> DNA 

<213> Artificial sequence 
^c220^^ 

<223> Detection oligonucleotide for PSATl 
<400> 809 

aattgttttg atttggtt 

<210> 810 
<211> 16 
<212> DNA 

<2i3> Artificial sequence 
^c22C)^^ 

<223> Detection oligonucleotide for PSATl 
<400> 810 
taatggggcg tcgatt 
<210> 811 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PSAT1 
<400> 811 

ttaatggggt gttgatt 

<210> 812 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for PSATl 
<400> 812 

tatcgtagcg gttagg 

<210> 813 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PSATl 
<400> 813 

tattgtagtg gttaggaa 

<210> 814 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PSATl 
<400> 814 

aggaacgtta gtcgtt 

<210> 815 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for psatI 
<400> 815 

taggaatgtt agttgttt 

<210> 816 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PSATl 
<400> 816 
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ggtcgtcgta ttatgga 

<210> 817 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PSATl 
<400> 817 

tggttgttgt attatgga 

<210> 818 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for PSATl 

<400> 818 

atagtaaacg cgagga 

<210> 819 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for PSATl 

<400> 819 

agtaaatgtg aggagg 

<210> 820 
<211> 16 
<212> DNA 

<213> Artificial sequence 



<223> Detection oligonucleotide for DAG1 

<400> 820 

tttcgtggcg gagaat 

<210> 821 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for DAGl 

<400> 821 

ttttgtggtg gagaat 

<210> 822 
<211> 16 
<212> DNA 

<213> Artificial sequence 

< 
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<220> 

<223> Detection oligonucleotide for DAGl 
<400> 822 
tacggatatt tcggtt 

<2I0> 823 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for DAGl 
<400> 823 

aattatggat attttggtt 

<210> 824 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for DAGl 
<400> 824 
ttacgattcg taggtt 

<210> 825 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for DAGl 
<400> 825 

tattattatg atttgtaggt 

<210> 826 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for SYK 
<400> 826 
gaagttatcg cgttgg 

<210> 827 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for syk 
<400> 827 

agaagttatt gtgttgg 
<210> 828 



seq_prot.txt 

<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for SYK 
<400> 828 
gatcgatgcg gtttat 

<210> 829 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for SYK 
<400> 829 
gggattgatg tggttta 

<210> 830 
<211> 16 
<212> DNA 

<213> Artificial sequence 

<220> BJ ^ 

<223> Detection oligonucleotide for SYK 

<400> 830 
gttcggcggg aggaga 

<210> 831 
<211> 16 
<212> DNA 

<213> Artificial sequence 

<220> „ BJ ^ 

<223> Detection oligonucleotide for SYK 

<400> 831 
gtttggtggg aggaga 

<210> 832 
<211> 17 
<212> DNA 

<213> Artificial Sequence 

<220> , #J ^ 

<223> Detection oligonucleotide for SYK 

<400> 832 

agtcgatttt cgtttag 

<210> 833 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<220> . ^ 

<223> Detection oligonucleotide for SYK 

<400> 833 
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tagttgattt ttgtttagt 

<210> 834 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for SYK 
<400> 834 

ggaagagtcg cgggtt 

<210> 835 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for SYK 
<400> 835 

ggaagagttg tgggtt 

<210> 836 
<2U> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for cga 
<400> 836 

atatttattt tcggaaattt 

<210> 837 
<211> 21 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Detection oligonucleotide for cga 
<400> 837 

ttatttttgg aaatttatag t 

<210> 838 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CGA 
<400> 838 

tgatrttgtc gttattatt 

<210> 839 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> n . J ^ 

<223> Detection oligonucleotide for CGA 

<400> 839 

ttgattttgt tgttattatt 

<210> 840 

<211> 18 

<212> DNA 

<213> Artificial Sequence 

<220> , ... - 

<223> Detection oligonucleotide for CGA 

<400> 840 

taaattgacg ttatggta 

<210> 841 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<220> n BJ _ 

<223> Detection oligonucleotide for CGA 

<400> 841 

aaattgatgt tatggtaaa 

<210> 842 
<211> 18 
<212> DNA 

<213> Artificial sequence 

<220> . ^ 

<223> Detection oligonucleotide for CGA 

<400> 842 

aattgacgtt atggtaat 

<210> 843 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CGA 
<400> 843 

taaaaattga tgttatggt 

<210> 844 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<220> . . ^ 

<223> Detection oligonucleotide for CYP2D6 

<400> 844 
gagatcgcgt tttcgt 
<210> 845 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CYP2D6 
<400> 845 

agagattgtg tttttgt 17 

<210> 846 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CYP2D6 
<400> 846 

attcgcggcg aggata 16 

<210> 847 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CYP2D6 
<400> 847 



16 



16 



gatttgtggt gaggat 

<210> 848 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CYP2D6 
<400> 848 
gtcgtttcgg ggacgt 

<210> 849 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CYP2D6 
<400> 849 

gttgttttgg ggatgtg 17 

<210> 850 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CYP2D6 
<400> 850 
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taagtagcgt cgatag 

<210> 851 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for CYP2D6 
<400> 851 
aagtagtgtt gataggg 

<210> 852 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
^c22^)^^ 

<223> Detection oligonucleotide for RASSFl 
<400> 852 

agtaaatcgg attagga 

<210> 853 
<211> 18 

<212> DNA 

<213> Artificial Sequence 

<220> „ _ _ 

<223> Detection oligonucleotide for RASSFl 

<400> 853 

agtaaattgg attaggag 

<210> 854 
<2U> 17 
<212> DNA 

<213> Artificial Sequence 

<220> _ _ _ 

<223> Detection oligonucleotide for RASSFl 

<400> 854 

tacgggtatt ttcgcgt 

<210> 855 

<211> 18 

<212> DNA 

<213> Artificial Sequence 

<220> . . _ 

<223> Detection oligonucleotide for RASSFl 

<400> 855 

atatgggtat ttttgtgt 

<210> 856 
<211> 16 
<212> DNA 

<213> Artificial sequence 
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<223> Detection oligonucleotide for RASSFl 
<400> 856 
tgcgagagcg cgttta 

<2I0> 857 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for RASSFl 
<400> 857 

ttgtgagagt gtgttta 

<210> 858 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for TP53 
<4Q0> 858 

tattaggtcg gcgaga 

<210> 859 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for TP53 
<400> 859 

aggttggtga gaattt 

<210> 860 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TP53 
<400> 860 

ttcggtaggc ggatta 

<210> 861 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TP53 
<400> 861 

tttttggtag gtggat 
<210> 862 
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<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide 

<400> 862 

atattttgcg ttcggg 

<210> 863 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide 

<400> 863 

atattttgtg tttgggt 

<210> 864 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide 

<400> 864 

tacgacggtg atacgt 

<210> 865 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for 
<400> 865 

tttatgatgg tgatatgt 

<210> 866 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for 

<400> 866 

tacgaatgcg tggcgg 

<210> 867 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for 

<400> 867 



seq_prot.txt 
for TP53 

16 

for TP53 

17 

for TP53 

16 

TP53 

18 

CDK6 

16 

CDK6 
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tatgaatgtg tggtgga 

<210> 868 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CDK6 
<400> 868 

tttcggagta ggcgag 

<210> 869 
<211> 16 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CDK6 
<400> 869 

ttttggagta ggtgag 

<210> 870 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CDK6 
<400> 870 

tacgttagtt tcgcgg 

<210> 871 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CDK6 
<4Q0> 871 

tatgttagtt ttgtggg 

<210> 872 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CDK6 
<400> 872 

attgagacgc gtttgg 

<210> 873 
<211> 16 
<212> DNA 

<213> Artificial sequence 



seq_prot.txt 

<220> n . ^ _ 

<223> Detection oligonucleotide for CDK6 

<400> 873 
gagatgtgtt tgggta 

<210> 874 

<211> 17 

<212> DNA 

<213> Artificial Sequence 

<220> . , ^ 

<223> Detection oligonucleotide for TMEFF2 

<400> 874 

tatcgtagtt cgttcgg 

<210> 875 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TMEFF2 
<400> 875 

attgtagttt gtttggt 

<210> 876 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<220> . , _ 

<223> Detection oligonucleotide for TMEFF2 

<400> 876 
aaacgtttat cggttg 

<210> 877 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TMEFF2 
<400> 877 

aatgtttatt ggttgga 

<210> 878 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TMEFF2 
<400> 878 

ttcgtagaag aatacgcgta 
<210> 879 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TMEFF2 
<400> 879 

tttgtagaag aatatgtgta 

<210> 880 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for STKll 
<400> 880 
attaatcgtc gttcgg 

<210> 881 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for STKll 
<400> 881 

gattaattgt tgtttggg 

<210> 882 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for STKll 
<400> 882 

taatcgttag cggcgg 

<210> 883 
<2U> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for STKll 
<400> 883 

ttaattgtta gtggtgg 

<210> 884 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for STKll 
<400> 884 
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gtcgttttcg cgagga 

<210> 885 
<211> 17 
<212> DNA 

<213> Artificial Sequence 



<223> Detection oligonucleotide for STKll 
<400> 885 

gttgtttttg tgaggag 

<210> 886 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for STKll 

<400> 886 

taatgagcgc gttgta 

<210> 887 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for STKll 
<400> 887 

atgagtgtgt tgtattt 

<210> 888 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for HSPBl 

<400> 888 

agggtattcg tcggtt 

<210> 889 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for HSPBl 
<400> 889 
agggtatttg ttggtt 

<210> 890 
<211> 16 
<212> DNA 

<213> Arti f i ci al Sequence 
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<223> Detection oligonucleotide for HSPBl 
<400> 890 
agtcgtgtta cggtag 

<210> 891 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for hspb! 
<400> 891 

agttgtgtta tggtagg 

<210> 892 

<211> 16 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for HSPBl 

<400> 892 

gaattcgaga gcgcga 

<210> 893 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for HSPBl 
<400> 893 

tgaatttgag agtgtga 

<210> 894 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for HSPBl 
<400> 894 

ttttttcgtt aaggaaag 

<210> 895 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for HSPBl 
<400> 895 

ttttttttgt taaggaaag 
<210> 896 



seq_prot. txt 

<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TGFBR2 

<400> 896 

aaaacgtgga cgtttt 

<210> 897 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TGFBR2 
<400> 897 

gaaaatgtgg atgtttt 

<210> 898 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TGFBR2 
<400> 898 

tgaaagtcgg ttaaagt 

<210> 899 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for TGFBR2 
<400> 899 

tgaaagttgg ttaaagt 

<210> 900 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TGFBR2 

<400> 900 

ttggacgtcg aggaga 

<210> 901 

<211> 16 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TGFBR2 

<400> 901 
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ttggatgttg aggaga 

<210> 902 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for TGFBR2 
<400> 902 
ttttcgggcg gagaga 

<210> 903 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for tgfbr2 

<400> 903 

aaggtttttg ggtgga 

<210> 904 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for CSPG2 
<400> 904 
ttcggttagt ttcgtat 

<210> 905 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CSPG2 
<400> 905 

ttttggttag ttttgtatt 

<210> 906 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CSPG2 
<400> 906 

ttcgggttat tacgttt 

<210> 907 
<211> 19 
<212> DNA 

<213> Artificial Sequence 



seq_prot.txt 

<220> , „ f- , 

<223> Detection oligonucleotide for CSPGZ 

<400> 907 

ttttgggtta ttatgtttt 

<210> 908 
<211> 16 
<212> DNA 

<213> Artificial sequence 
2 2 0^^ 

<223> Detection oligonucleotide for CSPG2 
<400> 908 
tttagtcgcg tagcgt 

<210> 909 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
^c2 2 

<223> Detection oligonucleotide for CSPG2 
<400> 909 

atttagttgt gtagtgtt 

<210> 910 
<211> 16 
<212> DNA 

<213> Artificial sequence 

<220> . , ^ 

<223> Detection oligonucleotide for CSPG2 

<400> 910 
aattcgcgag uttaga 

<210> 911 
<211> 18 
<212> DNA 

<213> Artificial sequence 

^c22^)^' 

<223> Detection oligonucleotide for CSPG2 
<400> 911 

gaaaaaaatt tgtgagtt 

<210> 912 
<211> 16 
<212> DNA 

<213> Artificial sequence 

^£ 2 2 

<223> Detection oligonucleotide for ERBB2 
<400> 912 
tgtgagaacg gttgta 
<210> 913 
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<211> 16 seq_prot.txt 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ERBB2 
<400> 913 

tgagaatggt tgtagg 

<210> 914 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for ERBB2 
<400> 914 

rtaggcgttt cggcgt 

<210> 915 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for ERBB2 
<400> 915 

tttaggtgtt ttggtgt 

<210> 916 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ERBB2 
<400> 916 

taggtttgcg cgaaga 

<210> 917 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for erbb2 
<400> 917 

tttgtgtgaa gagagg 

■ 

<210> 918 
<211> 17 
<212> DNA 

<213> Artificial Sequence ' 
<220> 

<223> Detection oligonucleotide for ERBB2 
<400> 918 



seq_prot.txt 

taattatcgg agaagga 

<210> 919 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ERBB2 
<400> 919 

taattattgg agaaggag 

<210> 920 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ELKl 
<400> 920 

ggtcggcgtt gatttta 

<210> 921 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ELKl 
<400> 921 

ggttggtgtt gatttta 

<210> 922 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<220> , ^ 

<223> Detection oligonucleotide for ELKl 

<400> 922 
gtcgggattc gaacgg 

<210> 923 

<211> 16 

<212> DNA 

<213> Artificial Sequence 

<220> t e 

<223> Detection oligonucleotide for ELKl 

<400> 923 

gttgggattt gaatgg 

<210> 924 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
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<220> seq.prot.txt 
<223> Detection oligonucleotide for ELKl 
<400> 924 

gtcggaagtt tcggga 

<210> 925 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ELKl 
<400> 925 

gttggaagtt ttgggat 

<210> 926 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ELKl 
<400> 926 

atatcgtagg gtaggcgg 

<210> 927 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ELKl 
<400> 927 

atattgtagg gtaggtgg 

<210> 928 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for apc 
<400> 928 

ggtttcgttt aatcgt 

<210> 929 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for apc 
<400> 929 

gggttttgtt taattgta 
<210> 930 



seq_prot.txt 

<211> 16 
<212> DNA 

<213> Artificial Sequence 

<220> , , 

<223> Detection oligonucleotide for apc 

<400> 930 
ttcgtattta gcggat 

<210> 931 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for APC 
<400> 931 

ggtttgtatt tagtgga 

<210> 932 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for APC 

<400> 932 
atcggcgggt tttcga 

<210> 933 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for APC 
<400> 933 

aattggtggg tttttga 

<210> 934 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<220> . , ^ 

<223> Detection oligonucleotide for APC 

<400> 934 

attttcgagt tcggta 

<210> 935 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
2 2 

<22!3> Detection oligonucleotide for APC 
<400> 935 
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tttttgagtt tggtagt 

<210> 936 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ESR2 
<400> 936 

atttcgagga ttacgtt 

<210> 937 
<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for esr2 
<400> 937 

attttgagga ttatgtttt 

<210> 938 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ESR2 
<400> 938 

agatggcgtt tttcgta 

<210> 939 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ESR2 
<400> 939 

tagatggtgt tttttgta 

<210> 940 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ESR2 
<400> 940 

attttcgaat cgattttt 

<210> 941 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Detection oligonucleotide for ESR2 
<400> 941 

ggagtatttt tgaattgat 

<210> 942 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

< 220> . ^ 

<223> Detection oligonucleotide for ESR2 

<400> 942 

agttcgacgg ttttag 

<210> 943 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ESR2 
<400> 943 
agggagtttg atggtt 

<210> 944 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ESR2 

<400> 944 

agtttacgtg atcgag 

<210> 945 

<211> 18 

<212> DNA 

<213> Artificial Sequence 
< 220 > 

<223> Detection oligonucleotide for ESR2 

<400> 945 

agtttatgtg attgagtt 

<210> 946 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for WBPll 
<400> 946 
ttacgagaag cgggta 
<210> 947 

Seite 529 



<211> 17 seq_prot.txt 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for WBPll 
<400> 947 

attatgagaa gtgggta 

<210> 948 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for WBPll 
<400> 948 

agggggcgat tttcgg 

<210> 949 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for WBPll 
<400> 949 

tagggggtga tttttgg 

<210> 950 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for WBPll 
<400> 950 

ttagcgtcgt ttgatt 

<210> 951 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for WBPll 
<400> 951 

ttttagtgtt gtttgatt 

<210> 952 
<2U> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for WBPll 
<400> 952 



seq_prot.txt 

agttcgtttt attgcgt 

<210> 953 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> . 

<223> Detection oligonucleotide for WBPll 

<400> 953 

gagtttgttt tattgtgt 

<210> 954 
<211> 17 
<212> DNA 

<213> Artificial Sequence 

<220> , . , r 

<223> Detection oligonucleotide for CGB 

<400> 954 

ttacgtcgtg gttttta 

<210> 955 

<211> 18 

<212> DNA 

<213> Artificial sequence 

<220> . ^ 

<223> Detection oligonucleotide for CGB 

<400> 955 

ttatgttgtg gtttttag 

<210> 956 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<220> . ^ 

<223> Detection oligonucleotide for CGB 

<400> 956 

ggcgtgaatt tcgtgg 

<210> 957 

<211> 17 

<212> DNA 

<213> Artificial sequence 

<220> , ^ 

<223> Detection oligonucleotide for CGB 

<400> 957 

ggtgtgaatt ttgtggt 

<210> 958 
<211> 17 
<212> DNA 

<213> Artificial sequence 
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<220> seq_prot.txt 
<223> Detection oligonucleotide for CGB 
<400> 958 

tttcgagttt attcggt 

<210> 959 
<211> IS 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CGB 
<400> 959 

ttttgagttt atttggtt 

<210> 960 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CGB 
<400> 960 

ttatcgcgat gtgcgt 

<210> 961 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for CGB 
<400> 961 
attattgtga tgtgtgt 

<210> 962 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for ESRl 
<400> 962 

tgcggttgta tacgtag 

<210> 963 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ESRl 
<400> 963 

tgtgtggttg tatatgt 
<210> 964 

_______ 



s eq_prot.txt 

<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> _ . , _ ^ 

<223> Detection oligonucleotide for ESRl 

<400> 964 

ttcgtgttag atttcgatat 

<210> 965 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> . , _ 

<223> Detection oligonucleotide for ESRl 

<400> 965 

tttgtgttag attttgatat 

<210> 966 
<211> 16 
<212> DNA 

<213> Artificial sequence 

<220> . , . _ 

<223> Detection oligonucleotide for ESRl 

<400> 966 
aacgcgaaag acggat 

<210> 967 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> . , 

<223> Detection oligonucleotide for ESRl 

<400> 967 

ataaatgtga aagatgga 

<210> 968 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ESRl 
<400> 968 
gggcgtacga ggattt 

<210> 969 
<211> 16 
<212> DNA 

<213> Artificial sequence 

<223> Detection oligonucleotide for ESRl 
<400> 969 
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gggtgtatga ggattt 

<210> 970 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PITX2 
<400> 970 

agtcgggaga gcgaaa 

<210> 971 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PITX2 
<400> 971 

agttgggaga gtgaaa 

<210> 972 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PITX2 
<400> 972 

aagagtcggg agtcgga 

<210> 973 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for PITX2 
<400> 973 

aagagttggg agttgga 

<210> 974 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for pitx2 
<400> 974 

ggtcgaagag tcggga 

<210> 975 
<211> 16 
<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Detection oligonucleotide for PITX2 
<400> 975 
ggttgaagag ttggga 

<210> 976 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for PITX2 

<400> 976 

atgttagcgg gtcgaa 

<210> 977 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for PITX2 

<400> 977 

tagtgggttg aagagt 

<210> 978 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PCAF 

<400> 978 

gagcggtagg tgtcgaa 

<210> 979 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for PCAF 
<400> 979 

gagtggtagg tgttgaa 

<210> 980 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PCAF 
<400> 980 
taagatttcg cgggta 
<2l6> 981 
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<211> 18 seq_prot.txt 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for pcaf 
<400> 981 

tgtaagattt tgtgggta 

<210> 982 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for pcaf 
<400> 982 
agttcgtagt ttcgag 

<210> 983 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PCAF 
<400> 983 

gtttgtagtt ttgagga 

<210> 984 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for pcaf 
<400> 984 
tagggcgcgg agtaga 

<210> 985 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PCAF 
<400> 985 

tagggtgtgg agtaga 

<210> 986 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for pcaf 
<400> 986 



seq_prot.txt 

agcgtcggta cgtata 

<210> 987 
<211> 16 
<212> DNA 

<213> Artificial Sequence 



<223> Detection oligonucleotide for PCAF 

<400> 987 

ggtagtgttg gtatgt 

<210> 988 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TBC1D3 

<400> 988 

tattcgcggg cggttt 

<210> 989 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TBC1D3 

<400> 989 

tagtatttgt gggtgg 

<210> 990 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for TBC1D3 
<400> 990 
attcggcggg agatta 

<210> 991 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for TBC1D3 

<400> 991 

agtaaatttg gtggga 

<210> 992 
<211> 17 
<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Detection oligonucleotide for TBC1D3 
<400> 992 

agattagtcg aaagagt 

<2i0> 993 
<211> IS 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TBC1D3 
<400> 993 

gagattagtt gaaagagt 

<210> 994 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for TBC1D3 
<400> 994 

tatatttcgg ggttttaa 

<210> 995 
<211> 19 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for TBC1D3 
<400> 995 

tatattttgg ggttttaaa 

<210> 996 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ABCA8 
<400> 996 

atttggtttc gaagttt 

<210> 997 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for ABCA8 
<400> 997 

tatttggttt tgaagttt 
<210> 998 
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<211> 16 
<212> DNA 

<213> Artificial Sequence 

<220> - 

<223> Detection oligonucleotide for ABCA8 

<400> 998 

ttttcggaat tcgggt 

<210> 999 
<2U> 17 
<212> DNA 

<213> Artificial Sequence 
^ 2 2 

<223> Detection oligonucleotide for ABCA8 
<400> 999 

ttttggaatt tgggtgt 

<210> 1000 
<211> 17 
<212> DNA 

<213> Artificial sequence 

<220> . . _ 

<223> Detection oligonucleotide for ABCAo 

<400> 1000 
tttcggtttt taacggt 

<210> 1001 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

<220> , _ A0 ^ AO 

<223> Detection oligonucleotide for ABCAo 

<400> 1001 
ttttggtttt taatggtg 

<210> 1002 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
2 2 0^^ 

<223> Detection oligonucleotide for ABCA8 
<400> 1002 
aaaatttacg agggga 

<210> 1003 
<211> 18 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for ABCA8 
<400> 1003 
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ttaaaattta tgagggga 

<210> 1004 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for BCL6 
<400> 1004 

atttcgaaat atgtcgg 

<210> 1005 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for BCL6 
<400> 1005 

attttgaaat atgttggt 

<210> 1006 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for BCL6 
<400> 1006 

attcgagacg ttttgt 

<210> 1007 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for BCL6 
<400> 1007 

tttgagatgt tttgttta 

<210> 1008 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for BCL6 
<400> 1008 

ttcgagtttc gaatcgg 

<210> 1009 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
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<220> _ . . _ c 

<223> Detection oligonucleotide for BCLb 

<400> 1009 
tttgagtttt gaattgga 

<210> 1010 
<211> 16 
<212> DNA 

<213> Artificial Sequence 

<220> . . , _ 

<223> Detection oligonucleotide for BCLb 

<400> 1010 
atagcgaagg cgtcga 

<210> 1011 
<211> 17 
<212> DNA 

<213> Artificial sequence 

<220> „ . 4 r - 

<223> Detection oligonucleotide for BCLb 

<400> 1011 
tatagtgaag gtgttga 

<210> 1012 

<211> 16 

<212> DNA 

<213> Artificial sequence 

<220> . J ^ 

<223> Detection oligonucleotide for STMNl 

<400> 1012 
ttaggcggtt cggatt 

<210> 1013 
<211> 16 
<212> DNA 

<213> Artificial sequence 

<220> . . _ 

<223> Detection oligonucleotide for STMNl 

<400> 1013 

ttaggtggtt tggatt 

<210> 1014 

<211> 16 

<212> DNA 

<213> Artificial sequence 

<220> . f ^ 

<223> Detection oligonucleotide for STMNl 

<400> 1014 
tatcggttcg ggaatt 
<210> 1015 
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<211> 17 seq_prot.txt 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for STMNl 
<400> 1015 

tattggtttg ggaattt 

<210> 1016 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for STMNl 
<400> 1016 

tttcgcgcgg aggtta 

<210> 1017 
<211> 16 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for STMNl 
<400> 1017 
ttttgtgtgg aggtta 

<210> 1018 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for STMNl 
<400> 1018 

ggtaagaacg tatatagt 

<210> 1019 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for STMNl 
<400> 1019 

tggtaagaat gtatatagt 

<210> 1020 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for STMNl 

< 400> 1Q20 
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tttcggttaa tgcgga 

<210> 1021 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for STMNl 

<400> 1021 

tttttggtta atgtgga 

<210> 1022 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for STMNl 

<400> 1022 

tacgttcgcg atttgt 

<210> 1023 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for stmnI 

<400> 1023 

agggttatgt ttgtga 

<210> 1024 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for STMNl 

<400> 1024 

gatacgtcgg tgtcgg 

<210> 1025 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for STMNl 

<400> 1025 

tgatatgttg gtgttgg 

<210> 1026 
<211> 16 
<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Detection oligonucleotide for stmnI 

<400> 1026 

ttacggcgag attatt 

<210> 1027 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for STMNl 
<400> 1027 

ttttatggtg agattattt 

<210> 1028 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for vtn 
<400> 1028 
ttcgggttcg cgaaag 

<210> 1029 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for vtn 
<400> 1029 
tttgggtttg tgaaag 

<210> 1030 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for VTN 
<400> 1030 

ttttgttcgc gttgaa 

<210> 1031 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for VTN 
<400> 1031 
ttgtttgtgt tgaagta 
<210> 1032 
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<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for VTN 

<400> 1032 

tgggtcgcga ggtagt 

<210> 1033 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for VTN 

<400> 1033 

tgggttgtga ggtagt 

<210> 1034 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for VTN 

<400> 1034 

ggtggtatcg attgat 

<210> 1035 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for VTN 

<400> 1035 

tggtggtatt gattgat 

<210> 1036 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for VTN 

<400> 1036 

ttcgatggcg gtttcga 

<210> 1037 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for VTN 
<400> 1037 
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tttgatggtg gttttga 

<210> 1038 

<211> 16 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for vtn 

<400> 1038 
tagtgattcg cgggga 

<210> 1039 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for vtn 

<400> 1039 

tagtgatttg tgggga 

<210> 1040 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for vtn 
<400> 1040 
ttatgtcgga ggatga 

<210> 1041 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for VTN 
<400> 1041 
attatgttgg aggatga 

<210> 1042 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for vtn 
<400> 1042 
atacggttta tgacgat 

<210> 1043 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Detection oligonucleotide for VTN 
<400> 1043 

atatggttta tgatgatgg 

<210> 1044 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for plau 

<400> 1044 

tatttgtcgc gttgat 

<210> 1045 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PLAU 

<400> 1045 

atttgttgtg ttgatga 

<210> 1046 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PLAU 

<400> 1046 

tgtaattcgg ggattt 

<210> 1047 
<211> 17 
<212> DNA 

<213> Artificial sequence 

* 

<220> 

<223> Detection oligonucleotide for PLAU 

<400> 1047 

ttgtaatttg gggattt 

<210> 1048 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PLAU 
<400> 1048 
aggaagtacg gagaat 
<210> 1049 
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<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for plau 

<400> 1049 

aggaagtatg gagaatt 

<210> 1050 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for PLAU 

<400> 1050 

ttcgttggag atcgcgt 

<210> 1051 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for plau 

<400> 1051 

tttgttggag attgtgt 

<210> 1052 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PLAU 

<400> 1052 

ttgcggaagt acgcgg 

<210> 1053 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for PLAU 

<400> 1053 

ttgtggaagt atgtgg 

<210> 1054 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for (chr. Ipl3.2) 
<400> 1054 
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taaattcgac gggttt 

<210> 1055 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for (Chr. Ipl3.2) 

<400> 1055 

atttgatggg tttttgt 

<210> 1056 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for Cchr. Ipl3.2) 

<400> 1056 

ttttcgttcg gcggag 

<210> 1057 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for Cchr. Ipl3.2) 

<400> 1057 

tttgtttggt ggaggtt 

<210> 1058 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for Cchr. Ipl3.2) 

<400> 1058 

ttcgcgttta tcgtgt 

<210> 1059 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for Cchr. Ipl3.2) 

<400> 1059 

tggtttgtgt ttattgt 

<210> 1060 
<211> 16 
<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Detection oligonucleotide for (chr. Ipl3.2) 
<400> 1060 
tttcgcggtt cgtagt 
61 

<211> 18 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for (chr. Ipl3.2) 
<400> 1061 

tttgtggttt gtagttta 

<210> 1062 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for GRB7 
<400> 1062 

atagtttcgt tatttgtat 

<210> 1063 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for GRB7 
<400> 1063 
ggtatagttt tgttatttg 

<210> 1064 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for GRB7 
<400> 1064 

tttagtacgg ggtgta 

<210> 1065 

<211> 17 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for GRB7 

<400> 1065 

ttttagtatg gggtgta 
<210> 1066 
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<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for GRB7 

<400> 1066 

ggcgttatag ttacgttt 

<210> 1067 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for GRB7 

<400> 1067 

gggtgttata gttatgtt 

<210> 1068 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for GRB7 

<400> 1068 

tgtttatcga aggtaga 

<210> 1069 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for GRB7 

<400> 1069 

tgtttattga aggtagaa 

<210> 1070 
<211> 16 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for Cchr. 8ql2.1) 

<400> 1070 

tattcgggtt tcgcga 

<210> 1071 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for (Chr. 8ql2.1) 
<400> 1071 

seite 551 



seq_prot . txt 

atttgggttt tgtgag 

<210> 1072 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for (Chr. 8ql2.1) 

<400> 1072 

tattgttacg cgtcga 

. <210> 1073 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for (chr. 8ql2 . 1) 

<400> 1073 

attgttatgt gttgattt 

<210> 1074 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for (chr. 8ql2 . 1) 
<400> 1074 
gacgtgtagg tcgtat 

<210> 1075 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for (chr. 8ql2.1) 

<400> 1075 

gatgtgtagg ttgtatt 

<210> 1076 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for (chr. 8ql2 . 1) 

<400> 1076 

ttcgggaacg attttt 

<210> 1077 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Detection oligonucleotide for (Chr. 8ql2.1) 

<400> 1077 

gggtttggga atgatt 

<210> 1078 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for COX7A2L 

<400> 1078 

ttgttcgaag atcgtt 

<210> 1079 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for COX7A2L 

<400> 1079 

gttgtttgaa gattgttt 

<210> 1080 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for COX7A2 L 

<400> 1080 

tagcgtaagg attcggt 

<210> 1081 
<211> 18 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for COX7A2L 

<400> 1081 

ttagtgtaag gatttggt 

<210> 1082 
<211> 18 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for COX7A2L 
<400> 1082 
agagttcggt ttttcgta 
<210> 1083 
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<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for COX7A2 L 

<400> 1083 

agagtttggt tttttgta 

<210> 1084 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for COX7A2L 

<400> 1084 

attcgtattt gcgggtta 

<210> 1085 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Detection oligonucleotide for C0X7A2L 

<400> 1085 

atttgtattt gtgggtta 

<210> 1086 
<211> 16 
<212> DNA 
. <213> Artificial Sequence 

<220> 

<223> Detection oligonucleotide for COX7A2L 
<400> 1086 
aattcgatcg cgggta 

<210> 1087 
<211> 17 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Detection oligonucleotide for COX7A2 L 
<400> 1087 
atttgattgt gggtaga 



